Book Detail : Real-time Analytics with Storm and Cassandra

Book Title: 
Real-time Analytics with Storm and Cassandra
Resource Category: 
Author: 
Publisher: 
Publication Year: 
2 015
Number of Pages: 
220
ISBN: 
978-1-78439-549-0
Language: 
English
WishList: 
yes
Available at Shelf: 
No
Description: 

Solve real-time analytics problems effectively using Storm and Cassandra

Table of Contents (Summary): 
  1. Let's Understand Storm

  2. Getting Started with Your First Topology

  3. Understanding Storm Internals by Examples

  4. Storm in a Clustered Mode 

  5. Storm High Availability and Failover

  6. Adding NoSQL Persistence to Storm

  7. Cassandra Partitioning, High Availability, and Consistency

  8. Cassandra Management and Maintenance 

  9. Storm Management and Maintenance 

  10. Advance Concepts in Storm 

  11. Distributed Cache and CEP with Storm

Table of Contents (Expanded): 
  1. Let's Understand Storm

    • ​​Distributed computing problems 

      • Real-time business solution for credit or debit card fraud detection 

      • Aircraft Communications Addressing and Reporting system 

      • Healthcare  

      • Other applications 

    • Solutions for complex distributed use cases 

      • The Hadoop solution 

      • A custom solution  

      • Licensed proprietary solutions 

      • Other real-time processing tools 

    • A high-level view of various components of Storm 

    • Delving into the internals of Storm 

    • Quiz time 

  2. Getting Started with Your First Topology

    • ​​Prerequisites for setting up Storm 

    • Components of a Storm topology  

      • Spouts 

      • Bolts 

      • Streams  

      • Tuples – the data model in Storm  

    • Executing a sample Storm topology – local mode 

      • WordCount topology from the Storm-starter project 

    • Executing the topology in the distributed mode 

      • Set up Zookeeper (V 3.3.5) for Storm 

      • Setting up Storm in the distributed mode  

      • Launching Storm daemons

    • Executing the topology from Command Prompt 

      • Tweaking the WordCount topology to customize it  

    • Quiz time 

  3. Understanding Storm Internals by Examples

    • ​​Customizing Storm spouts  

      • Creating FileSpout 

        • Tweaking WordCount topology to use FileSpout 

        • The SocketSpout class 

    • Anchoring and acking 

      • The unreliable topology 

    • Stream groupings 

      • Local or shuffle grouping  

      • Fields grouping  

      • All grouping 

      • Global grouping 

      • Custom grouping 

      • Direct grouping  

    • Quiz time 

  4. Storm in a Clustered Mode 

    • ​​The Storm cluster setup 

    • Zookeeper configurations 

      • Cleaning up Zookeeper 

    • Storm configurations 

      • Storm logging configurations  

      • The Storm UI  

        • Section 1 

        • Section 2 

        • Section 3 

        • Section 4 

        • The visualization section  

    • Storm monitoring tools 

    • Quiz time 

  5. Storm High Availability and Failover

    • ​​An overview of RabbitMQ 

    • Installing the RabbitMQ cluster  

      • Prerequisites for the setup of RabbitMQ 

      • Setting up a RabbitMQ server 

      • Testing the RabbitMQ server  

        • Creating a RabbitMQ cluster 

        • Enabling the RabbitMQ UI 

        • Creating mirror queues for high availability  

    • Integrating Storm with RabbitMQ  

      • Creating a RabbitMQ feeder component  

      • Wiring the topology for the AMQP spout 

    • Building high availability of components 

      • High availability of the Storm cluster  

      • Guaranteed processing of the Storm cluster 

    • The Storm isolation scheduler 

    • Quiz time 

  6. Adding NoSQL Persistence to Storm

    • ​​The advantages of Cassandra 

    • Columnar database fundamentals 

      • Types of column families  

      • Types of columns  

    • Setting up the Cassandra cluster 

      • Installing Cassandra 

    • Multiple data centers  

      • Prerequisites for setting up multiple data centers 

      • Installing Cassandra data centers  

    • Introduction to CQLSH  

    • Introduction to CLI  

    • Using different client APIs to access Cassandra 

    • Storm topology wired to the Cassandra store 

    • The b est practices for Storm/Cassandra applications 

    • Quiz time 

  7. Cassandra Partitioning, High Availability, and Consistency

    • ​​Consistent hashing 

      • One or more node goes down 

      • One or more node comes back up 

    • Replication in Cassandra and strategies 

    • Cassandra consistency 

      • Write consistency 

      • Read consistency  

      • Consistency maintenance features 

    • Quiz Time

  8. Cassandra Management and Maintenance 

    • ​​Cassandra – gossip protocol  

      • Bootstrapping 

      • Failure scenario handling – detection and recovery 

    • Cassandra cluster scaling – adding a new node   

    • Cassandra cluster – replacing a dead node 

    • The replication factor 

    • The nodetool commands  

    • Cassandra fault tolerance 

    • Cassandra monitoring systems  

      • JMX monitoring  

      • Datastax OpsCenter 

    • Quiz time 

  9. Storm Management and Maintenance 

    • ​​Scaling the Storm cluster – adding new supervisor nodes 

    • Scaling the Storm cluster and rebalancing the topology 

      • Rebalancing using the GUI  

      • Rebalancing using the CLI  

    • Setting up workers and parallelism to enhance processing  

      • Scenario 1  

      • Scenario 2  

      • Scenario 3  

    • Storm troubleshooting  

      • The Storm UI  

      • Storm logs  

    • Quiz time 

  10. Advance Concepts in Storm 

    • ​​Building a Trident topology  

    • Understanding the Trident API 

      • Local partition manipulation operation  

        • Functions 

        • Filters  

        • partitionAggregate 

      • Operations related to stream repartitioning 

      • Data aggregations over the streams  

      • Grouping over a field in a stream 

      • Merge and join 

    • Examples and illustrations  

    • Quiz time 

  11. Distributed Cache and CEP with Storm

    • ​​The need for distributed caching in Storm  

    • Introduction to memcached 

      • Setting up memcache  

      • Building a topology with a cache 

    • Introduction to the complex event processing engine 

      • Esper 

      • Getting started with Esper 

      • Integrating Esper with Storm  

    • Quiz time 

 

Appendix: Quiz Answers 

Index

2.992565
Average: 3 (269 votes)

Search the Web

Custom Search

Searches whole web. Use the search in the right sidebar to search only within javajee.com!!!