Spark: Processing Web Logs
The target audience for this blog post is developers who are starting to work with Spark and Scala. In this blog post, we will write code...
Spark Introduction
This blog provides a high-level overview for developers who are starting with Apache Spark. Spark provides a...
Storm: Business Use Cases
This blog shares business use cases in which Storm can be used to process continuous streams of data in real time, along with its benefits....
Kafka Cluster Sizing
This post provides high-level guidelines on Kafka cluster sizing for a given use case. The most accurate way to perform Kafka cluster...
Understanding Yarn: Business Analogy
This blog post is targeted at folks working with Hadoop, to help them better understand the Yarn framework and make it easy to remember roles / processes and...
MapReduce: Design Patterns
Thank you to O’Reilly, Donald Miner & Adam Shook for the book MapReduce Design Patterns, which I have referred to throughout the post. The...