Kafka: a Distributed Messaging System for Log Processing - LinkedIn 2011

Not complete yet.

Kafka is a Distributed Streaming platform. Kafka™ is used for building real-time data pipelines and streaming apps. It is horizontally scalable, fault-tolerant, wicked fast, and runs in production in thousands of companies. The Kafka site said it has three main functionalities:

  • PUBLISH & SUBSCRIBE to streams of data like a messaging system
  • PROCESS streams of data efficiently and in real time
  • STORE streams of data safely in a distributed replicated cluster

References:

0%