Not complete yet.
Kafka is a Distributed Streaming platform. Kafka™ is used for building real-time data pipelines and streaming apps. It is horizontally scalable, fault-tolerant, wicked fast, and runs in production in thousands of companies. The Kafka site said it has three main functionalities:
PUBLISH & SUBSCRIBEto streams of data like a messaging systemPROCESSstreams of data efficiently and in real timeSTOREstreams of data safely in a distributed replicated cluster
References: