DATABASE FUNDAMENTALS
BASICS OF BIG DATA
Question
[CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
|
|
Chunks
|
|
Domains
|
|
Messages
|
|
Topics
|
Detailed explanation-1: -Kafka maintains feeds of messages in categories called topics. We’ll call processes that publish messages to a Kafka topic producers. We’ll call processes that subscribe to topics and process the feed of published messages consumers..
Detailed explanation-2: -A topic is a category or feed name to which messages are published. For each topic, the Kafka cluster maintains a partitioned log that looks like this: Each partition is an ordered, immutable sequence of messages that is continually appended to-a commit log.
Detailed explanation-3: -Messages can have any format, the most common are string, JSON, and Avro. The messages always have a key-value structure; a key or value can be null. If the producer does not indicate where to write the data, the broker uses the key to partition and replicate messages.
Detailed explanation-4: -A Kafka partition is a linearly ordered sequence of messages, where each message is identified by their index (called as offset). All the data in a Kafka cluster is the disjointed union of partitions. Incoming messages are written at the end of a partition and messages are sequentially read by consumers.