What are important factors in sizing a ksqlDB cluster? (Choose three.)
Answer : A, B, C
The complexity of the schema (number of fields, data types, etc.) affects processing and memory usage.
Each ksqlDB persistent query consumes resources (CPU, memory), so more queries require more capacity.
More partitions increase parallelism, but also resource usage, especially in scaling and state management.
A customer has a use case for a ksqlDB persistent query. You need to make sure that duplicate messages are not processed and messages are not skipped.
Which property should you use?
Answer : A
processing.guarantee=exactly_once ensures that messages are processed exactly once by ksqlDB, preventing both duplicates and message loss.
Which connector type takes data from a topic and sends it to an external data system?
Answer : A
A Sink Connector reads data from a Kafka topic and writes it to an external data system, such as a database, file system, or cloud service.
How can load balancing of Kafka clients across multiple brokers be accomplished?
Answer : A
Partitions are the primary mechanism for achieving load balancing in Kafka. When a topic has multiple partitions, Kafka clients (producers and consumers) can distribute the load across brokers hosting these partitions.
Which secure communication is supported between the REST proxy and REST clients?
Answer : A
How does Kafka guarantee message integrity after a message is written on a disk?
Answer : B
Kafka ensures message immutability for data integrity. Once a message is written to a Kafka topic and persisted to disk, it cannot be modified. This immutability guarantees that consumers always receive the original message content, which is critical for auditability, fault tolerance, and data reliability.
You are managing a cluster with a large number of topics, and each topic has a lot of partitions. A team wants to significantly increase the number of partitions for some topics.
Which parameters should you check before increasing the partitions?
Answer : C
Each Kafka partition maps to multiple log segment files, and each segment results in open file descriptors on the broker. When the number of partitions increases significantly, it can exceed the OS-level limit for open files per broker process, leading to failures or degraded performance. Therefore, it is essential to check and possibly increase the ulimit -n (max open files) setting on the broker machines.