Enhance README with improved Kafka description #21124

aditya0589 · 2025-12-11T06:02:09Z

This commit updates the existing Kafka definition to a more precise, technically grounded, and documentation-ready description. The new version provides clearer context on Kafka’s purpose, core capabilities, and role in modern data systems, improving onboarding for new contributors and enhancing the overall readability of our documentation.

Motivation
The previous definition, while correct, lacked depth and did not fully convey Kafka’s strengths as a distributed event-streaming platform. Clear and accurate documentation is essential for both internal developers and external users evaluating or onboarding to the project. This improvement ensures the definition better reflects Kafka’s architectural guarantees scalability, durability, fault tolerance and aligns with industry-standard terminology.

What’s Changed

Expanded the definition to emphasize real-time streaming, data ingestion, and distribution.
Clarified Kafka’s operational guarantees (high throughput, durability, fault tolerance).
Highlighted relevant use cases including data pipelines, streaming analytics, and event-driven architectures.

Benefits

Stronger first impression for new readers of the documentation.
Aligns our description with modern Kafka usage practices and best-in-class technical narratives.
Reduces ambiguity and sets a consistent conceptual foundation for further architectural explanations.

Helps future contributors by providing clearer context up front.

Updated the description of Apache Kafka for clarity and added an architecture image.

github-actions · 2025-12-19T03:35:24Z

A label of 'needs-attention' was automatically added to this PR in order to raise the
attention of the committers. Once this issue has been triaged, the triage label
should be removed to prevent this automation from happening again.

AndrewJSchofield

Thanks for the PR. I have left some comments.

AndrewJSchofield · 2026-01-02T15:17:41Z

README.md

-[**Apache Kafka**](https://kafka.apache.org) is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.
+[**Apache Kafka**](https://kafka.apache.org) is an open source, highly scalable, fault-tolerant, distributed event-streaming platform designed for real-time data ingestion, processing, and distribution. It enables applications to publish, store, and consume continuous streams of records with high throughput and durability, making it a core infrastructure component for building data pipelines, streaming analytics systems, and mission-critical, event-driven architectures.
+
+Acchitecture of Apache Kafka:


nit: "architecture" not "accitecture".

AndrewJSchofield · 2026-01-02T15:20:29Z

README.md

+[**Apache Kafka**](https://kafka.apache.org) is an open source, highly scalable, fault-tolerant, distributed event-streaming platform designed for real-time data ingestion, processing, and distribution. It enables applications to publish, store, and consume continuous streams of records with high throughput and durability, making it a core infrastructure component for building data pipelines, streaming analytics systems, and mission-critical, event-driven architectures.
+
+Acchitecture of Apache Kafka:
+<img width="772" height="562" alt="kafka drawio" src="https://github.com/user-attachments/assets/f1dda78b-0826-4408-92ba-6aef364f1b3a" />


This image is not helpful. It includes Zookeeper which was removed from Kafka in the 4.0 release. Personally, I would prefer not to include an image in the repo readme file.

README.md

aditya0589 · 2026-01-02T15:35:53Z

Fixed the changes requested. Corrected the typo and removed the image according to your preferance

Enhance README with improved Kafka description

b8acf5f

Updated the description of Apache Kafka for clarity and added an architecture image.

github-actions bot added triage PRs from the community docs small Small PRs labels Dec 11, 2025

github-actions bot added the needs-attention label Dec 19, 2025

AndrewJSchofield requested changes Jan 2, 2026

View reviewed changes

aditya0589 added 2 commits January 2, 2026 20:59

Fix typo in README and removed image

5e70a77

Merge branch 'apache:trunk' into patch-1

de95652

github-actions bot removed needs-attention triage PRs from the community labels Jan 3, 2026

AndrewJSchofield added the ci-approved label Jan 5, 2026

Merge branch 'apache:trunk' into patch-1

b0b0580

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enhance README with improved Kafka description #21124

Enhance README with improved Kafka description #21124

Uh oh!

aditya0589 commented Dec 11, 2025

Uh oh!

github-actions bot commented Dec 19, 2025

Uh oh!

AndrewJSchofield left a comment

Uh oh!

AndrewJSchofield Jan 2, 2026

Uh oh!

AndrewJSchofield Jan 2, 2026

Uh oh!

Uh oh!

aditya0589 commented Jan 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Enhance README with improved Kafka description #21124

Are you sure you want to change the base?

Enhance README with improved Kafka description #21124

Uh oh!

Conversation

aditya0589 commented Dec 11, 2025

Uh oh!

github-actions bot commented Dec 19, 2025

Uh oh!

AndrewJSchofield left a comment

Choose a reason for hiding this comment

Uh oh!

AndrewJSchofield Jan 2, 2026

Choose a reason for hiding this comment

Uh oh!

AndrewJSchofield Jan 2, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

aditya0589 commented Jan 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants