Skip to content

Conversation

@aditya0589
Copy link

This commit updates the existing Kafka definition to a more precise, technically grounded, and documentation-ready description. The new version provides clearer context on Kafka’s purpose, core capabilities, and role in modern data systems, improving onboarding for new contributors and enhancing the overall readability of our documentation.

Motivation
The previous definition, while correct, lacked depth and did not fully convey Kafka’s strengths as a distributed event-streaming platform. Clear and accurate documentation is essential for both internal developers and external users evaluating or onboarding to the project. This improvement ensures the definition better reflects Kafka’s architectural guarantees scalability, durability, fault tolerance and aligns with industry-standard terminology.

What’s Changed

  1. Expanded the definition to emphasize real-time streaming, data ingestion, and distribution.
  2. Clarified Kafka’s operational guarantees (high throughput, durability, fault tolerance).
  3. Highlighted relevant use cases including data pipelines, streaming analytics, and event-driven architectures.

Benefits

  1. Stronger first impression for new readers of the documentation.
  2. Aligns our description with modern Kafka usage practices and best-in-class technical narratives.
  3. Reduces ambiguity and sets a consistent conceptual foundation for further architectural explanations.

Helps future contributors by providing clearer context up front.

Updated the description of Apache Kafka for clarity and added an architecture image.
@github-actions github-actions bot added triage PRs from the community docs small Small PRs labels Dec 11, 2025
@github-actions
Copy link

A label of 'needs-attention' was automatically added to this PR in order to raise the
attention of the committers. Once this issue has been triaged, the triage label
should be removed to prevent this automation from happening again.

Copy link
Member

@AndrewJSchofield AndrewJSchofield left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR. I have left some comments.

README.md Outdated
[**Apache Kafka**](https://kafka.apache.org) is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.
[**Apache Kafka**](https://kafka.apache.org) is an open source, highly scalable, fault-tolerant, distributed event-streaming platform designed for real-time data ingestion, processing, and distribution. It enables applications to publish, store, and consume continuous streams of records with high throughput and durability, making it a core infrastructure component for building data pipelines, streaming analytics systems, and mission-critical, event-driven architectures.

Acchitecture of Apache Kafka:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: "architecture" not "accitecture".

README.md Outdated
[**Apache Kafka**](https://kafka.apache.org) is an open source, highly scalable, fault-tolerant, distributed event-streaming platform designed for real-time data ingestion, processing, and distribution. It enables applications to publish, store, and consume continuous streams of records with high throughput and durability, making it a core infrastructure component for building data pipelines, streaming analytics systems, and mission-critical, event-driven architectures.

Acchitecture of Apache Kafka:
<img width="772" height="562" alt="kafka drawio" src="https://github.com/user-attachments/assets/f1dda78b-0826-4408-92ba-6aef364f1b3a" />
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This image is not helpful. It includes Zookeeper which was removed from Kafka in the 4.0 release. Personally, I would prefer not to include an image in the repo readme file.

@aditya0589
Copy link
Author

Fixed the changes requested. Corrected the typo and removed the image according to your preferance

@github-actions github-actions bot removed needs-attention triage PRs from the community labels Jan 3, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants