Update README.md

cocoindex-dev · web-flow · commit 2c6f97868b14 · 2025-05-09T13:43:07.000-07:00
diff --git a/README.md b/README.md
@@ -17,32 +17,32 @@
 [![Discord](https://img.shields.io/discord/1314801574169673738?logo=discord&color=5B5BD6&logoColor=white)](https://discord.com/invite/zpA9S2DR7s)
 </div>
 
-CocoIndex is ultra performant data transformation framework, core engine written in Rust. The problem it tries to solve is to make it easy to prepare fresh data for AI - either embedding, knowledge graph, or a series of data transformation - and take the real-time data pipeline beyond traditional SQL.
+CocoIndex is an ultra-performant data transformation framework, with its core engine written in Rust. It aims to make it easy to prepare fresh data for AI - whether creating embeddings, building knowledge graphs, or performing other data transformations - and to take real-time data pipelines beyond traditional SQL.
 
 <p align="center">
     <img src="https://cocoindex.io/images/cocoindex-features.png" alt="CocoIndex Features" width="500">
 </p>
 
-The philosophy is to have the framework handle the source updates, and having developers only focus on defining a series of data transformation, inspired by spreadsheet.
+The philosophy is to have the framework handle source updates, and to have developers focus only on defining a series of data transformations, inspired by spreadsheets.
 
 ## Data Flow programming
-CocoIndex follows [Data flow](https://en.wikipedia.org/wiki/Dataflow_programming) programming model. Compare with traditional orchestration framework, where data is opaque. In CocoIndex data and data operation are first class citizen, and there's no side effects for each data operation. All data are observable in each transformation, with lineage out of the box.
+Unlike a workflow orchestration framework, where data is usually opaque, in CocoIndex data and data operations are first-class citizens. CocoIndex follows the [Dataflow](https://en.wikipedia.org/wiki/Dataflow_programming) programming model. Each transformation creates a new field based solely on input fields, without hidden state or value mutation. All data before and after each transformation is observable, with lineage out of the box.
 
-Particularly, user don't define data operations like creation, update, deletion. But rather, they define something like - for a set of source data, this is the transformation or formula. The framework takes care of the data operations like when to create, update, or delete. For example:
+In particular, users don't define data operations like creation, update, or deletion. Rather, they define something like: for a set of source data, this is the transformation or formula. The framework takes care of the data operations, such as when to create, update, or delete. For example:
 
 ```python
-// ingest
+# import
 data['content'] = flow_builder.add_source(...) 
 
-// transform
+# transform
 data['out'] = data['content'] 
     .transform(...)
     .transform(...)
 
-// collect data
+# collect data
 collector.collect(...)
 
-// export to db, vector db, graph db ...
+# export to db, vector db, graph db ...
 collector.export(...)
 ```
 
@@ -54,8 +54,9 @@ As a data framework, CocoIndex takes it to the next level on data freshness. **I
 </p>
 
 The framework takes care of
-- Change data capture
-- Figuring out what exactly needs to be updated, and only updating that without having to recompute everything throughout.
+- Change data capture.
+- Figuring out exactly what needs to be updated, and updating only that without having to recompute everything.
+  
 This makes it fast to reflect any source updates in the target store. If you are concerned about surfacing stale data to AI agents and are spending a lot of effort on infrastructure to optimize latency, the framework handles it for you.