Skip to content

Commit dbfc0c6

Browse files
authored
Update README.md
1 parent f8742b9 commit dbfc0c6

File tree

1 file changed

+42
-244
lines changed

1 file changed

+42
-244
lines changed

README.md

Lines changed: 42 additions & 244 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
<h1 align="center">Databend: The Next-Gen Cloud [Data+AI] Analytics</h1>
2-
<h2 align="center">The open-source, on-premise alternative to Snowflake</h2>
2+
<h2 align="center">One SQL for All Data: structured, semi-structured & unstructured multimodal data</h2>
33

44
<div align="center">
55

@@ -8,20 +8,13 @@
88
<a href="https://docs.databend.com/">Documentation</a> |
99
<a href="https://benchmark.clickhouse.com/">Benchmarking</a> |
1010
<a href="https://github.com/databendlabs/databend/issues/11868">Roadmap (v1.3)</a>
11-
1211
</h4>
1312

1413
<div>
1514
<a href="https://link.databend.com/join-slack">
1615
<img src="https://img.shields.io/badge/slack-databend-0abd59?logo=slack" alt="slack" />
1716
</a>
1817

19-
<a href="https://link.databend.com/join-feishu">
20-
<img src="https://img.shields.io/badge/feishu-databend-0abd59" alt="feishu" />
21-
</a>
22-
23-
<br>
24-
2518
<a href="https://github.com/databendlabs/databend/actions/workflows/release.yml">
2619
<img src="https://img.shields.io/github/actions/workflow/status/datafuselabs/databend/release.yml?branch=main" alt="CI Status" />
2720
</a>
@@ -37,245 +30,57 @@
3730

3831
<img src="https://github.com/databendlabs/databend/assets/172204/9997d8bc-6462-4dbd-90e3-527cf50a709c" alt="databend" />
3932

40-
## 🐋 Introduction
41-
42-
**Databend**, built in Rust, is an open-source cloud data warehouse that serves as a cost-effective [alternative to Snowflake](https://github.com/databendlabs/databend/issues/13059). With its focus on fast query execution and data ingestion, it's designed for complex analysis of the world's largest datasets.
43-
44-
**Production-Proven Scale:**
45-
- 🤝 **Enterprise Adoption**: Trusted by over **50 organizations** processing more than **100 million queries daily**
46-
- 🗄️ **Massive Scale**: Successfully managing over **800 petabytes** of analytical data
47-
48-
## ⚡ Performance
49-
50-
<div align="center">
51-
52-
[TPC-H Benchmark: Databend Cloud vs. Snowflake](https://docs.databend.com/guides/benchmark/tpch)
53-
54-
</div>
55-
56-
![Databend vs. Snowflake](https://github.com/databendlabs/wizard/assets/172204/d796acf0-0a66-4b1d-8754-cd2cd1de04c7)
57-
58-
<div align="center">
59-
60-
[Data Ingestion Benchmark: Databend Cloud vs. Snowflake](https://docs.databend.com/guides/benchmark/data-ingest)
61-
62-
</div>
63-
64-
![Databend vs. Snowflake](https://github.com/databendlabs/databend/assets/172204/c61d7a40-f6fe-4fb9-83e8-06ea9599aeb4)
33+
## The AI-Native Data Warehouse
6534

35+
Databend is the **open-source alternative to Snowflake** with **near 100% SQL compatibility** and native AI capabilities. Built in Rust with MPP architecture and S3-native storage, Databend unifies structured tables, JSON documents, and vector embeddings in a single platform. Trusted by **world-class enterprises** managing **800+ petabytes** and **100+ million queries daily**.
6636

67-
## 🚀 Why Databend
37+
## Key Features
6838

69-
- **Full Control**: Deploy on **cloud** or **on-prem** to suit your needs.
39+
**Performance & Scale**
40+
- **10x Faster**: Rust-powered vectorized execution with SIMD optimization
41+
- **90% Cost Reduction**: S3-native storage eliminates proprietary overhead
42+
- **Infinite Scale**: True compute-storage separation with elastic scaling
43+
- **Production-Proven**: Powers financial analytics, ML pipelines, and real-time AI inference
7044

71-
- **Blazing-Fast Performance**: Built with **Rust** for high-speed query execution. 👉 [ClickBench](https://databend.com/blog/clickbench-databend-top)
45+
**Enterprise Ready**
46+
- **Snowflake Compatible**: Migrate with zero SQL rewrites
47+
- **Multi-Cloud**: Deploy on AWS, Azure, GCP, or on-premise
48+
- **Security**: Role-based access, data masking, audit logging
49+
- **No Vendor Lock-in**: Complete data sovereignty and control
7250

73-
- **Cost-Effective**: Scalable architecture that boosts **performance** and reduces **costs**. 👉 [TPC-H](https://docs.databend.com/guides/benchmark/tpch)
51+
## Performance Benchmarks
7452

75-
- **AI-Enhanced Analytics**: Leverage built-in **[AI Functions](https://docs.databend.com/guides/ai-functions/)** for smarter data insights.
53+
[TPC-H Benchmark: Databend vs. Snowflake](https://docs.databend.com/guides/benchmark/tpch) | [Data Ingestion Benchmark](https://docs.databend.com/guides/benchmark/data-ingest) | [ClickBench Results](https://databend.com/blog/clickbench-databend-top)
7654

77-
- **Simplified ETL**: Direct **data ingestion** without the need for external ETL tools. 👉 [Data Loading](https://docs.databend.com/guides/load-data/)
78-
79-
- **Real-Time Data Updates**: Keep your analytics **up-to-date** with real-time incremental data updates. 👉 [Stream](https://docs.databend.com/guides/load-data/continuous-data-pipelines/stream)
80-
81-
- **Advanced Indexing**: Boost query performance with **[Virtual Column](https://docs.databend.com/guides/performance/virtual-column)**, **[Aggregating Index](https://docs.databend.com/guides/performance/aggregating-index)**, and **[Full-Text Index](https://docs.databend.com/guides/performance/fulltext-index)**.
82-
83-
- **ACID Compliance + Version Control**: Ensure reliable **transactions** with full ACID compliance and Git-like versioning.
84-
85-
- **Schema Flexibility**: Effortlessly handle **semi-structured data** with the flexible **[VARIANT](https://docs.databend.com/sql/sql-reference/data-types/variant)** data type.
86-
87-
- **Community-Driven Growth**: **Open-source** and continuously evolving with contributions from a global community.
88-
89-
90-
91-
## 📐 Architecture
55+
## Architecture
9256

9357
![Databend Architecture](https://github.com/databendlabs/databend/assets/172204/68b1adc6-0ec1-41d4-9e1d-37b80ce0e5ef)
9458

95-
## 🚀 Try Databend
96-
97-
### 1. Databend Serverless Cloud
98-
99-
The fastest way to try Databend, [Databend Cloud](https://databend.com)
100-
101-
### 2. Install Databend from Docker
102-
103-
Prepare the image (once) from Docker Hub (this will download about 170 MB data):
104-
105-
```shell
106-
docker pull datafuselabs/databend
107-
```
108-
109-
To run Databend quickly:
110-
111-
```shell
112-
docker run --net=host datafuselabs/databend
113-
```
114-
115-
## 🚀 Getting Started
116-
117-
<details>
118-
<summary>Connecting to Databend</summary>
119-
120-
- [Connecting to Databend with BendSQL](https://docs.databend.com/guides/sql-clients/bendsql)
121-
- [Connecting to Databend with JDBC](https://docs.databend.com/guides/sql-clients/jdbc)
59+
**Unified Foundation**: S3-native storage + MPP query engine + elastic compute clusters
12260

123-
</details>
61+
### Universal Data Processing by Type
62+
- **Structured**: Standard SQL with vectorized execution, ACID transactions, enterprise security, and BI integration
63+
- **Semi-Structured**: [VARIANT data type](https://docs.databend.com/sql/sql-reference/data-types/variant) with [virtual columns](https://docs.databend.com/guides/performance/virtual-column) for zero-config automatic JSON acceleration
64+
- **Unstructured**: [Vector data type](https://docs.databend.com/sql/sql-reference/data-types/vector) with HNSW indexing, [AI functions](https://docs.databend.com/sql/sql-functions/ai-functions/), and [full-text search](https://docs.databend.com/guides/performance/fulltext-index) for multimodal workloads
12465

125-
<details>
126-
<summary>Data Import and Export</summary>
66+
## Quick Start
12767

128-
- [How to load Parquet file into a table](https://docs.databend.com/guides/load-data/load-semistructured/load-parquet)
129-
- [How to export a table to Parquet file](https://docs.databend.com/guides/unload-data/unload-parquet)
130-
- [How to load CSV file into a table](https://docs.databend.com/guides/load-data/load-semistructured/load-csv)
131-
- [How to export a table to CSV file](https://docs.databend.com/guides/unload-data/unload-csv)
132-
- [How to load TSV file into a table](https://docs.databend.com/guides/load-data/load-semistructured/load-tsv)
133-
- [How to export a table to TSV file](https://docs.databend.com/guides/unload-data/unload-tsv)
134-
- [How to load NDJSON file into a table](https://docs.databend.com/guides/load-data/load-semistructured/load-ndjson)
135-
- [How to export a table to NDJSON file](https://docs.databend.com/guides/unload-data/unload-ndjson)
136-
- [How to load ORC file into a table](https://docs.databend.com/guides/load-data/load-semistructured/load-orc)
68+
### Cloud
69+
[Start with Databend Cloud](https://docs.databend.com/guides/cloud/) - Production-ready in 60 seconds
13770

138-
</details>
71+
### Self-Hosted
72+
[Installation Guide](https://docs.databend.com/guides/deploy/QuickStart/) - Deploy anywhere with full control
13973

140-
<details>
141-
<summary>Loading Data From Other Databases</summary>
74+
### Connect
75+
[BendSQL CLI](https://docs.databend.com/guides/sql-clients/bendsql) | [Developers Guide](https://docs.databend.com/guides/sql-clients/developers/)
14276

143-
- [How to Sync Full and Incremental MySQL Changes into Databend](https://docs.databend.com/guides/load-data/load-db/debezium)
144-
- [How to Sync Full and Incremental PostgreSQL Changes into Databend](https://docs.databend.com/guides/load-data/load-db/flink-cdc)
145-
- [How to Sync Full and Incremental Oracle Changes into Databend](https://docs.databend.com/guides/load-data/load-db/flink-cdc)
77+
## Products
14678

147-
</details>
79+
- **Open Source**: 100% open source, complete data sovereignty
80+
- **[Databend Cloud](https://databend.com)**: Managed service with serverless autoscaling
81+
- **Enterprise**: Advanced governance, compliance, and support
14882

149-
<details>
150-
<summary>Querying Semi-structured Data</summary>
151-
152-
- [How to query directly on Parquet file](https://docs.databend.com/guides/load-data/transform/querying-parquet)
153-
- [How to query directly on CSV file](https://docs.databend.com/guides/load-data/transform/querying-csv)
154-
- [How to query directly on TSV file](https://docs.databend.com/guides/load-data/transform/querying-tsv)
155-
- [How to query directly on NDJSON file](https://docs.databend.com/guides/load-data/transform/querying-ndjson)
156-
- [How to query directly on ORC file](https://docs.databend.com/guides/load-data/transform/querying-orc)
157-
</details>
158-
159-
<details>
160-
<summary>Visualize Tools with Databend</summary>
161-
162-
- [Deepnote](https://docs.databend.com/guides/visualize/deepnote)
163-
- [Grafana](https://docs.databend.com/guides/visualize/grafana)
164-
- [Jupyter Notebook](https://docs.databend.com/guides/visualize/jupyter)
165-
- [Metabase](https://docs.databend.com/guides/visualize/metabase)
166-
- [MindsDB](https://docs.databend.com/guides/visualize/mindsdb)
167-
- [Redash](https://docs.databend.com/guides/visualize/redash)
168-
- [Superset](https://docs.databend.com/guides/visualize/superset)
169-
- [Tableau](https://docs.databend.com/guides/visualize/tableau)
170-
171-
</details>
172-
173-
<details>
174-
<summary>Managing Users</summary>
175-
176-
- [How to Create a User](https://docs.databend.com/sql/sql-commands/ddl/user/user-create-user)
177-
- [How to Grant Privileges to a User](https://docs.databend.com/sql/sql-commands/ddl/user/grant#granting-privileges)
178-
- [How to Revoke Privileges from a User](https://docs.databend.com/sql/sql-commands/ddl/user/revoke#revoking-privileges)
179-
- [How to Create a Role](https://docs.databend.com/sql/sql-commands/ddl/user/user-create-role)
180-
- [How to Grant Privileges to a Role](https://docs.databend.com/sql/sql-commands/ddl/user/grant#granting-role)
181-
- [How to Grant Role to a User](https://docs.databend.com/sql/sql-commands/ddl/user/grant)
182-
- [How to Revoke the Role of a User](https://docs.databend.com/sql/sql-commands/ddl/user/revoke#revoking-role)
183-
</details>
184-
185-
<details>
186-
<summary>Managing Databases</summary>
187-
188-
- [How to Create a Database](https://docs.databend.com/sql/sql-commands/ddl/database/ddl-create-database)
189-
- [How to Drop a Database](https://docs.databend.com/sql/sql-commands/ddl/database/ddl-drop-database)
190-
</details>
191-
192-
<details>
193-
<summary>Managing Tables</summary>
194-
195-
- [How to Create a Table](https://docs.databend.com/sql/sql-commands/ddl/table/ddl-create-table)
196-
- [How to Drop a Table](https://docs.databend.com/sql/sql-commands/ddl/table/ddl-drop-table)
197-
- [How to Rename a Table](https://docs.databend.com/sql/sql-commands/ddl/table/ddl-rename-table)
198-
- [How to Truncate a Table](https://docs.databend.com/sql/sql-commands/ddl/table/ddl-truncate-table)
199-
- [How to Flash Back a Table](https://docs.databend.com/sql/sql-commands/ddl/table/flashback-table)
200-
- [How to Add/Drop Table Column](https://docs.databend.com/sql/sql-commands/ddl/table/alter-table-column)
201-
</details>
202-
203-
<details>
204-
<summary>Managing Data</summary>
205-
206-
- [COPY-INTO](https://docs.databend.com/sql/sql-commands/dml/dml-copy-into-table)
207-
- [INSERT](https://docs.databend.com/sql/sql-commands/dml/dml-insert)
208-
- [DELETE](https://docs.databend.com/sql/sql-commands/dml/dml-delete-from)
209-
- [UPDATE](https://docs.databend.com/sql/sql-commands/dml/dml-update)
210-
- [REPLACE](https://docs.databend.com/sql/sql-commands/dml/dml-replace)
211-
- [MERGE-INTO](https://docs.databend.com/sql/sql-commands/dml/dml-merge)
212-
</details>
213-
214-
<details>
215-
<summary>Managing Views</summary>
216-
217-
- [How to Create a View](https://docs.databend.com/sql/sql-commands/ddl/view/ddl-create-view)
218-
- [How to Drop a View](https://docs.databend.com/sql/sql-commands/ddl/view/ddl-drop-view)
219-
- [How to Alter a View](https://docs.databend.com/sql/sql-commands/ddl/view/ddl-alter-view)
220-
</details>
221-
222-
<details>
223-
<summary>AI Functions</summary>
224-
225-
- [Generating SQL with AI](https://docs.databend.com/sql/sql-functions/ai-functions/ai-to-sql)
226-
- [Creating Embedding Vectors](https://docs.databend.com/sql/sql-functions/ai-functions/ai-embedding-vector)
227-
- [Text Completion with AI](https://docs.databend.com/sql/sql-functions/ai-functions/ai-text-completion)
228-
- [Vector Distance](https://docs.databend.com/sql/sql-functions/vector-distance-functions/)
229-
</details>
230-
231-
<details>
232-
<summary>Data Management</summary>
233-
234-
- [Data Lifecycle in Databend](https://docs.databend.com/guides/data-management/data-lifecycle)
235-
- [Data Recovery in Databend](https://docs.databend.com/guides/data-management/data-recovery)
236-
- [Data Protection in Databend](https://docs.databend.com/guides/data-management/data-protection)
237-
- [Data Purge in Databend](https://docs.databend.com/guides/data-management/data-recycle)
238-
239-
</details>
240-
241-
<details>
242-
<summary>Accessing Data Lake</summary>
243-
244-
- [Apache Hive](https://docs.databend.com/guides/access-data-lake/hive)
245-
- [Apache Iceberg](https://docs.databend.com/guides/access-data-lake/iceberg/)
246-
- [Delta Lake](https://docs.databend.com/guides/access-data-lake/delta)
247-
248-
</details>
249-
250-
<details>
251-
<summary>Security</summary>
252-
253-
- [Access Control](https://docs.databend.com/guides/security/access-control)
254-
- [Masking Policy](https://docs.databend.com/guides/security/masking-policy)
255-
- [Network Policy](https://docs.databend.com/guides/security/network-policy)
256-
- [Password Policy](https://docs.databend.com/guides/security/password-policy)
257-
258-
</details>
259-
260-
<details>
261-
<summary>Performance</summary>
262-
263-
- [Review Clickbench](https://databend.com/blog/clickbench-databend-top)
264-
- [TPC-H Benchmark: Databend Cloud vs. Snowflake](https://docs.databend.com/guides/benchmark/tpch)
265-
- [Databend vs. Snowflake: Data Ingestion Benchmark](https://docs.databend.com/guides/benchmark/data-ingest)
266-
267-
</details>
268-
269-
## 🤝 Contributing
270-
271-
Databend thrives on community contributions! Whether it's through ideas, code, or documentation, every effort helps in enhancing our project. As a token of our appreciation, once your code is merged, your name will be eternally preserved in the **system.contributors** table.
272-
273-
Here are some resources to help you get started:
274-
275-
- [Building Databend From Source](https://docs.databend.com/developer/community/contributor/building-from-source)
276-
- [The First Good Pull Request](https://docs.databend.com/developer/community/contributor/good-pr)
277-
278-
## 👥 Community
83+
## Community
27984

28085
For guidance on using Databend, we recommend starting with the official documentation. If you need further assistance, explore the following community channels:
28186

@@ -284,24 +89,17 @@ For guidance on using Databend, we recommend starting with the official document
28489
- [Twitter](https://twitter.com/DatabendLabs/) (Get the news fast)
28590
- [I'm feeling lucky](https://link.databend.com/i-m-feeling-lucky) (Pick up a good first issue now!)
28691

287-
## 🛣️ Roadmap
288-
289-
Stay updated with Databend's development journey. Here are our roadmap milestones:
290-
291-
- [Roadmap 2025](https://github.com/databendlabs/databend/issues/14167)
292-
293-
## 📜 License
294-
295-
Databend is released under a combination of two licenses: the [Apache License 2.0](licenses/Apache-2.0.txt) and the [Elastic License 2.0](licenses/Elastic.txt).
92+
**Your merged code gets you into the `system.contributors` table. Forever.**
29693

297-
When contributing to Databend, you can find the relevant license header in each file.
94+
## Roadmap & License
29895

299-
For more information, see the [LICENSE](LICENSE) file and [Licensing FAQs](https://docs.databend.com/guides/products/dee/license).
96+
- **Roadmap**: [2025 Development Plan](https://github.com/databendlabs/databend/issues/14167)
97+
- **License**: [Apache License 2.0](licenses/Apache-2.0.txt) + [Elastic License 2.0](licenses/Elastic.txt) | [Licensing FAQs](https://docs.databend.com/guides/products/dee/license)
30098

301-
## 🙏 Acknowledgement
99+
## Acknowledgement
302100

303-
- **Inspiration**: Databend's design draws inspiration from industry leaders [ClickHouse](https://github.com/clickhouse/clickhouse) and [Snowflake](https://docs.snowflake.com/en/user-guide/intro-key-concepts.html#snowflake-architecture).
101+
**Inspiration**: [ClickHouse](https://github.com/clickhouse/clickhouse) and [Snowflake](https://docs.snowflake.com/en/user-guide/intro-key-concepts.html#snowflake-architecture) | **Foundation**: Apache Arrow | **Hosting**: [Vercel](https://vercel.com/?utm_source=databend&utm_campaign=oss)
304102

305-
- **Computing Model**: Our computing foundation is built upon apache arrow.
103+
---
306104

307-
- **Documentation Hosting**: The [Databend documentation website](https://docs.databend.com) proudly runs on [Vercel](https://vercel.com/?utm_source=databend&utm_campaign=oss).
105+
*Built by engineers who redefine what's possible with data.*

0 commit comments

Comments
 (0)