You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{es}} is a distributed datastore that ingests, indexes, and manages various types of data in near real-time, making them both searchable and analyzable. Built on Apache Lucene, {{es}} scales horizontally across multiple nodes to handle large data volumes while maintaining fast query performance.
12
+
{{es}} is a distributed data store that ingests, indexes, and manages diverse data types in near real time, making your data searchable and analyzable. Built on Apache Lucene, {{es}} scales horizontally across multiple nodes to handle large data volumes while maintaining fast query performance.
13
+
14
+
{{es}} enables you to make large amounts of data quickly searchable. Whether you’re building an e-commerce product search, implementing semantic search with AI, or analyzing log data, {{es}} provides a powerful foundation with efficient indexing and query capabilities.
11
15
12
-
At its core, {{es}} solves the problem of making large amounts of data quickly searchable. Whether you're building a product search for an e-commerce site, implementing semantic search with AI, or analyzing log data, {{es}} provides the foundation for these use cases through its powerful indexing and query capabilities.
{{es}} distributes data across multiple nodes in a cluster. Each node holds a portion of the data in shards, which are self-contained indexes that can be stored on any node.
20
+
{{es}} distributes data across multiple nodes within a cluster. Each node stores a portion of the data in shards, which are self-contained indexes that can reside on any node in the cluster.
17
21
18
-
This distribution enables:
22
+
The distributed {{es}} architecture enables the following:
19
23
20
-
* Horizontal scaling: Add more nodes to increase capacity
21
-
* High availability: Data is replicated across nodes to prevent loss
22
-
* Parallel processing: Queries execute across shards simultaneously
24
+
* Horizontal scaling - Add more nodes to increase capacity
25
+
* High availability - Maintained through data replication across nodes to prevent data loss
26
+
* Parallel processing - Queries execute across shards simultaneously to deliver fast performance
23
27
24
-
## Near real-time indexing [elasticsearch-near-real-time-indexing]
28
+
### Near real-time indexing [elasticsearch-near-real-time-indexing]
29
+
30
+
When you send documents to {{es}}, they become searchable within about one second. The near real-time capability makes {{es}} ideal for applications that require immediate data availability.
31
+
32
+
For example:
25
33
26
-
When you send documents to Elasticsearch, they become searchable within about one second. This near real-time capability makes Elasticsearch suitable for applications that require immediate data availability, such as:
34
+
* Live dashboards display currently collected system metrics
35
+
* Product catalogs that instantly update as inventory changes
36
+
* User-generated content that appears in search results the moment the content is created
27
37
28
-
* Live dashboards showing current system metrics
29
-
* Product catalogs that update as inventory changes
30
-
* User-generated content that appears in search results immediately
38
+
### Schema-on-write with dynamic mapping [elasticsearch-schema-on-write-with-dynamic-mapping]
31
39
32
-
## Schema-on-write with dynamic mapping [elasticsearch-schema-on-write-with-dynamic-mapping]
40
+
When you index documents, {{es}} automatically detects the field types. For example, when a document includes a `price` field with a `29.99` value, {{es}} infers that the value is a floating-point number. You can also define explicit mappings to control exactly how data is stored and indexed.
33
41
34
-
Elasticsearch automatically detects field types when you index documents. If you send a document with a price field containing 29.99, Elasticsearch infers it's a floating-point number. You can also define explicit mappings to control exactly how data is stored and indexed.
42
+
Mappings play a key role in the following:
35
43
36
-
Mappings are important for:
44
+
* Storage and query performance optimization
45
+
* Specific search feature enablement, such as autocomplete or geospatial search
46
+
* Data consistency across documents
37
47
38
-
* Optimizing storage and query performance
39
-
* Enabling specific search features (like autocomplete or geo-search)
{{es}} functions as a vector database for AI and {{ml}} applications, storing dense vector embeddings alongside traditional text and numeric data.
43
51
44
-
Elasticsearch serves as a vector database for AI and machine learning applications. It stores dense vector embeddings alongside traditional text and numeric data, enabling:
52
+
Vector capabilities enable the following:
45
53
46
-
* Semantic search: Find content by meaning rather than exact keywords
47
-
* Hybrid search: Combine keyword and vector search for best results
48
-
* RAG systems: Provide relevant context to large language models
54
+
* Semantic search - Find content based on meaning rather than exact keywords
55
+
* Hybrid search - Combine keyword and vector-based search results for greater accuracy
56
+
*Retrieval-augmented generation (RAG) systems - Provide relevant context to large language models
49
57
50
-
## How Elasticsearch works [how-elasticsearch-works]
58
+
## How {{es}} works [how-elasticsearch-works]
51
59
52
-
### Data flow [elasticsearch-data-flow]
60
+
To enable fast and scalable search, {{es}} ingests, analyzes, and indexes data so queries execute across shards and return results in milliseconds.
*Elasticsearch Serverless: Fully managed, auto-scaling deployment (recommended for new projects)
134
-
*Elastic Cloud: Managed Elasticsearch with more configuration control
135
-
* Self-managed: Install and operate Elasticsearch yourself (requires expertise)
152
+
***[{{serverless-full}}](/deploy-manage/deploy/elastic-cloud/serverless.md)** - Fully managed, auto-scaling deployment, which is recommended for new projects
153
+
***[{{ech}}](/deploy-manage/deploy/elastic-cloud/cloud-hosted.md)** - Managed {{es}} with more configuration control
154
+
***[Self-managed](/deploy-manage/deploy/self-managed.md)** - Install and operate {{es}} yourself, which requires expertise
136
155
137
156
### Cluster sizing [elasticsearch-cluster-sizing]
138
157
139
-
* Small deployments: 3-5 nodes for development and small production use cases
140
-
* Medium deployments: 10-20 nodes for moderate data volumes and query loads
141
-
* Large deployments: 50+ nodes for high-volume production systems
158
+
* Small deployments - 3-5 nodes for development and small production use cases
159
+
* Medium deployments - 10-20 nodes for moderate data volumes and query loads
160
+
* Large deployments - 50 or more nodes for high-volume production systems
142
161
143
162
### Data modeling best practices [elasticsearch-data-modeling-best-practices]
144
163
145
-
* One document type per index: Keep related data together
146
-
* Denormalize data: Include related information in documents to avoid "joins"
147
-
* Use appropriate field types: Match data types to query patterns
148
-
* Plan for growth: Consider time-based indices for logs and events
164
+
* One document type per index - Keep related data together
165
+
* Denormalize data - Include related information in documents to avoid joins
166
+
* Use appropriate field types - Match data types to query patterns
167
+
* Plan for growth - Consider time-based indices for logs and events
149
168
150
169
## Next steps [elasticsearch-next-steps]
151
170
152
-
Ready to try Elasticsearch? Here's how to get started:
171
+
Ready to try {{es}}? Here's how to get started:
153
172
154
-
* Get started with Elasticsearch - Run your first queries in 5 minutes
155
-
* Tutorial: Build a search application - Create a full-featured search experience
156
-
* Understanding Elasticsearch architecture - Deep dive into distributed systems concepts
173
+
*[Get started](/solutions/search/get-started.md) - Run your first queries in 5 minutes
174
+
% how* Tutorial: Build a search application - Create a full-featured search experience
175
+
*[Understanding {{es}} architecture](/deploy-manage/distributed-architecture.md) - Deep dive into distributed systems concepts
0 commit comments