You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/reference/intro.asciidoc
+71-61Lines changed: 71 additions & 61 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,42 +1,70 @@
1
1
[[elasticsearch-intro]]
2
2
== What is {es}?
3
-
_**You know, for search (and analysis)**_
4
-
5
-
{es} is the distributed search and analytics engine at the heart of
6
-
the {stack}. {ls} and {beats} facilitate collecting, aggregating, and
7
-
enriching your data and storing it in {es}. {kib} enables you to
8
-
interactively explore, visualize, and share insights into your data and manage
9
-
and monitor the stack. {es} is where the indexing, search, and analysis
10
-
magic happens.
11
-
12
-
{es} provides near real-time search and analytics for all types of data. Whether you
13
-
have structured or unstructured text, numerical data, or geospatial data,
14
-
{es} can efficiently store and index it in a way that supports fast searches.
15
-
You can go far beyond simple data retrieval and aggregate information to discover
16
-
trends and patterns in your data. And as your data and query volume grows, the
17
-
distributed nature of {es} enables your deployment to grow seamlessly right
18
-
along with it.
19
-
20
-
While not _every_ problem is a search problem, {es} offers speed and flexibility
21
-
to handle data in a wide variety of use cases:
22
-
23
-
* Add a search box to an app or website
24
-
* Store and analyze logs, metrics, and security event data
25
-
* Use machine learning to automatically model the behavior of your data in real
26
-
time
27
-
* Use {es} as a vector database to create, store, and search vector embeddings
28
-
* Automate business workflows using {es} as a storage engine
29
-
* Manage, integrate, and analyze spatial information using {es} as a geographic
30
-
information system (GIS)
31
-
* Store and process genetic data using {es} as a bioinformatics research tool
32
-
33
-
We’re continually amazed by the novel ways people use search. But whether
34
-
your use case is similar to one of these, or you're using {es} to tackle a new
35
-
problem, the way you work with your data, documents, and indices in {es} is
36
-
the same.
3
+
4
+
{es-repo}[{es}] is a distributed search and analytics engine, scalable data store, and vector database built on Apache Lucene.
5
+
It's optimized for speed and relevance on production-scale workloads.
6
+
Use {es} to search, index, store, and analyze data of all shapes and sizes in near real time.
7
+
8
+
[TIP]
9
+
====
10
+
{es} has a lot of features. Explore the full list on the https://www.elastic.co/elasticsearch/features[product webpage^].
11
+
====
12
+
13
+
{es} is the heart of the {estc-welcome-current}/stack-components.html[Elastic Stack] and powers the Elastic https://www.elastic.co/enterprise-search[Search], https://www.elastic.co/observability[Observability] and https://www.elastic.co/security[Security] solutions.
14
+
15
+
{es} is used for a wide and growing range of use cases. Here are a few examples:
16
+
17
+
* *Monitor log and event data*. Store logs, metrics, and event data for observability and security information and event management (SIEM).
18
+
* *Build search applications*. Add search capabilities to apps or websites, or build enterprise search engines over your organization's internal data sources.
19
+
* *Vector database*. Store and search vectorized data, and create vector embeddings with built-in and third-party natural language processing (NLP) models.
20
+
* *Retrieval augmented generation (RAG)*. Use {es} as a retrieval engine to augment Generative AI models.
21
+
* *Application and security monitoring*. Monitor and analyze application performance and security data effectively.
22
+
* *Machine learning*. Use {ml} to automatically model the behavior of your data in real-time.
23
+
24
+
This is just a sample of search, observability, and security use cases enabled by {es}.
25
+
Refer to our https://www.elastic.co/customers/success-stories[customer success stories] for concrete examples across a range of industries.
26
+
// Link to demos, search labs chatbots
27
+
28
+
[discrete]
29
+
[[elasticsearch-intro-elastic-stack]]
30
+
.What is the Elastic Stack?
31
+
*******************************
32
+
{es} is the core component of the Elastic Stack, a suite of products for collecting, storing, searching, and visualizing data.
33
+
https://www.elastic.co/guide/en/starting-with-the-elasticsearch-platform-and-its-solutions/current/stack-components.html[Learn more about the Elastic Stack].
34
+
*******************************
35
+
// TODO: Remove once we've moved Stack Overview to a subpage?
36
+
37
+
[discrete]
38
+
[[elasticsearch-intro-deploy]]
39
+
=== Deployment options
40
+
41
+
To use {es}, you need a running instance of the {es} service.
42
+
You can deploy {es} in various ways:
43
+
44
+
* <<run-elasticsearch-locally,*Local dev*>>. Get started quickly with a minimal local Docker setup.
45
+
* {cloud}/ec-getting-started-trial.html[*Elastic Cloud*]. {es} is available as part of our hosted Elastic Stack offering, deployed in the cloud with your provider of choice. Sign up for a https://cloud.elastic.co/registration[14 day free trial].
46
+
* {serverless-docs}/general/sign-up-trial[*Elastic Cloud Serverless* (technical preview)]. Create serverless projects for autoscaled and fully managed {es} deployments. Sign up for a https://cloud.elastic.co/serverless-registration[14 day free trial].
47
+
48
+
**Advanced deployment options**
49
+
50
+
* <<elasticsearch-deployment-options,*Self-managed*>>. Install, configure, and run {es} on your own premises.
51
+
* {ece-ref}/Elastic-Cloud-Enterprise-overview.html[*Elastic Cloud Enterprise*]. Deploy Elastic Cloud on public or private clouds, virtual machines, or your own premises.
52
+
* {eck-ref}/k8s-overview.html[*Elastic Cloud on Kubernetes*]. Deploy Elastic Cloud on Kubernetes.
53
+
54
+
[discrete]
55
+
[[elasticsearch-next-steps]]
56
+
=== Learn more
57
+
58
+
Here are some resources to help you get started:
59
+
60
+
* <<getting-started, Quickstart>>. A beginner's guide to deploying your first {es} instance, indexing data, and running queries.
61
+
* https://elastic.co/webinars/getting-started-elasticsearch[Webinar: Introduction to {es}]. Register for our live webinars to learn directly from {es} experts.
62
+
* https://www.elastic.co/search-labs[Elastic Search Labs]. Tutorials and blogs that explore AI-powered search using the latest {es} features.
63
+
** Follow our tutorial https://www.elastic.co/search-labs/tutorials/search-tutorial/welcome[to build a hybrid search solution in Python].
64
+
** Check out the https://github.com/elastic/elasticsearch-labs?tab=readme-ov-file#elasticsearch-examples--apps[`elasticsearch-labs` repository] for a range of Python notebooks and apps for various use cases.
37
65
38
66
[[documents-indices]]
39
-
=== Data in: documents and indices
67
+
=== Documents and indices
40
68
41
69
{es} is a distributed document store. Instead of storing information as rows of
42
70
columnar data, {es} stores complex data structures that have been serialized
@@ -65,8 +93,7 @@ behavior makes it easy to index and explore your data--just start
65
93
indexing documents and {es} will detect and map booleans, floating point and
66
94
integer values, dates, and strings to the appropriate {es} data types.
67
95
68
-
Ultimately, however, you know more about your data and how you want to use it
69
-
than {es} can. You can define rules to control dynamic mapping and explicitly
96
+
You can define rules to control dynamic mapping and explicitly
70
97
define mappings to take full control of how fields are stored and indexed.
71
98
72
99
Defining your own mappings enables you to:
@@ -89,7 +116,7 @@ used at search time. When you query a full-text field, the query text undergoes
89
116
the same analysis before the terms are looked up in the index.
90
117
91
118
[[search-analyze]]
92
-
=== Information out: search and analyze
119
+
=== Search and analyze
93
120
94
121
While you can use {es} as a document store and retrieve documents and their
95
122
metadata, the real power comes from being able to easily access the full suite
@@ -160,27 +187,8 @@ size 70 needles, you’re displaying a count of the size 70 needles
160
187
that match your users' search criteria--for example, all size 70 _non-stick
161
188
embroidery_ needles.
162
189
163
-
[discrete]
164
-
[[more-features]]
165
-
===== But wait, there’s more
166
-
167
-
Want to automate the analysis of your time series data? You can use
168
-
{ml-docs}/ml-ad-overview.html[machine learning] features to create accurate
169
-
baselines of normal behavior in your data and identify anomalous patterns. With
170
-
machine learning, you can detect:
171
-
172
-
* Anomalies related to temporal deviations in values, counts, or frequencies
173
-
* Statistical rarity
174
-
* Unusual behaviors for a member of a population
175
-
176
-
And the best part? You can do this without having to specify algorithms, models,
177
-
or other data science-related configurations.
178
-
179
190
[[scalability]]
180
-
=== Scalability and resilience: clusters, nodes, and shards
181
-
++++
182
-
<titleabbrev>Scalability and resilience</titleabbrev>
183
-
++++
191
+
=== Scalability and resilience
184
192
185
193
{es} is built to be always available and to scale with your needs. It does this
186
194
by being distributed by nature. You can add servers (nodes) to a cluster to
@@ -209,7 +217,7 @@ interrupting indexing or query operations.
209
217
210
218
[discrete]
211
219
[[it-depends]]
212
-
==== It depends...
220
+
==== Shard size and number of shards
213
221
214
222
There are a number of performance considerations and trade offs with respect
215
223
to shard size and the number of primary shards configured for an index. The more
@@ -237,7 +245,7 @@ testing with your own data and queries].
237
245
238
246
[discrete]
239
247
[[disaster-ccr]]
240
-
==== In case of disaster
248
+
==== Disaster recovery
241
249
242
250
A cluster's nodes need good, reliable connections to each other. To provide
243
251
better connections, you typically co-locate the nodes in the same data center or
@@ -257,11 +265,13 @@ secondary clusters are read-only followers.
257
265
258
266
[discrete]
259
267
[[admin]]
260
-
==== Care and feeding
268
+
==== Security, management, and monitoring
261
269
262
270
As with any enterprise system, you need tools to secure, manage, and
263
271
monitor your {es} clusters. Security, monitoring, and administrative features
264
272
that are integrated into {es} enable you to use {kibana-ref}/introduction.html[{kib}]
265
273
as a control center for managing a cluster. Features like <<downsampling,
266
274
downsampling>> and <<index-lifecycle-management, index lifecycle management>>
267
275
help you intelligently manage your data over time.
276
+
277
+
Refer to <<monitor-elasticsearch-cluster>> for more information.
Copy file name to clipboardExpand all lines: docs/reference/search/search-your-data/near-real-time.asciidoc
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
3
3
[[near-real-time]]
4
4
=== Near real-time search
5
-
The overview of <<documents-indices,documents and indices>> indicates that when a document is stored in {es}, it is indexed and fully searchable in _near real-time_--within 1 second. What defines near real-time search?
5
+
When a document is stored in {es}, it is indexed and fully searchable in _near real-time_--within 1 second. What defines near real-time search?
6
6
7
7
Lucene, the Java libraries on which {es} is based, introduced the concept of per-segment search. A _segment_ is similar to an inverted index, but the word _index_ in Lucene means "a collection of segments plus a commit point". After a commit, a new segment is added to the commit point and the buffer is cleared.
0 commit comments