adding reference architectures section #114653

georgewallace · 2024-10-11T19:15:03Z

No description provided.

github-actions · 2024-10-11T19:15:14Z

Documentation preview:

✨ Changed pages

elasticsearchmachine · 2024-10-11T19:16:06Z

Pinging @elastic/es-docs (Team:Docs)

…g. (elastic#114904)

…ploysDefaultElser elastic#114913

…dex mode (elastic#114573) Here we check for the existence of a `host.name` field in index sort settings when the index mode is `logsdb` and decide whether to inject the field in the mapping depending on whether it exists. By default `host.name` is required for sorting in LogsDB. Injecting the `host.name` field only when strictly required reduces the chance of errors at mapping or template composition time. A user who wants to override index sort settings without including a `host.name` field can do so without finding an additional, automatically injected `host.name` field in the mappings. If users override the sort settings and a `host.name` field is not included, we don't need to inject the field since sorting no longer requires it.

As a result of this change we have the following:

* the user does not provide any index sorting configuration: we are responsible for injecting the default sort fields and their mapping (for `logsdb`)
* the user explicitly provides non-empty index sorting configuration: the user is also responsible for providing correct mappings and we do not modify index sorting or mappings

Note also that all sort settings `index.sort.*` are `final`, which means doing this check once, when mappings are merged at template composition time, is enough.
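The two cases described above can be sketched as a small decision function. This is only an illustration under stated assumptions, not the Elasticsearch implementation: the function name `compose_logsdb`, the default sort fields `host.name` and `@timestamp`, and the `keyword` type for the injected field are all assumptions made for the example.

```python
from copy import deepcopy

# Assumed defaults for illustration; the commit message only states that
# `host.name` is required for the default LogsDB sort.
DEFAULT_SORT_FIELDS = ["host.name", "@timestamp"]


def compose_logsdb(settings, mappings):
    """Return (settings, mappings) after the hypothetical injection step."""
    settings = deepcopy(settings)
    mappings = deepcopy(mappings)

    if settings.get("index.mode") != "logsdb":
        return settings, mappings

    if not settings.get("index.sort.field"):
        # Case 1: no sort configuration from the user, so the defaults and
        # the mapping for `host.name` are injected.
        settings["index.sort.field"] = list(DEFAULT_SORT_FIELDS)
        properties = mappings.setdefault("properties", {})
        properties.setdefault("host.name", {"type": "keyword"})  # assumed type

    # Case 2: the user supplied a non-empty sort configuration, so neither
    # the sort settings nor the mappings are modified.
    return settings, mappings
```

Because the `index.sort.*` settings are final, a check like this only needs to run once, at template composition time.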

We will deprecate the `_source.mode` mapping level configuration in favor of the index-level `index.mapping.source.mode` setting. As a result, we go through the documentation and update it to reflect the introduction of the setting.
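As a rough illustration of what this change means for an index creation body, the sketch below moves a mapping-level `_source.mode` into the index-level `index.mapping.source.mode` setting. The helper name `move_source_mode_to_setting` is hypothetical and exists only for this example; it is not part of any Elasticsearch client.

```python
from copy import deepcopy


def move_source_mode_to_setting(index_body):
    """Rewrite a create-index body to use the index-level source mode setting."""
    body = deepcopy(index_body)
    source = body.get("mappings", {}).get("_source", {})
    mode = source.pop("mode", None)
    if mode is not None:
        # Keep an explicitly configured setting if one is already present.
        body.setdefault("settings", {}).setdefault("index.mapping.source.mode", mode)
        if not source:
            # Drop the now-empty `_source` mapping entry.
            body["mappings"].pop("_source", None)
    return body
```

For example, a body with `"mappings": {"_source": {"mode": "synthetic"}}` would come back with `"settings": {"index.mapping.source.mode": "synthetic"}` and no `_source` entry in the mappings.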

@anniegale9538

* (Doc+) link video for resolving shards too large 👋 howdy, team (cc: @anniegale9538 )! Playing forward elastic#111254, [this video](https://www.youtube.com/watch?v=sHyNYnwbYro) demonstrates an example resolving shards too large via reindex under [this section](https://www.elastic.co/guide/en/elasticsearch/reference/master/size-your-shards.html#shard-size-recommendation) as it's a top support ask. --------- Co-authored-by: Liam Thompson <[email protected]>

* (Doc+) Cross-link max shards 👋 It appears we have two docs of similar content about max open shards. This one contains the error users search for (so it is what we linked the error to in elastic#110993), but the other I believe is a placeholder doc for the health API code. We should maybe consolidate them some day, but in the meantime at least cross-link. --------- Co-authored-by: Liam Thompson <[email protected]>

@masseyke

@masseyke noticed this in his review of elastic#114847. I fixed it in the backport to `8.x` via elastic#114872, but this PR is needed to get the same fix into `main`.

These tests should be fixed and can be unmuted. The associated github issues have already been closed.

This links our six newest [Support Troubleshooting](https://www.youtube.com/playlist?list=PL_mJOmq4zsHbQlfEMEh_30_LuV_hZp-3d) videos, which cover resolving general ILM health issues and the top five ILM rollover errors, from the existing [Troubleshooting ILM errors](https://www.elastic.co/guide/en/elasticsearch/reference/master/index-lifecycle-error-handling.html) page. As a side quest, it also links the watermark error to [its troubleshooting doc](https://www.elastic.co/guide/en/elasticsearch/reference/master/fix-watermark-errors.html).

…ding requests (elastic#114870) * Make ESQL EnrichPolicyResolver try to do proper connection before sending requests * Make encureConnected be !skipUnavailable

With recent changes in Lucene 9.12 around not forking execution when not necessary (see apache/lucene#13472), we have removed the search worker thread pool in elastic#111099. The worker thread pool had an unbounded queue, and we feared that we could have much more queueing on the search thread pool if we execute segment level searches on the same thread pool as the shard level searches, because every shard search would take up to a thread per slice when executing the query phase. We then introduced an additional conditional to stop parallelizing when there is a queue. That is perhaps a bit extreme, as it's a decision made when creating the searcher, while a queue may no longer be there once the search is executing. This has caused some benchmark regressions, given that having a queue may be a transient scenario, especially with short-lived segment searches being queued up. We may end up disabling inter-segment concurrency more aggressively than we would want, penalizing requests that do benefit from concurrency. At the same time, we do want to have some kind of protection against rejections of shard searches that would be caused by excessive slicing. When the queue is above a certain size, we can turn off the slicing and effectively disable inter-segment concurrency. With this commit we set that threshold to be the number of threads in the search pool.
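The threshold rule described above can be sketched as a single predicate. This is a simplified model, not the actual Elasticsearch code: the function and parameter names are hypothetical, and treating a queue size exactly equal to the thread count as still allowing slicing is an assumption drawn from the phrase "above a certain size".

```python
def should_parallelize_segments(queue_size: int, search_pool_threads: int) -> bool:
    """Decide whether inter-segment concurrency stays enabled.

    Slicing is turned off only once the search queue grows beyond the
    number of threads in the search pool (assumed strict comparison).
    """
    return queue_size <= search_pool_threads


def previous_rule(queue_size: int) -> bool:
    """The earlier, stricter rule: any queue at all disabled slicing."""
    return queue_size == 0
```

With a hypothetical 8-thread search pool, a transient queue of 3 tasks would have disabled slicing under the earlier rule, while the new threshold keeps it enabled until the queue exceeds 8.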

…wnloader (elastic#114924)

Currently the incremental and non-incremental bulk variations return different error codes when the provided JSON body is invalid. This commit ensures both versions return status code 400. Additionally, this renames the incremental REST tests to bulk tests and ensures that all tests work with both bulk API versions. We set these tests to randomize which version of the API we test on each run.
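The intended behavior, that both code paths map a malformed body to the same status, can be modeled with a toy sketch. Everything here is hypothetical (the function names, the newline-delimited parsing, the use of 200 for success); it only illustrates two parsing strategies agreeing on status 400 for invalid JSON and does not reflect the real bulk request handling.

```python
import json


def _parse_line(line):
    # Blank lines are skipped; anything else must be valid JSON.
    if line.strip():
        json.loads(line)


def bulk_status(body):
    """Non-incremental variant: the whole body is available up front."""
    try:
        for line in body.split("\n"):
            _parse_line(line)
    except json.JSONDecodeError:
        return 400
    return 200


def incremental_bulk_status(chunks):
    """Incremental variant: the body arrives in arbitrary chunks."""
    pending = ""
    try:
        for chunk in chunks:
            pending += chunk
            # Parse every complete line; keep the trailing partial line.
            *complete, pending = pending.split("\n")
            for line in complete:
                _parse_line(line)
        _parse_line(pending)
    except json.JSONDecodeError:
        return 400
    return 200
```

A randomized test, as the commit describes, would pick one of the two variants per run and assert the same status for the same body.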

…core validation (elastic#114877)

…y under snapshot (elastic#114784) * make named parameter for identifier and pattern snapshot

…#114271) * skip validating remote cluster index names in parser

This removes all recovery source specific SFM singletons. Whether recovery source is enabled can be checked via `DocumentParserContext`. This reduces the number of SFM instances by half.

…115459) The blob store may be triggered to create a local directory while in a reduced privilege context. This commit guards the creation of directories with doPrivileged.

…=reference/esql/esql-across-clusters/line_197} elastic#115575

…ic#114580) It was deprecated in elastic#104209 (8.13) and shouldn't be set or returned in 9.0. The Desired Nodes API is an internal API, and users shouldn't depend on its backward compatibility.

This PR adds detailed documentation for `logsdb` mode, covering several key aspects of its default behavior and configuration options. It includes:

- default settings for index sorting (`index.sort.field`, `index.sort.order`, etc.).
- usage of synthetic `_source` by default.
- information about specialized codecs and how users can override them.
- default behavior for `ignore_malformed` and `ignore_above` settings, including precedence rules.
- explanation of how fields without `doc_values` are handled and what we do if they are missing.

…2569) KibanaThreadPoolIT checks the Kibana system user can write (using the system read/write threadpools) even when the normal read/write threadpools are blocked. This commit re-enables a key part of the test which was disabled. closes elastic#107625

* Propagate root subobjects setting to downsample indexes * exclude tests from rest compat * remove subobjects propagation

* changelog entry

The test issue was fixed by elastic#110807 closes elastic#110801

…#115138) * fix: correctly update search status for a nonexistent local index * Check for cluster existence before updating * Remove unnecessary `println` * Address review comment: add an explanatory code comment * Further clarify code comment

`MetadataStateFormat.FORMAT.loadLatestState` can actually return null when the state directory hasn't been initialized yet, so we have to keep the null check when loading retention leases during the initialization of the engine. See elastic#39359

…ic#114698) Since metadata storage was moved to Lucene in elastic#50907 (7.16.0), we shouldn't encounter any on-disk global metadata files, so we can remove support for loading them.

…ic#114665) * Fixing remote ENRICH by pushing the Enrich inside FragmentExec * Improve handling of more complex cases such as several enriches

…present (elastic#115586) Sometimes the test framework adds a global legacy template. When this happens, a test that is using another legacy template to create an index emits a warning since the index matches two legacy templates. This PR allows that warning.

…ative long values (elastic#115594)

This change introduces a new index mode, lookup, for indices intended for lookup operations in ES|QL. Lookup indices must have a single shard and be replicated to all data nodes by default. Aside from these requirements, they function as standard indices. Documentation will be added later when the lookup operator in ES|QL is implemented.
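The requirements stated above can be sketched as a settings check. This is an illustrative model only: the function name is hypothetical, and expressing "replicated to all data nodes by default" through an `index.auto_expand_replicas` default of `0-all` is an assumption, not something the commit message states.

```python
def apply_lookup_defaults(settings):
    """Validate and default the settings of a hypothetical lookup index."""
    result = dict(settings)
    if result.get("index.mode") != "lookup":
        return result

    # Lookup indices must have exactly one shard.
    shards = int(result.setdefault("index.number_of_shards", 1))
    if shards != 1:
        raise ValueError(f"lookup index must have 1 shard, got {shards}")

    # Assumed mechanism for replicating to all data nodes by default.
    result.setdefault("index.auto_expand_replicas", "0-all")
    return result
```

Apart from these two constraints, the settings pass through unchanged, matching the statement that lookup indices otherwise function as standard indices.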

…uiteIT test {yaml=cluster.stats/30_ccs_stats/cross-cluster search stats search} elastic#115600

…gic (elastic#115487) The approach taken by `ExpressionList` becomes very expensive for large numbers of indices/datastreams. It implies that large lists of concrete names (as they are passed down from the transport layer via e.g. security) are copied at least twice during iteration. Removing the intermediary list and inlining the logic brings down the latency of searches targeting many shards/indices at once and allows for subsequent optimizations. The removed tests appear redundant as they tested an implementation detail of the IndexNameExpressionResolver, which itself is well covered by its own tests.

Fixes a timeout in the Inference API: when connecting to an existing deployment that does not actually exist, the listener was never called.

…ices.create/10_basic/Create lookup index} elastic#115605

We can randomly inject a global template that defaults to 2 shards instead of 1. This causes the lookup index YAML tests to fail. To avoid this, the change requires specifying the default_shards setting for these tests.

elasticsearchmachine added the needs:triage and v9.0.0 labels Oct 11, 2024

georgewallace added the >docs and Team:Docs labels Oct 11, 2024

elasticsearchmachine removed the needs:triage label Oct 11, 2024

georgewallace marked this pull request as draft October 11, 2024 19:16

szabosteve and others added 22 commits October 24, 2024 20:40

[DOCS] Adds link to tutorial and API docs to trained model autoscalin…

999e2c2

…g. (elastic#114904)

Mute org.elasticsearch.xpack.inference.DefaultEndPointsIT testInferDe…

76599ff

…ploysDefaultElser elastic#114913

Fix this log level (elastic#114921)

a60e76c

@masseyke noticed this in his review of elastic#114847. I fixed it in the backport to `8.x` via elastic#114872, but this PR is needed to get the same fix into `main`.

Reenable incremental bulk tests (elastic#114922)

cf54058

These tests should be fixed and can be unmuted. The associated github issues have already been closed.

Add 8.16 to branches.json

18fc3cf

Bump 8.x to version 8.17.0

a34b39d

Make ESQL EnrichPolicyResolver try to do proper connection before sen…

cffae1f

…ding requests (elastic#114870) * Make ESQL EnrichPolicyResolver try to do proper connection before sending requests * Make encureConnected be !skipUnavailable

Reducing error-level stack trace logging for normal events in GeoIpDo…

d393ce7

…wnloader (elastic#114924)

Adding deprecation warnings for rank and sub_searches (elastic#114854)

02f3b41

Fixing number of shards for random_rerank_retriever tests to ensure s…

94f3f92

…core validation (elastic#114877)

[ES|QL] Make named parameter for identifier and pattern available onl…

064ec14

…y under snapshot (elastic#114784) * make named parameter for identifier and pattern snapshot

[ES|QL] Skip validating remote cluster index names in parser (elastic…

1dcaf88

…#114271) * skip validating remote cluster index names in parser

Squash transport versions into 8.15 (elastic#114827)

3b39471

Add diagnostic output to dra workflow scripts (elastic#114973)

dbeedf0

Reduce the number of SFM singletons. (elastic#114969)

9a7a49b

This removes all recovery source specific SFM singletons. Whether recovery source is enabled can be checked via `DocumentParserContext`. This reduces the number of SFM instances by half.

kkrik-es and others added 26 commits October 24, 2024 20:41

Ignore _field_names warning in testRollupAfterRestart (elastic#115563)

948d4a5

Guard blob store local directory creation with doPrivileged (elastic#…

6a56a7c

…115459) The blob store may be triggered to create a local directory while in a reduced privilege context. This commit guards the creation of directories with doPrivileged.

Remove unused elasticsearch cloud docker image (elastic#115357)

88d3c99

[DOCS][101] Add BYO vectors ingestion tutorial (elastic#115112)

407b86b

Mute org.elasticsearch.smoketest.DocsClientYamlTestSuiteIT test {yaml…

a830d45

…=reference/esql/esql-across-clusters/line_197} elastic#115575

Don't return or accept node_version in the Desired Nodes API (elast…

af86ecc

…ic#114580) It was deprecated in elastic#104209 (8.13) and shouldn't be set or returned in 9.0. The Desired Nodes API is an internal API, and users shouldn't depend on its backward compatibility.

Propagate root subobjects setting to downsample indexes (elastic#115358)

4588246

* Propagate root subobjects setting to downsample indexes * exclude tests from rest compat * remove subobjects propagation

Make a minor change to trigger release note process (elastic#113975)

dc5013c

* changelog entry

Reenable CacheFileTests (elastic#115582)

f84a07e

The test issue was fixed by elastic#110807 closes elastic#110801

Remove loading on-disk cluster metadata from the manifest file (elast…

47ac338

…ic#114698) Since metadata storage was moved to Lucene in elastic#50907 (7.16.0), we shouldn't encounter any on-disk global metadata files, so we can remove support for loading them.

Fixing remote ENRICH by pushing the Enrich inside FragmentExec (elast…

d2b89e5

…ic#114665) * Fixing remote ENRICH by pushing the Enrich inside FragmentExec * Improve handling of more complex cases such as several enriches

Update BlobCacheBufferedIndexInput::readVLong to correctly handle neg…

0435ddb

…ative long values (elastic#115594)

Mute org.elasticsearch.xpack.security.CoreWithSecurityClientYamlTestS…

ee9612d

…uiteIT test {yaml=cluster.stats/30_ccs_stats/cross-cluster search stats search} elastic#115600

[ML] Fix timeout attaching to missing deployment (elastic#115517)

6e30721

Fixes a timeout in the Inference API: when connecting to an existing deployment that does not actually exist, the listener was never called.

Mute org.elasticsearch.test.rest.ClientYamlTestSuiteIT test {yaml=ind…

1101c0b

…ices.create/10_basic/Create lookup index} elastic#115605

Do not run lookup index YAML with two shards (elastic#115608)

969cf35

We can randomly inject a global template that defaults to 2 shards instead of 1. This causes the lookup index YAML tests to fail. To avoid this, the change requires specifying the default_shards setting for these tests.

fixing merge conflict

7b15ec4

updates

5c2c7f2

updates

4a8f9e6

georgewallace closed this Oct 25, 2024

georgewallace deleted the ref_arch branch October 25, 2024 02:55

georgewallace restored the ref_arch branch October 25, 2024 02:56

georgewallace deleted the ref_arch branch October 25, 2024 03:05


76 participants
