-
Notifications
You must be signed in to change notification settings - Fork 25.6k
(Doc+) Split API | add where need to have sufficient disk and how much #119784
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -121,11 +121,15 @@ newly split index. | |
A split operation: | ||
|
||
. Creates a new target index with the same definition as the source | ||
index, but with a larger number of primary shards. | ||
index, but with a larger number of primary shards. Created shards allocate | ||
to the same nodes as their correlating source primary shard, so it must | ||
already have sufficient disk to host the copy of the data. | ||
|
||
. Hard-links segments from the source index into the target index. (If | ||
the file system doesn't support hard-linking, then all segments are copied | ||
into the new index, which is a much more time consuming process.) | ||
+ | ||
TIP: Elastic Cloud's backing file systems do not support hard linking. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This is probably best answered by @elastic/es-core-infra team who did #61145 . Adding the author of that PR to this (but anyone from the team feel free to step in to review). There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Regarding #61145, we did merge a change for it but ended up removing it because, it turned out, Cloud was in fact using a filesystem that correctly reported quotas (XFS IIRC). Regarding this tip in the docs, I can only imagine it has something to do with attempting to hard-link across Docker volumes, which doesn't work, but you'd need to check with someone from the Hosted ESS team to get an up-to-date answer. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @stefnestor might you know who to include from Hosted ESS team to pitch in / review? |
||
|
||
. Hashes all documents again, after low level files are created, to delete | ||
documents that belong to a different shard. | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
After consulting @tlrx (thanks!) indeed if the shards are on different nodes, the new index shards will be allocated and split on the nodes of their corresponding shards.
We should probably also clarify the above in the following sentence later in the document:
to something like