Skip to content

Commit 450ab6b

Browse files
Update solutions/observability/observability-ai-assistant.md
Co-authored-by: Mike Birnstiehl <[email protected]>
1 parent a2ef057 commit 450ab6b

File tree

1 file changed

+6
-4
lines changed

1 file changed

+6
-4
lines changed

solutions/observability/observability-ai-assistant.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -493,10 +493,12 @@ Results for other languages or models may vary.
493493
::::
494494

495495
### Limitations [obs-ai-anonymization-limitations]
496-
* **Performance (NER)** – Running a named entity recognition model can add latency depending on the request. To improve performance of the model, consider scaling up your ML nodes by adjusting deployment parameters: increase `number_of_allocations` for better throughput and `threads_per_allocation` for faster individual requests. For details, refer to the [start trained model deployment API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-ml-start-trained-model-deployment).
497-
* **Structured JSON** – The NER model we validated (`elastic/distilbert-base-uncased-finetuned-conll03-english`) is trained on natural English text and often misses entities inside JSON or other structured data. If thorough masking is required, prefer regex rules and craft them to account for JSON syntax.
498-
* **False negatives / positives** – No model or pattern is perfect. Model accuracy may vary depending on model and input.
499-
* **JSON malformation risk** – Both NER inference and regex rules can potentially create malformed JSON when anonymizing JSON data such as function responses. This can occur by replacing text across character boundaries, which may break JSON structure causing the whole request to fail. If this occurs, you may need to adjust your regex pattern or disable the NER rule.
496+
Anonymization has the following limitations:
497+
498+
* **Performance (NER)**: Running an NER model can add latency depending on the request. To improve performance of the model, consider scaling up your ML nodes by adjusting deployment parameters: increase `number_of_allocations` for better throughput and `threads_per_allocation` for faster individual requests. For details, refer to [start trained model deployment API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-ml-start-trained-model-deployment).
499+
* **Structured JSON**: The NER model we validated (`elastic/distilbert-base-uncased-finetuned-conll03-english`) is trained on natural English text and often misses entities inside JSON or other structured data. If thorough masking is required, prefer regex rules and craft them to account for JSON syntax.
500+
* **False negatives / positives**: No model or pattern is perfect. Model accuracy may vary depending on model and input.
501+
* **JSON malformation risk**: Both NER inference and regex rules can potentially create malformed JSON when anonymizing JSON data such as function responses. This can occur by replacing text across character boundaries, which may break JSON structure causing the whole request to fail. If this occurs, you may need to adjust your regex pattern or disable the NER rule.
500502

501503

502504
## Known issues [obs-ai-known-issues]

0 commit comments

Comments
 (0)