How do you chunk non-English legal docs for RAG agents? #6
Isaac24Karat
started this conversation in
General
Replies: 1 comment
Follow-up after testing different chunking strategies on Hebrew legal docs:
Next: I’ll explore overlap tuning vs. embedding window width to reduce hallucination.
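For anyone who wants to try the overlap experiment, here is a rough sketch of what "overlap tuning" could look like when chunks are built from whole sentences. The function name `sliding_chunks` and the `window` / `overlap` parameters are made up for illustration and are not from any particular library; it assumes the text has already been split into sentences.

```python
def sliding_chunks(sentences: list[str], window: int = 5, overlap: int = 1) -> list[list[str]]:
    """Group sentences into windows that share `overlap` sentences with the previous window.

    Hypothetical helper: `window` is the number of sentences per chunk,
    `overlap` is how many trailing sentences are repeated in the next chunk.
    """
    if window <= 0 or not 0 <= overlap < window:
        raise ValueError("need window > 0 and 0 <= overlap < window")

    step = window - overlap  # how far the window advances each time
    chunks: list[list[str]] = []
    for start in range(0, len(sentences), step):
        chunks.append(sentences[start:start + window])
        # Stop once the window has reached the end, so no chunk is
        # made up purely of sentences already covered.
        if start + window >= len(sentences):
            break
    return chunks


if __name__ == "__main__":
    demo = ["s1", "s2", "s3", "s4", "s5"]
    for chunk in sliding_chunks(demo, window=3, overlap=1):
        print(chunk)
```

Sweeping `overlap` against a fixed `window` (and the other way around) on a small set of test questions would be one way to compare the two knobs.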
Standard chunking often breaks sentence boundaries or inserts mid-paragraph splits in Hebrew and German legal docs.
Anyone tried:
Would love to hear real-world methods that worked.
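To make the problem concrete, here is a minimal sketch of a chunker that packs whole sentences and never crosses a paragraph break, instead of cutting at a fixed character count. Everything in it is an assumption for illustration: the names `split_sentences` and `chunk_by_sentences`, the `max_chars` budget, and especially the naive regex splitter, which will mis-split at abbreviations that are common in legal text (e.g. "Abs." or "Nr." in German). A language-aware sentence splitter would need to replace it in practice.

```python
import re

# Naive sentence boundary: whitespace that follows ., !, ? or the
# Hebrew sof pasuq (U+05C3). Illustration only; abbreviations break it.
_SENTENCE_END = re.compile(r"(?<=[.!?\u05C3])\s+")


def split_sentences(paragraph: str) -> list[str]:
    """Split one paragraph into sentences with the naive regex above."""
    return [s.strip() for s in _SENTENCE_END.split(paragraph) if s.strip()]


def chunk_by_sentences(text: str, max_chars: int = 800) -> list[str]:
    """Pack whole sentences into chunks of roughly max_chars.

    Chunks never span a blank-line paragraph break, and a sentence is
    never cut in half (a single over-long sentence becomes its own chunk).
    """
    chunks: list[str] = []
    for paragraph in re.split(r"\n\s*\n", text):
        current = ""
        for sentence in split_sentences(paragraph):
            candidate = f"{current} {sentence}".strip()
            if current and len(candidate) > max_chars:
                chunks.append(current)  # budget exceeded: close the chunk
                current = sentence
            else:
                current = candidate
        if current:
            chunks.append(current)  # flush at the paragraph boundary
    return chunks


if __name__ == "__main__":
    sample = "Erster Satz. Zweiter Satz.\n\nNeuer Absatz."
    for chunk in chunk_by_sentences(sample, max_chars=20):
        print(repr(chunk))
```

The budget is in characters only to keep the sketch short; counting tokens with whatever tokenizer the embedding model uses would be the more realistic choice.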