This commit was created on GitHub.com and signed with GitHub’s verified signature.
Added a dedicated entity parser that normalizes strings, strips possessive suffixes, and deduplicates people, places, organizations, and topics before returning them
Introduced reusable helpers for capitalizing text, removing trailing possessives, and stripping punctuation
Updated the keyword parser to apply these helpers so extracted keywords and keyphrases are capitalized and free of trailing possessives
Loaded NLP plugin hints into the main parse workflow to enrich entity detection
Expanded test coverage to verify entity capitalization, possessive stripping, and keyword/phrase normalization
Removed domain-specific tweaks from scripts, as they are no longer required
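
For readers without the diff at hand, the normalization described above might look roughly like the sketch below. It is an illustration only, not the code from this commit: the language (Python), the function names (`strip_possessive`, `strip_punctuation`, `capitalize_words`, `normalize`, `dedupe_entities`), and the order in which the helpers are applied are all assumptions.

```python
import string

# Hypothetical helpers; names and behavior are assumptions, not the commit's code.
_EDGE_CHARS = string.punctuation + string.whitespace + "‘’“”"


def strip_punctuation(text: str) -> str:
    """Trim punctuation and whitespace from both ends of the string."""
    return text.strip(_EDGE_CHARS)


def strip_possessive(text: str) -> str:
    """Remove a trailing possessive: "'s" / "’s", or a bare apostrophe after "s"."""
    for suffix in ("'s", "’s"):
        if text.lower().endswith(suffix):
            return text[: -len(suffix)]
    if text.endswith(("s'", "s’")):
        return text[:-1]
    return text


def capitalize_words(text: str) -> str:
    """Uppercase the first letter of each word, leaving the rest untouched."""
    return " ".join(word[:1].upper() + word[1:] for word in text.split())


def normalize(text: str) -> str:
    """Apply the three helpers in one assumed order."""
    return capitalize_words(strip_possessive(strip_punctuation(text)))


def dedupe_entities(values: list[str]) -> list[str]:
    """Normalize each value and keep the first occurrence, compared case-insensitively."""
    seen: set[str] = set()
    result: list[str] = []
    for value in values:
        cleaned = normalize(value)
        key = cleaned.casefold()
        if cleaned and key not in seen:
            seen.add(key)
            result.append(cleaned)
    return result
```

Under these assumptions, `dedupe_entities(["london's", "London", " paris,", "new york's"])` yields `["London", "Paris", "New York"]`. Capitalizing only the first letter of each word, rather than calling `str.title()`, keeps acronyms such as `NASA` intact.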