1.2.0 Improved entity and keyword parsing

@fmacpro fmacpro released this 13 Sep 17:59
· 79 commits to master since this release
654804d
  • Added a dedicated entity parser that normalizes strings, strips possessive suffixes, and deduplicates people, places, organizations, and topics before returning them
  • Introduced reusable helpers for capitalizing text, removing trailing possessives, and stripping punctuation
  • Updated the keyword parser to apply these helpers so extracted keywords and keyphrases are capitalized and free of trailing possessives
  • Loaded NLP plugin hints into the main parse workflow to enrich entity detection
  • Expanded test coverage to verify entity capitalization, possessive stripping, and keyword/phrase normalization
  • Removed domain-specific tweaks from scripts, as they are no longer required
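
As a rough illustration of the normalization described in the notes above, the sketch below shows how possessive stripping, punctuation stripping, capitalization, and deduplication might fit together. It is not the project's code: every function name here (`stripPossessive`, `stripPunctuation`, `capitalize`, `normalizeEntity`, `dedupeEntities`) is hypothetical, and the exact rules (word-by-word capitalization, case-insensitive deduplication, the particular punctuation set) are assumptions made for the example.

```typescript
// Hypothetical helpers; names and rules are assumptions, not the library's API.

/** Remove a trailing possessive: "London's" -> "London", "James'" -> "James". */
function stripPossessive(text: string): string {
  return text.replace(/(?:['’]s|['’])$/i, "");
}

/** Trim whitespace and a small, assumed set of punctuation from both ends. */
function stripPunctuation(text: string): string {
  return text.replace(/^[\s.,;:!?"'’()\[\]{}-]+|[\s.,;:!?"'’()\[\]{}-]+$/g, "");
}

/** Upper-case the first character of each whitespace-separated word. */
function capitalize(text: string): string {
  return text
    .split(/\s+/)
    .filter((word) => word.length > 0)
    .map((word) => word.charAt(0).toUpperCase() + word.slice(1))
    .join(" ");
}

/** Apply the three helpers in one pass: punctuation, then possessive, then case. */
function normalizeEntity(text: string): string {
  return capitalize(stripPossessive(stripPunctuation(text)));
}

/** Normalize each entity and keep the first occurrence, compared case-insensitively. */
function dedupeEntities(entities: string[]): string[] {
  const seenKeys: string[] = [];
  const result: string[] = [];
  for (const raw of entities) {
    const normalized = normalizeEntity(raw);
    if (normalized.length === 0) continue; // skip entries that were only punctuation
    const key = normalized.toLowerCase();
    if (seenKeys.indexOf(key) === -1) {
      seenKeys.push(key);
      result.push(normalized);
    }
  }
  return result;
}

// Example: three spellings of the same place collapse to one entry.
const places = dedupeEntities(["London", "london's", "London.", "Paris"]);
console.log(places);
```

Stripping punctuation before the possessive means an input such as `London's,` loses its comma first, so the possessive suffix sits at the end of the string by the time it is checked.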