Any way to improve this summary? #7517
I have found spaCy's text summary to be pretty good, but in some cases it creates assertions that are just wrong. Can I do anything to improve this? Sample text:
Results:
Notice "a son: Braxton Black and Rose White;" That's just wrong. The son should be "Alpha Morris Solomon (Carrie)", and Braxton and Rose should be great-grandchildren.
Replies: 2 comments 1 reply
So it looks like you're applying extractive summarization on top of spaCy using the sentence tokenizer and POS tags. What's happening is that you're running up against the limits of extractive summarization: if your basic unit is the sentence, there's not much you can do with your example document. In fact, it would be reasonable to interpret it as a single sentence, in which case there's nothing you can do.
If you want to improve your results with minimal changes, you might look at using a custom Sentencizer to change how sentence splits are detected, possibly treating all colons and semicolons as sentence dividers. If you're focused on summarizing obituaries like this, though, I would honestly just focus on extracting phrases like "4 daughters" and removing named entities marked as PERSON; this seems like a reasonable summary:
Also, does your original document actually write it like
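For reference, here is a minimal sketch of the Sentencizer suggestion, assuming spaCy v3. It uses a blank English pipeline so no trained model is needed, and the sample text is made up for illustration (it is not the original poster's document). The `punct_chars` setting replaces the default list of sentence-ending characters, so the usual terminators are listed again alongside the colon and semicolon.

```python
import spacy

# Blank English pipeline: tokenizer only, no trained model required.
nlp = spacy.blank("en")

# Rule-based sentence splitter. punct_chars overrides the defaults, so the
# usual terminators are repeated here next to ":" and ";".
nlp.add_pipe("sentencizer", config={"punct_chars": [".", "!", "?", ":", ";"]})

# Hypothetical obituary-style text, invented for this example.
text = (
    "She is survived by a son: John Doe (Jane); "
    "four daughters: Ann Doe and Mary Roe; "
    "and two great-grandchildren: Sam Poe and Lee Low."
)

# Each "sentence" is now a clause ending at a colon, semicolon, or period.
segments = [sent.text.strip() for sent in nlp(text).sents]
for segment in segments:
    print(segment)
```

Splitting on colons separates each relation ("a son:") from the names that follow it, so a summarizer that picks whole segments can still pair them wrongly. Splitting on semicolons only (dropping ":" from the list) keeps each relation together with its names and may be the safer starting point.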