Skip to content
Discussion options

You must be logged in to vote

Is there a way to compute the approximate probability of a sentence?

We don't have any special feature for this, no.

For example, most news articles repeat someone else's analysis, instead of doing original reporting and presenting new insights. Can we use spacy/BERT to detect novelty?

I don't think "novelty", in terms of whether news is original reporting or not, has anything to do with the probability of a sentence in general. You probably want to look at document clustering or event detection.

There are some limited circumstances in which probability of a sentence, or the common stand-in of parser perplexity, can be useful, but it's usually more for detecting things like weird gram…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@polm
Comment options

Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
usage General spaCy usage
2 participants