Use the following algorithm to add a Summarize(text string, count int) []string method to our nlp package.
- For every word in the sentence count how many times it appears in the document.
- This is known as
token frequencyortf.
- This is known as
- For every sentence in the document, give is a score which is the sum of all the token frequencies for the tokens in the sentence.
- e.g. if we have the following token frequencies: please=1, reinstall=3,
universe=3, and=10, reboot=2. Then the score of the sentence
Please reinstall universe and rebootis1 + 3 + 3 + 10 + 2 = 19
- e.g. if we have the following token frequencies: please=1, reinstall=3,
universe=3, and=10, reboot=2. Then the score of the sentence
- Return the
counttop scored sentences (this is known assentence extraction)
- Test your function. Have directory with texts to summarize and expected output.
- Create testable example for
Summarize - Create a benchmark for your server under load, you can use vegeta to generate load
- Tag a new version of your code. Create a test to see it's
go getable - Publish a docker image for
nlpdto docker hub - Add gRPC front end
nlpd. Have an option to run either HTTP server or gRPC or both.- Make sure metrics still work in
gRPConly mode
- Make sure metrics still work in
- Head over to godoc and check how your package documentation looks. See it in the eyes of a new user and fix it
- Use http/httptest to test your HTTP handlers