Skip to content
Discussion options

You must be logged in to vote

Just before answering your specific questions - while it's good to understand what the code is doing, and general principles of what works and doesn't, NLP is often empirical. For any specific project, one of the most important things is to get an end-to-end pipeline working as a baseline and to work on improving performance iteratively. This helps you make sure that your task is feasible, and lets you avoid overthinking details that may end up not having significant impact on your task. In particular, with CPU models it's very fast to train them and try out different configurations.

1.Consider an example how does '5centimeter'using MultiHashEmbed get a vector ,Suppose its tokenised as […

Replies: 1 comment 11 replies

Comment options

You must be logged in to vote
11 replies
@marzooq-unbxd
Comment options

@marzooq-unbxd
Comment options

@polm
Comment options

@marzooq-unbxd
Comment options

@polm
Comment options

Answer selected by marzooq-unbxd
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / ner Feature: Named Entity Recognizer feat / tokenizer Feature: Tokenizer
2 participants