Skip to content

This parser can extract Name, Email Address, Skills etc. entities from resumes. You can always go for more according to your wish. This is an end to end deployed project.

Notifications You must be signed in to change notification settings

Rithik-Alias/Resume-Parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 

Repository files navigation

This was the result that we got on sending a random resume to the model. You can see how accurate it is. image

Resume-Parser

This project has 3 phases

  • Annotation of resumes
  • Training the model
  • Deployment using Flask

Annotation of Resume

You can use UBIAI tool to do annotation of pdf documents. It has very good interface for annotation. But its not open source!!!
The link to UBIAI : UBIAI
Or else you can go for label-studio The details about installation of label studio is available in its GitHub repository. label-studio
Export your annotated file in .conll format

Training the model

I used Spacy transformers using BERT here to create the model. The python notebook for training the model is available in train folder.

!python -m spacy convert Dev.conll ./ -t json -n 3 -c iob
!python -m spacy convert Train.conll ./ -t json -n 3 -c iob
If you are using UBIAI tool replace the .conll files with .tsv files that you get from UBIAI tool.

!python -m spacy init fill-config base_config.cfg config.cfg The base_config file specified in this line is available from (https://spacy.io/usage/training?ref=hackernoon.com) image

After the training the model will be saved in the directory NER/model-best/

You can test the model and the python notebook for training the model is available in Test folder.

Unfortunately from spacy 3.0 onwards, there is no option to get the confidence score of extracted entities.

You can get the validation score while training in meta.json file inside the model folder.

Deploying the model

The model deployment code is available in web folder. After installing flask pip install flask run app.py

About

This parser can extract Name, Email Address, Skills etc. entities from resumes. You can always go for more according to your wish. This is an end to end deployed project.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published