ResumeParser

This project extracts relevant information from resumes in PDF format, including skills, experience, education, and more. The extracted data is output as CSV and JSON files.

Features

Extracts technical and non-technical skills.
Categorizes internships and jobs into technical and non-technical roles.
Extracts educational details such as degree, course, CGPA, HSC, and SSC.
Supports OCR fallback for non-standard PDFs.

Getting Started

Prerequisites

Python 3.x

Installation

Clone the repository:

git clone https://github.com/AdiD-code/ResumeParser.git
cd resume-info-extraction

Install the dependencies

The requirements.txt file includes the necessary Python libraries for the project. To install them, run: Install Python libraries using pip:

pip install -r requirements.txt

Install Poppler

For Windows: Download the binaries from Poppler for Windows and add the path to poppler/bin to your system's PATH environment variable.

Set environment variables for Poppler (Windows only): Add the path to the poppler/bin directory to your system's PATH environment variable. This allows the program to find the Poppler tools.

For macOS: Use homebrew:

brew install poppler

For Linux: Install using your package manager:

sudo apt-get install poppler-utils

Install Tesseract OCR (for OCR capabilities)

For Windows: Download and install from Tesseract at UB Mannheim.

Set environment variables for Tesseract-OCR (Windows only): Add the path to the installed Tesseract application to your system's PATH environment variable. This allows the program to find the Tesseract-OCR.

For macOS: Use Homebrew:

brew install tesseract

For Linux: Install using your package manager:

sudo apt-get install tesseract-ocr

Make sure to install additional system dependencies like Poppler and Tesseract as described above.

To Do List

Improve accuracy for 'HSC' and 'SSC' scores

Improve 'College' extraction

Improve 'Course' extraction

Output

The extracted data will be saved in both CSV and JSON formats.

The output includes: Technical and non-technical skills. Details about internships and jobs, categorized by technical and non-technical roles. Educational details such as degree, course, CGPA, HSC, and SSC. Feel free to customize the file paths and commands as needed for your environment.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

ResumeParser

Features

Getting Started

Prerequisites

Installation

To Do List

Improve accuracy for 'HSC' and 'SSC' scores

Improve 'College' extraction

Improve 'Course' extraction

Output

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Uh oh!

Uh oh!

AdiD-code/ResumeParser

Folders and files

Latest commit

History

Repository files navigation

ResumeParser

Features

Getting Started

Prerequisites

Installation

To Do List

Improve accuracy for 'HSC' and 'SSC' scores

Improve 'College' extraction

Improve 'Course' extraction

Output

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages