code-mixed-ner

Code-Mixed NER: Named Entity Recognition on English–Indian Text

This project demonstrates a small-scale multilingual NER pipeline for code-mixed text that combines English with Telugu or Tamil. It is inspired by internship work at Palmtree Infotech.

📊 Dataset

Hand-annotated sentences mixing English with Telugu/Tamil
Entity labels: PERSON, LOCATION, ORGANIZATION, FOOD, etc.
Format: CSV (sentences + annotation spans)
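
The exact CSV schema is not documented here, so the following is only a minimal sketch of how span annotations might be read and sanity-checked. The column names (`sentence`, `start`, `end`, `label`) and the helper `load_annotations` are hypothetical; adjust them to match the files in data/.

```python
import csv
import io

# Hypothetical CSV layout: the real column names in data/ may differ.
SAMPLE_CSV = """sentence,start,end,label
"I loved the dosai at Sangeetha, Chennai!",12,17,FOOD
"I loved the dosai at Sangeetha, Chennai!",21,30,ORGANIZATION
"I loved the dosai at Sangeetha, Chennai!",32,39,LOCATION
"""


def load_annotations(csv_text):
    """Group span annotations by sentence.

    Returns {sentence: [(start, end, label, surface_text), ...]} and raises
    ValueError when a span's character offsets fall outside its sentence.
    """
    grouped = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        sentence = row["sentence"]
        start, end = int(row["start"]), int(row["end"])
        if not 0 <= start < end <= len(sentence):
            raise ValueError(f"span {start}:{end} is out of range for: {sentence!r}")
        grouped.setdefault(sentence, []).append(
            (start, end, row["label"], sentence[start:end])
        )
    return grouped


annotations = load_annotations(SAMPLE_CSV)
for sentence, spans in annotations.items():
    print(sentence)
    for start, end, label, text in spans:
        print(f"  {label}: {text!r} [{start}:{end}]")
```

Printing the surface text next to each offset pair makes off-by-one annotation errors easy to spot before any training run.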

⚙️ Tools Used

fastText (language detection)
Meta NLLB (translation)
spaCy (custom NER)
GLiNER (zero-shot NER)
HuggingFace Transformers
Pandas, Python

🧪 Sample Output

Sentence	Entities Detected
"I loved the dosai at Sangeetha, Chennai!"	FOOD: dosai, ORG: Sangeetha, LOC: Chennai
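
A row like the one above could be produced roughly as follows. This is a sketch under assumptions, not the notebook's actual code: the checkpoint name `urchade/gliner_multi-v2.1`, the lower-case label strings, the 0.5 threshold, and the helper names `detect_entities` and `format_entities` are all illustrative choices.

```python
LABELS = ["person", "location", "organization", "food"]

# Assumption: short display tags chosen to match the sample row above.
SHORT_TAGS = {"location": "LOC", "organization": "ORG"}


def detect_entities(text, model_name="urchade/gliner_multi-v2.1", threshold=0.5):
    """Zero-shot NER with GLiNER (needs `pip install gliner`; downloads the checkpoint)."""
    from gliner import GLiNER  # imported lazily so format_entities works without it

    model = GLiNER.from_pretrained(model_name)
    # Returns a list of dicts with "start", "end", "text", "label", "score".
    return model.predict_entities(text, LABELS, threshold=threshold)


def format_entities(entities):
    """Render entity dicts as 'TAG: text' pairs in order of appearance."""
    parts = []
    for ent in sorted(entities, key=lambda e: e["start"]):
        tag = SHORT_TAGS.get(ent["label"].lower(), ent["label"].upper())
        parts.append(f"{tag}: {ent['text']}")
    return ", ".join(parts)


# Hand-written entities in the shape GLiNER returns (scores omitted);
# this is not real model output.
example = [
    {"start": 21, "end": 30, "text": "Sangeetha", "label": "organization"},
    {"start": 12, "end": 17, "text": "dosai", "label": "food"},
    {"start": 32, "end": 39, "text": "Chennai", "label": "location"},
]
print(format_entities(example))  # FOOD: dosai, ORG: Sangeetha, LOC: Chennai
```

To try it on real text, call `format_entities(detect_entities("I loved the dosai at Sangeetha, Chennai!"))`; what the model actually returns depends on the checkpoint and threshold.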

🚀 How to Run

Clone the repo
Install dependencies from requirements.txt
Open notebooks/ner_experiments.ipynb
Run the notebook to test entity extraction with spaCy or GLiNER

📁 Folder Structure

data/: CSV files with sentences and annotations
notebooks/: Jupyter notebooks with demo pipeline
scripts/: Optional scripts for training/evaluation
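
What the evaluation scripts compute is not specified here. One common choice for NER is exact-match, entity-level precision, recall, and F1, sketched below. The function name `span_prf` and the example predictions are hypothetical.

```python
def span_prf(gold, predicted):
    """Exact-match precision, recall, and F1 over (start, end, label) spans.

    A predicted span counts as correct only if its offsets and its label
    both match a gold span.
    """
    gold, predicted = set(gold), set(predicted)
    true_pos = len(gold & predicted)
    precision = true_pos / len(predicted) if predicted else 0.0
    recall = true_pos / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


gold = [(12, 17, "FOOD"), (21, 30, "ORGANIZATION"), (32, 39, "LOCATION")]
# Hypothetical prediction: one span mislabeled, one span missed.
predicted = [(12, 17, "FOOD"), (21, 30, "LOCATION")]

precision, recall, f1 = span_prf(gold, predicted)
print(f"P={precision:.2f} R={recall:.2f} F1={f1:.2f}")  # P=0.50 R=0.33 F1=0.40
```

Exact matching is strict: a span with the right boundaries but the wrong label earns no credit, which is why the mislabeled "Sangeetha" span counts against both precision and recall.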

📌 Disclaimer

This is a public reconstruction of internship work using synthetic data and open-source tools. No proprietary data or internal IP is shared here.
