An intelligent recommendation system for SHL assessments using semantic search and LLM re-ranking.
| Requirement | Status | Details |
|---|---|---|
| Crawl SHL Catalog | ✅ Done | 377 Individual Test Solutions crawled |
| Build Recommendation Engine | ✅ Done | Semantic search + LLM re-ranking |
| Natural Language Query Support | ✅ Done | Text/JD input supported |
| Return 1-10 Assessments | ✅ Done | Returns top 10 relevant assessments |
| Assessment Name + URL | ✅ Done | Full metadata included |
| API Endpoint (JSON) | ✅ Done | FastAPI at /recommend |
| Web Frontend | ✅ Done | Streamlit app |
| GitHub Repository | ✅ Done | Ready for submission |
| Submission CSV | ✅ Done | submission.csv generated |
| Mean Recall@10 Evaluation | ✅ Done | 22.11% with LLM reranking |
| LLM Integration | ✅ Done | Groq Llama-3.3-70B (Free) |
| Balanced Recommendations | ✅ Done | Hard + Soft skills mix |
- Semantic Search: sentence-transformers embeddings (all-MiniLM-L6-v2)
- LLM Re-ranking: Groq Llama-3.3-70B for intelligent re-ordering
- 377 Assessments: Complete SHL Individual Test Solutions catalog
- REST API: FastAPI backend with OpenAPI docs
- Web Interface: Streamlit frontend for testing
```bash
# Clone the repository
git clone https://github.com/yourusername/shl-recommendation-engine.git
cd shl-recommendation-engine

# Create virtual environment
python -m venv .venv
.venv\Scripts\activate       # Windows
# source .venv/bin/activate  # Linux/Mac

# Install dependencies
pip install -r requirements.txt
```

For LLM re-ranking, get a free Groq API key:

- Go to https://console.groq.com and create a free account
- Open the API Keys page and click "Create API Key"
- Copy the key (it starts with `gsk_...`)

```bash
# Create .env file
echo "GROQ_API_KEY=gsk_your_key_here" > .env
```

Option A: Streamlit Frontend

```bash
streamlit run app.py
```

Opens at http://localhost:8501

Option B: FastAPI Backend

```bash
uvicorn api:app --reload
```

- API: http://localhost:8000
- Docs: http://localhost:8000/docs
```python
import requests

response = requests.post("http://localhost:8000/recommend", json={
    "query": "Looking for Java developers with team collaboration skills",
    "top_k": 10,
    "use_llm": True
})
print(response.json())
```

Example response:

```json
{
  "query": "Looking for Java developers...",
  "method": "semantic+llm_rerank",
  "recommendations": [
    {
      "rank": 1,
      "assessment_name": "Core Java (Entry Level) - New",
      "url": "https://www.shl.com/products/...",
      "test_types": ["Knowledge & Skills"],
      "reason": "Directly assesses Java programming skills"
    }
  ],
  "processing_time_ms": 250.5
}
```

```
┌─────────────────┐    ┌──────────────────┐    ┌──────────────────┐
│   User Query    │───▶│ Semantic Search  │───▶│  LLM Re-ranking  │
│ (Job Desc/URL)  │    │ (FAISS + MiniLM) │    │ (Groq Llama-3.3) │
└─────────────────┘    └──────────────────┘    └──────────────────┘
                                │                       │
                                ▼                       ▼
                      ┌──────────────────┐    ┌──────────────────┐
                      │ Top-20 Candidates│───▶│ Top-10 Balanced  │
                      │  (by similarity) │    │ (hard+soft mix)  │
                      └──────────────────┘    └──────────────────┘
```
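The re-ranking stage (`llm/rerank.py`) presumably follows this shape; the prompt wording and function names are illustrative, and the Groq model id `llama-3.3-70b-versatile` is an assumption based on the Llama-3.3-70B mention above:

```python
# Illustrative sketch of LLM re-ranking: ask Groq's Llama-3.3-70B to
# reorder the top-20 semantic candidates into a balanced top-10.
import os


def build_rerank_prompt(query: str, candidates: list[str], top_k: int = 10) -> str:
    """Number the candidates and ask the model for a balanced top-k."""
    numbered = "\n".join(f"{i + 1}. {c}" for i, c in enumerate(candidates))
    return (
        f"Job requirement: {query}\n\n"
        f"Candidate assessments:\n{numbered}\n\n"
        f"Pick the {top_k} most relevant assessments, balancing technical "
        "(hard-skill) and personality/behavioral (soft-skill) tests. "
        "Answer with a comma-separated list of numbers."
    )


def rerank(query: str, candidates: list[str], top_k: int = 10) -> str:
    from groq import Groq  # pip install groq

    client = Groq(api_key=os.environ["GROQ_API_KEY"])
    resp = client.chat.completions.create(
        model="llama-3.3-70b-versatile",  # assumed Groq model id
        messages=[{"role": "user",
                   "content": build_rerank_prompt(query, candidates, top_k)}],
        temperature=0,
    )
    return resp.choices[0].message.content
```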
```
shl-recommendation-engine/
├── api.py                   # FastAPI backend (POST /recommend, GET /health)
├── app.py                   # Streamlit frontend
├── pipeline.py              # End-to-end pipeline
├── evaluate.py              # Evaluation script (Mean Recall@10)
├── generate_submission.py   # Generate submission CSV
├── submission.csv           # Predictions for test set
├── requirements.txt         # Dependencies
├── README.md                # Documentation
├── .env.example             # Environment template
├── data/
│   ├── catalog_enriched.json  # 377 Individual Test Solutions
│   ├── vector_store.faiss     # FAISS vector index
│   └── metadata.pkl           # Test metadata
├── src/
│   ├── crawler.py             # Web crawler for SHL catalog
│   └── vector_store.py        # Vector store implementation
└── llm/
    ├── prompt.txt             # LLM re-ranking prompt
    └── rerank.py              # Groq LLM re-ranking module
```
Mean Recall@10 on the training dataset:
| Method | Mean Recall@10 |
|---|---|
| Semantic Only | 20.56% |
| Semantic + LLM | 22.11% |
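Recall@10 is computed per query as the fraction of that query's relevant assessments appearing in the top-10 predictions, then averaged across queries. A minimal sketch (function names are mine, not necessarily those in `evaluate.py`):

```python
def recall_at_k(predicted: list[str], relevant: set[str], k: int = 10) -> float:
    """Fraction of relevant assessments found in the top-k predictions."""
    if not relevant:
        return 0.0
    hits = sum(1 for p in predicted[:k] if p in relevant)
    return hits / len(relevant)


def mean_recall_at_k(runs, k: int = 10) -> float:
    """Average recall@k over (predicted_list, relevant_set) pairs."""
    return sum(recall_at_k(p, r, k) for p, r in runs) / len(runs)


# Example: 2 of 4 relevant tests retrieved -> recall 0.5
print(recall_at_k(["A", "B", "C"], {"A", "C", "X", "Y"}))  # 0.5
```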
Run evaluation:
```bash
python evaluate.py --dataset "Gen_AI Dataset.xlsx"
```

Generate submission:

```bash
python generate_submission.py --dataset "Gen_AI Dataset.xlsx" --output submission.csv
```

| Variable | Description | Required |
|---|---|---|
| `GROQ_API_KEY` | Groq API key (free at console.groq.com) | For LLM re-ranking |
- Model: `all-MiniLM-L6-v2` (384 dimensions)
- Index: FAISS `IndexFlatIP` (cosine similarity)
- Candidate pool: 20 for re-ranking, configurable
| Code | Type | Description |
|---|---|---|
| K | Knowledge & Skills | Technical/hard skills (programming, etc.) |
| P | Personality | Personality assessments (OPQ, etc.) |
| B | Behavioral | Behavioral competencies |
| A | Ability | Cognitive abilities |
| S | Simulation | Job simulations |
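The "balanced recommendations" feature suggests mixing hard-skill and soft-skill tests in the final list. One simple interleaving heuristic, using the type codes from the table above, might look like this (the function and field names are illustrative, and treating A/S as hard-side is an assumption of this sketch):

```python
# Illustrative heuristic: interleave hard-skill (K/A/S) and soft-skill
# (P/B) assessments so the final top-k covers both categories.
def balance(candidates: list[dict], top_k: int = 10) -> list[dict]:
    hard = [c for c in candidates if c["type"] in {"K", "A", "S"}]
    soft = [c for c in candidates if c["type"] in {"P", "B"}]
    out = []
    while len(out) < top_k and (hard or soft):
        if hard:
            out.append(hard.pop(0))
        if len(out) < top_k and soft:
            out.append(soft.pop(0))
    return out


cands = [{"name": f"t{i}", "type": t} for i, t in enumerate("KKKKPPBKKA")]
print([c["type"] for c in balance(cands, 5)])  # ['K', 'P', 'K', 'P', 'K']
```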
Rebuild the vector index:

```bash
python -c "from src.vector_store import SHLVectorStore; s = SHLVectorStore(); s.build_index(); s.save()"
```

Run tests:

```bash
python -m pytest tests/
```

MIT License
- SHL for the comprehensive assessment catalog
- Sentence Transformers for the embedding models
- Groq for providing free LLM API access