This project implements a competitive AI framework in which two intelligent agents compete in a head-to-head multiple-choice question (MCQ) battle.
The system consists of:
- 🧠 Q-Agent – Generates difficult, domain-specific MCQs
- 🤖 A-Agent – Solves MCQs using structured reasoning
- 📊 Evaluation Module – Measures accuracy and robustness
The entire system was developed using Llama 3.1 within the provided hackathon Jupyter environment.
In the AMD AI Premier League (AAIPL), teams are required to build:
- A Question Agent capable of generating challenging and valid MCQs.
- An Answer Agent capable of solving opponent-generated questions accurately.
The objective is to maximize:
- Question difficulty
- Question correctness
- Answer accuracy
- System reliability
Llama 3.1 serves as the core model:
- Large language model optimized for reasoning
- Shared by both the Q-Agent and the A-Agent
- Controlled via structured prompt engineering
- Tuned for multi-step logical analysis
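Because one model backs both agents, the split between Q-Agent and A-Agent comes entirely from the prompt layer. A minimal sketch of that layering is below; the system-prompt wording and the `build_chat` helper are illustrative assumptions, not the exact prompts used in the notebook, though the role/content message format matches what HuggingFace chat templates for Llama 3.1 expect.

```python
# Illustrative system prompts -- the real notebook prompts are assumed
# to be more elaborate (difficulty control, self-verification rules).
Q_AGENT_SYSTEM = (
    "You are a question-setting agent. Generate one difficult, domain-specific "
    "multiple-choice question with options A-D and exactly one correct answer. "
    "Verify the question for logical consistency before emitting it."
)

A_AGENT_SYSTEM = (
    "You are an answering agent. Reason step by step, eliminate options one "
    "by one, then output 'Final Answer: <letter>' and a confidence percentage."
)

def build_chat(system_prompt: str, user_message: str) -> list[dict]:
    """Assemble a chat in the role/content format consumed by
    HuggingFace chat templates (and thus by Llama 3.1)."""
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_message},
    ]

# Same model, two behaviors -- only the system prompt differs.
q_chat = build_chat(Q_AGENT_SYSTEM, "Topic: propositional logic")
a_chat = build_chat(A_AGENT_SYSTEM, "Question: ...")
```

The design choice here is that swapping agents costs nothing at runtime: no second model is loaded, only a different message list is passed to generation.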
High-level architecture:

```
Jupyter Notebook Environment
            ↓
      Llama 3.1 Model
      ├── Q-Agent Prompt Layer
      └── A-Agent Prompt Layer
            ↓
      Evaluation Logic
            ↓
  Match Results & Accuracy
```
The Q-Agent generates domain-specific MCQs using structured prompts.
- Generates 4-option MCQs (A–D)
- Ensures only one correct answer
- Includes multi-step reasoning questions
- Self-verification for logical consistency
- Difficulty-aware question generation
- Conceptual traps
- Edge-case reasoning
- Strict output formatting
- Validation before final output
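The structural checks above (four options, exactly one correct answer) can be enforced with a validator run on the Q-Agent's raw output before it is accepted. This is a minimal sketch: the `A) ... Answer: X` text layout is an assumed output format, not necessarily the one used in the notebook.

```python
import re

def validate_mcq(text: str) -> bool:
    """Structural validation of a generated MCQ: require options labelled
    A) through D) and exactly one 'Answer: <letter>' line.
    The text layout is an illustrative assumption."""
    options = re.findall(r"^([A-D])\)", text, flags=re.MULTILINE)
    answers = re.findall(r"^Answer:\s*([A-D])\s*$", text, flags=re.MULTILINE)
    return sorted(options) == ["A", "B", "C", "D"] and len(answers) == 1

sample = """Which rule of inference is applied?
A) Modus ponens
B) Modus tollens
C) Hypothetical syllogism
D) Disjunctive syllogism
Answer: B"""
```

Questions failing the check would be regenerated rather than sent to the opponent, which is what keeps the question validity rate high.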
The A-Agent solves questions using step-by-step reasoning.
- Chain-of-thought reasoning
- Option-wise logical elimination
- Internal consistency check
- Confidence estimation
Example output:

```
Final Answer: C
Confidence: 87%
```
The system evaluates performance using:
- Accuracy (% correct answers)
- Question validity rate
- Logical consistency check
- Multi-match simulation testing
Multiple simulated matches were conducted to validate robustness.
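The core accuracy metric reduces to comparing the A-Agent's letters against the Q-Agent's answer key over each simulated match. A minimal sketch, with the list-of-letters data shapes assumed for illustration:

```python
def score_match(predictions: list[str], answer_key: list[str]) -> float:
    """Accuracy for one simulated match: the fraction of questions where
    the A-Agent's letter matches the Q-Agent's key."""
    assert len(predictions) == len(answer_key), "one prediction per question"
    if not answer_key:
        return 0.0
    correct = sum(p == k for p, k in zip(predictions, answer_key))
    return correct / len(answer_key)

# Averaging over several matches gives the robustness figure.
def mean_accuracy(matches: list[tuple[list[str], list[str]]]) -> float:
    scores = [score_match(p, k) for p, k in matches]
    return sum(scores) / len(scores) if scores else 0.0
```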
Tech stack:
- Python 3.10+
- Llama 3.1
- PyTorch
- HuggingFace Transformers
- Jupyter Notebook (Hackathon Platform)
Notebook structure:

```
AAIPL_Project.ipynb
│
├── Q-Agent Implementation
├── A-Agent Implementation
├── Evaluation Loop
└── Match Simulation
```
Key innovations:
- Adversarial co-design of the question and answer agents
- Structured prompt engineering for difficulty control
- Self-verification mechanism in Q-Agent
- Reasoning-based validation in A-Agent
- Modular AI architecture within notebook environment
Future work:
- LoRA-based domain fine-tuning
- Reinforcement learning for adaptive difficulty
- Multi-domain expansion
- API-based deployment
- Competitive benchmarking system
Team roles:
- ML Engineer – Model design & optimization
- Developer – Integration & implementation
- Designer – Documentation & presentation
This project demonstrates how structured prompt engineering combined with Llama 3.1 can create an effective adversarial AI competition framework capable of generating and solving high-difficulty MCQs with strong reasoning and accuracy.
Built entirely within the provided AAIPL Jupyter environment.
⭐ If you found this project interesting, feel free to star the repository!