OmdenaKnowledge_AIAgentsInferenceBenchmarking

Overview

This project demonstrates benchmarking of AI agents for date fruit classification using LLaMA 3.1-8B-instant model via the Groq API. The implementation includes comprehensive data analysis, feature extraction, model evaluation, and performance benchmarking within a Jupyter Notebook environment.

Frameworks and Libraries

AI and ML Libraries:
- groq: API connection to LLaMA 3.1-8B-instant model
- langgraph: For building agent workflow graphs
- sklearn: For data preprocessing, scaling, and evaluation metrics
Data Processing:
- pandas: For dataset manipulation and analysis
- numpy: For numerical operations
- matplotlib & seaborn: For data visualization and benchmark reporting
Utilities:
- dotenv: For secure API key management
- re: For text processing with regular expressions
- json: For benchmark data storage
- datetime: For timestamping benchmark results

Dataset

The project uses the Date Fruit Dataset (Date_Fruit_Datasets.xlsx) containing features such as:

Area, Perimeter, Major/Minor Axis measurements
Eccentricity, Solidity, Convex Area
Texture and color features
Classification labels (BERHI, DEGLET, DOKOL, etc.)

Key Components

1. DateFruitAgent

A sophisticated agent that processes and analyzes date fruit features:

Feature preprocessing and scaling
Analysis of fruit characteristics
Classification into fruit categories
Comprehensive reporting

2. Benchmarking System

Metrics tracked and visualized:

Latency (processing time)
Model response analysis
Classification accuracy
Feature importance

3. Visualization & Reporting

Performance charts and metrics visualizations
Benchmark summaries
Classification distribution reports
Feature analysis documentation

Benchmark Results

The benchmarking shows:

Average analysis time: ~2-3 seconds for feature analysis
Classification latency: ~4-5 seconds per sample
Varying performance based on sample complexity
Model accuracy evaluation against ground truth

Output Files

Benchmark JSON files in reports/benchmark/
Visualization charts in reports/charts/
Comprehensive analysis reports in reports/

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
Data Analysis with SQL Queries		Data Analysis with SQL Queries
ai-disaster-tweets-detection-agent		ai-disaster-tweets-detection-agent
data		data
reports		reports
venv		venv
.gitignore		.gitignore
AI_agent_Data_analysis.ipynb		AI_agent_Data_analysis.ipynb
AI_agent_Data_analysis_with_benchmark.ipynb		AI_agent_Data_analysis_with_benchmark.ipynb
CrewAI_data_analyst_Agent.ipynb		CrewAI_data_analyst_Agent.ipynb
README.md		README.md
classification_benchmark.ipynb		classification_benchmark.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OmdenaKnowledge_AIAgentsInferenceBenchmarking

Overview

Frameworks and Libraries

Dataset

Key Components

1. DateFruitAgent

2. Benchmarking System

3. Visualization & Reporting

Benchmark Results

Output Files

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 6

Languages

OmdenaAI/OmdenaKnowledge_AIAgentsInferenceBenchmarking

Folders and files

Latest commit

History

Repository files navigation

OmdenaKnowledge_AIAgentsInferenceBenchmarking

Overview

Frameworks and Libraries

Dataset

Key Components

1. DateFruitAgent

2. Benchmarking System

3. Visualization & Reporting

Benchmark Results

Output Files

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 6

Languages

Packages