A lightweight CLI chat tool that lets you swap between LLMs while keeping a short-term memory of the conversation.
I built this because most LLM APIs are stateless: they don't remember anything between messages unless you manually include the full conversation history in every request. I wanted something lightweight that simulates memory across model calls and lets me switch between different LLMs mid-conversation without losing context. The tool keeps a running memory of recent messages and feeds them into each API call, so the model can respond naturally, like a real conversation.
- Stores all messages in `chat_history.json`.
- Replays the last N turns to whichever model you choose, so each model sees the same context.
- Lets you switch models by typing a letter (A-C).
- Generates responses using Groq's Chat Completions API under the hood.
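The memory mechanism above can be sketched roughly like this. The file name `chat_history.json` comes from the README; the function names and the choice of N are illustrative assumptions, not the project's actual code:

```python
import json
import os

HISTORY_FILE = "chat_history.json"  # persistent store named in the README

def load_history(path=HISTORY_FILE):
    """Return the stored message list, or an empty list on first run."""
    if os.path.exists(path):
        with open(path) as f:
            return json.load(f)
    return []

def save_history(history, path=HISTORY_FILE):
    """Write the full message list back to disk after each exchange."""
    with open(path, "w") as f:
        json.dump(history, f, indent=2)

def last_n_turns(history, n=5):
    """Keep only the most recent n exchanges (one user + one assistant
    message per turn), which is the context replayed to the model."""
    return history[-2 * n:]
```

Because every model receives the same trimmed `messages` list, switching models mid-conversation does not lose context.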
| Key | Model ID |
|---|---|
| A | gemma2-9b-it |
| B | llama-3.3-70b-versatile |
| C | llama3-8b-8192 |
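The key-to-model switch can be as simple as a lookup table built from the menu above. The `MODELS` dict mirrors the table; `resolve_model` and its fallback behavior are an illustrative sketch, not necessarily how `main.py` does it:

```python
# Model menu from the table above; keys are what the user types at the prompt.
MODELS = {
    "A": "gemma2-9b-it",
    "B": "llama-3.3-70b-versatile",
    "C": "llama3-8b-8192",
}

def resolve_model(key, default="A"):
    """Map a typed letter to a Groq model ID (case-insensitive,
    falling back to the default on unknown input)."""
    return MODELS.get(key.strip().upper(), MODELS[default])

# With a model resolved, the Groq call looks roughly like this
# (requires GROQ_API_KEY in the environment):
#
#   from groq import Groq
#   client = Groq()
#   resp = client.chat.completions.create(
#       model=resolve_model(choice),
#       messages=history,  # the replayed recent turns
#   )
#   print(resp.choices[0].message.content)
```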
```bash
pip install requests groq
export GROQ_API_KEY="your-real-groq-key"
python main.py
```

Made by Yash Thapliyal 2025