Don't think it will do everything for you, but it can chat and generate images for you. Parental advice: be nice so you don't get hunted when AI takes over...
This project uses Ollama to access any LLM for chatting; easy setup is the main reason for choosing Ollama. For image generation, we use Stable Diffusion v1.5, which the diffusers package downloads for you, so you don't need to bother about it. If you want to change the image generation model, change it on line 4 in `sd.py`, and do this only if you have enough GPU VRAM.
- Install Ollama from https://ollama.com/download according to your platform, or run this command in terminal/cmd/whatever you use: `curl -fsSL https://ollama.com/install.sh | sh`
- Check if it installed successfully with `ollama --version`. If a version is displayed, it means success, your parents are proud, fn
- Next, pull the LLM model you want to use, like this: `ollama pull gemma3:1b`. I'm pulling the gemma3:1b model because that's the only one my 8-year-old laptop can handle now; if you are rich, pull any bigger model that you like.
- Run the model now: `ollama run gemma3:1b`. This starts the chat in the terminal, so go ahead and ask it a question. Ollama also exposes the model at `http://localhost:11434/api/generate`, which the app uses to communicate with it. Don't bother going to this URL in the browser, you are blocked already, lol.
- If you pulled any other model, update line 5 in `chat.py`. Papa's money, huh?
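Since the app talks to `http://localhost:11434/api/generate`, here is a minimal sketch of what that exchange looks like. The helper names below are my own, not taken from `chat.py`; only the payload shape and the streaming response format follow Ollama's documented API.

```python
import json

OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(prompt, model="gemma3:1b"):
    # stream=False makes Ollama return one JSON object instead of
    # newline-delimited chunks.
    return {"model": model, "prompt": prompt, "stream": False}


def parse_stream(lines):
    # With stream=True, Ollama sends one JSON object per line; the text
    # lives in the "response" field and the final chunk has "done": true.
    text = []
    for line in lines:
        chunk = json.loads(line)
        text.append(chunk.get("response", ""))
        if chunk.get("done"):
            break
    return "".join(text)


# A fake streamed response, just to show the parsing:
sample = [
    '{"response": "Hello", "done": false}',
    '{"response": " world", "done": true}',
]
print(parse_stream(sample))  # -> Hello world
```

In the real app you would POST `build_payload(...)` to `OLLAMA_URL` and feed the response lines to `parse_stream`.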
- Run the bash script `main.sh` after starting Ollama. `main.sh` activates the Python virtual environment, so change the name of the venv to match your project. The script also starts the backend and the frontend. You can use the UI at `http://localhost:5173/`.
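For reference, a script like `main.sh` usually looks something like this. This is a sketch, not the actual file; the venv name `.venv` and the directory layout are assumptions.

```shell
#!/usr/bin/env bash
# Sketch of a main.sh: activate the venv, then start backend and frontend.
# ".venv" is an assumed name; change it to match your project.
source .venv/bin/activate

# Start the FastAPI backend (serves on :8000)
(cd backend && uvicorn main:app --reload) &

# Start the Vite frontend (serves the UI on :5173)
(cd frontend && npm run dev) &

# Keep the script alive until both processes stop
wait
```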
- Create a Python virtual environment if needed: `python -m venv .venv`
- Activate it: `source .venv/bin/activate`
- Install the packages from requirements.txt: `cd backend` and then `pip install -r requirements.txt`
- From the backend directory, run `uvicorn main:app --reload` and go to `localhost:8000/docs` to check the API.
- For the frontend: `cd frontend` and then `npm install && npm run dev`.
Note: the first time you request to generate an image, i.e. the first time with a control image and the first time without one, the application will take some time, depending on your network bandwidth, to download the Stable Diffusion and ControlNet models.
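The reason there are two separate "first times" is that each pipeline variant is loaded (and downloaded) lazily, then cached for later requests. A minimal sketch of that pattern, with a placeholder loader standing in for the real diffusers calls:

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def get_pipeline(with_control: bool):
    # In the real app this would be diffusers' StableDiffusionPipeline or
    # StableDiffusionControlNetPipeline .from_pretrained(...), which
    # downloads the weights the first time it runs. Placeholder here.
    return {"uses_controlnet": with_control}


plain = get_pipeline(False)   # slow the first time: download + load
again = get_pipeline(False)   # instant: same cached object
print(plain is again)         # -> True
```

So each of the two variants pays the download cost once, and every request after that reuses the cached pipeline.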
PS: DON'T get offended by the sarcasm used here, it's just for fun or my mood at the time of writing this. I am using my 8-year-old laptop with an i7 7th gen, an Nvidia GTX 1050 Ti 4GB and 16GB RAM, so I had to use the smallest working models to get this working. If you have a better PC, good for you; try out other bigger models and let me know how it feels to be up high in cloud 9!!!
torch==2.6.0
torchvision==0.21.0