This is a take-home assignment I completed for a job I didn't end up getting. I built it over 4 days and don't plan on revising it further, but I regard it as a good sample of my work within such constraints. For that reason I've decided to make it public, without sharing the original prompt.
The LLM Service API is built using Django and Django REST Framework and allows users to make queries to OpenAI, Google Gemini, and our in-house LLM. It is easily configurable and extensible to allow the addition of other LLM providers.
- Python 3.13
- uv. Note: these instructions assume you are using uv for project and dependency management. If you use another tool, drop the `uv` prefix from the commands below and use pip to create a virtual environment, install dependencies, and so on.
- An OpenAI API key (to interact with OpenAI LLMs)
- A Gemini API key (to interact with Gemini LLMs)
- Clone this repository
- Navigate to the `llm_service_api` directory
- Copy the contents of `env-template` into a file named `.env` in the project's root directory, and fill in appropriate values without quotation marks. This app uses `python-decouple` to manage environment variables, and will not run unless they are set (see the sketch just after this list)
- Run `uv sync` to synchronize the project dependencies with your environment
- Run `uv run python manage.py migrate` to apply the initial database migrations
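For reference, this is roughly how the app reads those values with `python-decouple` (a sketch only; the key names below are assumptions, so use the ones actually defined in `env-template`):

```python
# Illustrative sketch of how project/settings.py consumes .env via
# python-decouple. The key names here are assumptions; env-template
# defines the real ones.
from decouple import config

OPENAI_API_KEY = config('OPENAI_API_KEY')  # raises UndefinedValueError if unset
GEMINI_API_KEY = config('GEMINI_API_KEY')
```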
To start the server, run `uv run python manage.py runserver`
Django REST Framework autogenerates a browsable, testable API. Just open http://127.0.0.1:8000 (or whatever address and port your server is bound to) in a web browser. Enter a JSON payload into the provided text box and press the Post button to send it to the endpoint.
You can also use `curl` or another utility to send requests directly from the command line. Below you'll find examples of both the request JSON (if applicable) and the corresponding `curl` command.
`GET /api/` returns a welcome message. If you cannot access it, either the server is not running or you entered the wrong address.
curl:
```bash
curl http://127.0.0.1:8000/api/
```
Sample response:
```json
{
  "message": "Welcome to the LLM API!"
}
```
`POST /api/signup/` creates a user.
JSON:
```json
{
  "username": "charming_user",
  "password": "secure_pwd"
}
```
curl:
```bash
curl -X POST http://localhost:8000/api/signup/ \
  -H "Content-Type: application/json" \
  -d '{"username": "charming_user", "password": "secure_pwd"}'
```
Sample response:
```json
{
  "message": "User created successfully"
}
```
`POST /api/token/` returns an access token and a refresh token. The username and password must correspond to a user you've created with the Signup endpoint.
JSON:
```json
{
  "username": "$USERNAME",
  "password": "$PASSWORD"
}
```
curl:
```bash
curl -X POST http://localhost:8000/api/token/ \
  -H "Content-Type: application/json" \
  -d '{"username": "$USERNAME", "password": "$PASSWORD"}'
```
Sample response:
```json
{
  "access": "eyJ0eXAiOiJKV1QiLCJhbGci...",
  "refresh": "eyJ0eXAiOiJKV1QiLCJhbGci..."
}
```
`POST /api/token/refresh/` returns a new access token. Use this if you get an error saying your access token has expired.
JSON:
```json
{
  "refresh": "$REFRESH_TOKEN"
}
```
curl:
```bash
curl -X POST http://localhost:8000/api/token/refresh/ \
  -H "Content-Type: application/json" \
  -d '{"refresh": "$REFRESH_TOKEN"}'
```
Sample response:
```json
{
  "access": "eyJ0eXAiOiJKV1QiLCJhbGci..."
}
```
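Putting the two token endpoints together, a minimal Python client might manage tokens like this (a sketch using the `requests` library; the paths and payloads mirror the curl examples above):

```python
import requests

BASE = "http://localhost:8000/api"

def get_tokens(username, password):
    """POST /api/token/ -- returns {'access': ..., 'refresh': ...}."""
    resp = requests.post(f"{BASE}/token/",
                         json={"username": username, "password": password})
    resp.raise_for_status()
    return resp.json()

def refresh_access(refresh_token):
    """POST /api/token/refresh/ -- returns a fresh access token."""
    resp = requests.post(f"{BASE}/token/refresh/",
                         json={"refresh": refresh_token})
    resp.raise_for_status()
    return resp.json()["access"]

tokens = get_tokens("charming_user", "secure_pwd")
access, refresh = tokens["access"], tokens["refresh"]
# If a later request fails because the access token expired:
access = refresh_access(refresh)
```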
`GET /api/supported-models/` returns a list of models supported by the specified provider (`openai`, `gemini`, or `inhouse`).
curl:
curl "http://localhost:8000/api/supported-models/?provider=gemini" \
-H "Authorization: Bearer $ACCESS_TOKEN"
Sample response:
```json
{
  "models": ["gemini-pro", "gemini-pro-vision", ...]
}
```
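The same request in Python (a sketch with `requests`):

```python
import requests

ACCESS_TOKEN = "eyJ0eXAiOiJKV1QiLCJhbGci..."  # obtained from POST /api/token/

resp = requests.get(
    "http://localhost:8000/api/supported-models/",
    params={"provider": "gemini"},
    headers={"Authorization": f"Bearer {ACCESS_TOKEN}"},
)
print(resp.json()["models"])
```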
`POST /api/chat/completions/` submits a prompt and returns a response from the selected provider (`openai`, `gemini`, or `inhouse`).
JSON:
```json
{
  "model": "gpt-3.5-turbo",
  "messages": [
    {"role": "user", "content": "How is the weather on Venus?"}
  ],
  "stream": false
}
```
curl:
curl -X POST "http://localhost:8000/api/chat/completions/?provider=openai" \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $ACCESS_TOKEN" \
-d '{
"model": "gpt-3.5-turbo",
"messages": [
{"role": "user", "content": "How is the weather on Venus?"}
],
"stream": false
}'
Sample response:
{"id":"chatcmpl-BSpaQfs3CEkCL79NzcNP8gVrIAGsg","choices":[{"finish_reason":"stop","index":0,"logprobs":null,"message":{"content":"The weather on Venus is extremely inhospitable, with temperatures reaching up to 900 degrees Fahrenheit (475 degrees Celsius). Venus also has a thick atmosphere made up mostly of carbon dioxide and clouds of sulfuric acid, creating a runaway greenhouse effect that traps heat and makes it the hottest planet in our solar system. The atmospheric pressure on Venus is about 92 times that of Earth's surface pressure, which is equivalent to being nearly a kilometer underwater on Earth. Additionally, Venus experiences hurricane-force winds that can reach speeds of up to 224 miles per hour (360 kilometers per hour) in its upper atmosphere.","refusal":null,"role":"assistant","annotations":[],"audio":null,"function_call":null,"tool_calls":null}}],"created":1746211182,"model":"gpt-3.5-turbo-0125","object":"chat.completion","service_tier":"default","system_fingerprint":null,"usage":{"completion_tokens":123,"prompt_tokens":14,"total_tokens":137,"completion_tokens_details":{"accepted_prediction_tokens":0,"audio_tokens":0,"reasoning_tokens":0,"rejected_prediction_tokens":0},"prompt_tokens_details":{"audio_tokens":0,"cached_tokens":0}}}
Tip: if you just want to play with the LLM endpoints without dealing with authentication, you can comment out the following lines in the relevant view in `views.py`:
```python
authentication_classes = [JWTAuthentication]
permission_classes = [IsAuthenticated]
```
But be careful!
The API is highly configurable, and adding other providers (or removing them) is straightforward:
- Create a corresponding API key entry in `.env` and update `env-template`
- Set the new API key value near the top of `project/settings.py` (follow the syntax of the other API key entries, e.g. `BLARG_KEY = config('BLARG_KEY')`)
- Add the provider to the `LLM_PROVIDERS` dictionary at the bottom of `project/settings.py` (again, follow the existing syntax)
- Create a provider module in `llm_api/providers/`. If the provider is OpenAI-compatible, the module will be very similar to `llm_api/providers/openai.py` (see the sketch below); otherwise you will have to consult the provider's documentation.
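For an OpenAI-compatible provider, the new module might look roughly like this sketch. The function name, signature, and base URL are assumptions for illustration; the authoritative pattern is whatever `llm_api/providers/openai.py` actually exposes.

```python
# llm_api/providers/blarg.py -- hypothetical module for a fictional
# OpenAI-compatible provider. Mirror the real interface in
# llm_api/providers/openai.py rather than copying this verbatim.
from django.conf import settings
from openai import OpenAI  # OpenAI-compatible APIs can reuse this client


def chat_completion(model, messages, stream=False):
    # Assumed entry point; expose whatever the existing provider modules expose.
    client = OpenAI(
        api_key=settings.BLARG_KEY,
        base_url="https://api.blarg.example/v1",  # the provider's endpoint
    )
    return client.chat.completions.create(
        model=model, messages=messages, stream=stream
    )
```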
Now you will have a new provider, the name of which will be the same as the corresponding key in the `LLM_PROVIDERS` dict of `settings.py`.
To remove a provider, just follow these instructions in reverse! Delete the provider module, any references in `settings.py`, and the `.env` entry.
You can assume that the OpenAI and Gemini servers are up and running, but the in-house LLM may not be. Check with the team before trying to use it!
The app currently includes some basic unit tests covering core functionality. Running these before you push is a good way to make sure you don't break the app! Run tests with `uv run pytest`; analyze test coverage with `uv run coverage run -m pytest`; generate a coverage report with `uv run coverage report`.
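For illustration, a basic test of the Signup endpoint might look like this (hypothetical; it assumes pytest-django and DRF's test client are available, and the expected status code is a guess, so check the project's actual tests):

```python
# Hypothetical example of the kind of unit test meant here.
import pytest
from rest_framework.test import APIClient


@pytest.mark.django_db
def test_signup_creates_user():
    client = APIClient()
    resp = client.post(
        "/api/signup/",
        {"username": "charming_user", "password": "secure_pwd"},
        format="json",
    )
    assert resp.status_code == 201  # assumed; the view may return 200 instead
    assert resp.json()["message"] == "User created successfully"
```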
- For bug reports, please open an issue with a description of what you expected vs what you got
- For suggestions, please open an issue with a description of the desired behavior
- For gratuitous praise, please send a toot to https://fosstodon.org/@jeffjacobson
- Thanks for reading this far!