A reference project that demonstrates how to:
- build a Spring Boot 3.5 application on Java 21
- read PDF documents from local storage with the PDF document reader
- transform and chunk them with the TokenTextSplitter
- call OpenAI models through Spring AI
- store and retrieve embeddings in Chroma
- expose simple REST endpoints for data ingestion
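The ETL flow above can be expressed in a few lines with Spring AI's document API. The sketch below is illustrative only; class and resource names such as `IngestionService` and `classpath:/docs/sample.pdf` are assumptions, not taken from this repository:

```java
// Illustrative sketch only: class and resource names are assumptions, not from this repo.
import java.util.List;

import org.springframework.ai.document.Document;
import org.springframework.ai.reader.pdf.PagePdfDocumentReader;
import org.springframework.ai.transformer.splitter.TokenTextSplitter;
import org.springframework.ai.vectorstore.VectorStore;
import org.springframework.stereotype.Service;

@Service
public class IngestionService {

    private final VectorStore vectorStore;

    public IngestionService(VectorStore vectorStore) {
        this.vectorStore = vectorStore;
    }

    public void runIngestion() {
        // Extract: read the PDF into one Document per page
        List<Document> pages = new PagePdfDocumentReader("classpath:/docs/sample.pdf").get();

        // Transform: split pages into token-bounded chunks
        List<Document> chunks = new TokenTextSplitter().apply(pages);

        // Load: embed the chunks (via the configured OpenAI model) and persist them in Chroma
        vectorStore.add(chunks);
    }
}
```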
## Tech Stack

- Spring Boot
- Spring AI
- OpenAI Integration
- Chroma as Vector Store
- Java 21
- Docker
- Maven
## Prerequisites

- JDK 21
- Maven
- OpenAI API key
- IDE (IntelliJ IDEA, Eclipse, or VS Code)
## Environment Variables

All secrets are read from environment variables or the `.env` file in the project root (use the provided `.env.template` as a starting point):
| Variable | Purpose |
|---|---|
| `OPENAI_API_KEY` | Key for OpenAI completions / chat |
Example:

```bash
# .env
OPENAI_API_KEY=sk-********************************
API_NINJAS_KEY=ninjas_********************************
```
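If the key is supplied as an environment variable, one common approach is to reference it from `application.properties` via a property placeholder. This is a sketch of that wiring, not necessarily how this project is configured; the property name is the standard Spring AI OpenAI key:

```properties
# src/main/resources/application.properties
spring.ai.openai.api-key=${OPENAI_API_KEY}
```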
## Getting Started

- Clone the repository:

  ```bash
  git clone [repository-url]
  cd spring-ai-etl-vector-store
  ```

- Configure OpenAI:

  Create `application.properties` in `src/main/resources/` and add:

  ```properties
  spring.ai.openai.api-key=your-api-key-here
  ```

- Build and run:

  ```bash
  docker-compose up
  mvn clean install
  mvn spring-boot:run
  ```

Endpoint: `GET /api/v1/etl/run-ingestion`
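Once the application is up, the ingestion endpoint can be triggered with a plain HTTP call (assuming the default Spring Boot port 8080):

```bash
curl http://localhost:8080/api/v1/etl/run-ingestion
```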
## Project Structure

```text
spring-ai-etl-vector-store/
├── src/
│   ├── main/
│   │   ├── java/
│   │   └── resources/
│   └── test/
└── pom.xml
```
## Building and Testing

```bash
mvn clean install
mvn test
```

## Contributing

- Fork the repository
- Create your feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
## Support

For support and questions, please open an issue in the repository.