
Semantic Vault β€” AI-Powered Semantic Search for Your Markdown Notes

πŸ” Ask questions to your knowledge base with OpenAI, Gemini, or Local LLMs
πŸ—‚οΈ Tag Generator Included β€” Perfect for Obsidian Vaults


πŸ“– Overview

Semantic Vault brings semantic search to your Markdown and text notes, optimized for Obsidian users but adaptable to any folder with .md or .txt files. Ask natural language questions and get relevant answers based on your notes.

Includes an AI-powered tag generator to enrich your notes automatically β€” great for organizing Obsidian vaults.



πŸš€ Features

βœ… Semantic Search with Embeddings β€” Find relevant notes intelligently, no file limits!
βœ… AI-Powered Semantic Search (Chat interface)
βœ… Beautiful Web Interface β€” Modern browser-based UI
βœ… Supports OpenAI, Gemini, and Ollama Local LLMs
βœ… CPU-Only Embeddings β€” No GPU required, runs efficiently on any machine
βœ… Markdown & Text File Support (.md and .txt files)
βœ… AI Tag Generation Script β€” YAML Compatible
βœ… Easy Setup with requirements.txt


πŸ› οΈ Requirements

  • Python 3.9+
  • Dependencies (use the provided requirements.txt)
pip3 install -r requirements.txt
  • No GPU Required β€” Semantic search runs efficiently on CPU
  • Optional: Ollama, for running local models offline (see the Ollama section below)

πŸ“‚ Project Structure

semantic-vault/
β”œβ”€β”€ semanticVault.py      # Main Semantic Search Script
β”œβ”€β”€ auto_tag_generation.py # AI Tag Generator for Markdown Notes
β”œβ”€β”€ templates/
β”‚   └── index.html        # Web UI template
β”œβ”€β”€ .embeddings_cache/    # Cached embeddings (auto-generated)
β”œβ”€β”€ requirements.txt      # Python dependencies
β”œβ”€β”€ .env                  # API keys configuration (create this)
└── README.md

πŸ”§ Setup

1. Clone the Repository

git clone https://github.com/renantmagalhaes/semantic-vault.git
cd semantic-vault

2. Install Dependencies

pip3 install -r requirements.txt

3. Configure Environment

Create a .env file:

OPENAI_API_KEY="your_openai_key_here"
GEMINI_API_KEY="your_gemini_key_here"

Only required if using OpenAI or Gemini models.
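For reference, this is roughly what loading those keys involves. A stdlib-only sketch of a minimal `.env` loader (the script itself may use a library such as python-dotenv; this is illustrative only):

```python
import os
from pathlib import Path

def load_env(path=".env"):
    """Minimal .env loader (illustrative): read KEY="value" lines into os.environ."""
    for line in Path(path).read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Variables already set in the environment take precedence over the file.
        os.environ.setdefault(key.strip(), value.strip().strip('"'))
```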


πŸ’‘ Semantic Search Usage

1. Set Your Vault Path

Edit semanticVault.py:

VAULT_PATH = "/path/to/your/obsidian/vault"

2. Configure Semantic Search (Optional)

Semantic search is enabled by default and works with unlimited files! Edit semanticVault.py:

USE_SEMANTIC_SEARCH = True  # Enable semantic search (recommended)
TOP_K_NOTES = 10            # Number of most relevant notes to use (adjust as needed)

Benefits:

  • βœ… No file limits β€” Works with vaults of any size
  • βœ… Cost efficient β€” Only sends relevant notes to LLM (saves API costs)
  • βœ… Better accuracy β€” Finds semantically relevant notes, not random ones
  • βœ… CPU-only β€” No GPU required, runs on any machine
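To illustrate the idea behind TOP_K_NOTES, here is a sketch of top-K retrieval by cosine similarity over note embeddings, written with NumPy. This is an assumed illustration; the actual ranking code in semanticVault.py may differ.

```python
import numpy as np

def top_k_notes(query_vec, note_vecs, k=10):
    """Return indices of the k notes most similar to the query embedding."""
    q = query_vec / np.linalg.norm(query_vec)
    n = note_vecs / np.linalg.norm(note_vecs, axis=1, keepdims=True)
    sims = n @ q                       # cosine similarity of each note to the query
    return np.argsort(sims)[::-1][:k]  # highest similarity first
```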

3. Choose Your AI Model

In semanticVault.py:

USE_MODEL = "openai"  # Options: "openai", "gemini", "ollama"

Note: Semantic search works with all three providers! The embeddings are generated locally using CPU, then only the most relevant notes are sent to your chosen LLM.

4. Run the Search Tool

CLI Mode (Command Line):

python3 ./semanticVault.py

Ask your question in the terminal, get AI-driven answers based on your notes.

Web Interface Mode (Recommended):

python3 ./semanticVault.py --web

Then open your browser to http://localhost:5000 to access the beautiful web interface!

The web UI features:

  • 🎨 Modern, responsive design with gradient themes
  • πŸ’¬ Chat-style interface for natural conversations
  • πŸ“Š Real-time statistics (note count, model type)
  • ⚑ Smooth animations and loading indicators
  • πŸ“± Mobile-friendly responsive layout

How Semantic Search Works

  1. First Run: Generates embeddings for all your notes (one-time, ~1-5 minutes depending on vault size)

    • Embeddings are cached automatically in .embeddings_cache/
    • Uses lightweight CPU-only model (all-MiniLM-L6-v2)
  2. Subsequent Queries:

    • Finds the top K most relevant notes using semantic similarity
    • Only sends those relevant notes to the LLM (saves costs!)
    • Typically finds results in <1 second
  3. Automatic Updates: Embeddings are regenerated only when notes change

Performance:

  • First run: 10-30 seconds (small vault) to 2-5 minutes (large vault)
  • Subsequent queries: Near-instant (uses cached embeddings)
  • No GPU needed: Runs efficiently on CPU
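The caching behaviour described above can be sketched as follows. This is a simplified, assumed illustration keyed on a hash of each note's content, not the project's actual cache format:

```python
import hashlib
import pickle
from pathlib import Path

CACHE_DIR = Path(".embeddings_cache")

def cached_embedding(note_path, embed_fn):
    """Return the embedding for a note, re-embedding only when its content changes."""
    CACHE_DIR.mkdir(exist_ok=True)
    text = Path(note_path).read_text(encoding="utf-8")
    key = hashlib.sha256(text.encode("utf-8")).hexdigest()
    cache_file = CACHE_DIR / f"{key}.pkl"
    if cache_file.exists():                      # unchanged note: reuse cached vector
        return pickle.loads(cache_file.read_bytes())
    vec = embed_fn(text)                         # changed or new note: embed once
    cache_file.write_bytes(pickle.dumps(vec))
    return vec
```

Because the cache key is derived from the note's content, editing a note changes its key and triggers a fresh embedding on the next query, while untouched notes are never re-embedded.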

🏷️ AI Tag Generator for Notes

Enrich your notes with relevant, AI-suggested tags.

Usage

python3 ./auto_tag_generation.py

Optional flags:

Flag       Description
--dry-run  Preview changes; no files are modified
--force    Overwrite all existing tags with new ones

Edit auto_tag_generation.py:

VAULT_PATH = "/path/to/your/obsidian/vault"
USE_MODEL = "openai"  # or "gemini" or "ollama"

βœ… Tags are inserted in YAML frontmatter β€” ideal for Obsidian users.
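As an illustration of that YAML-frontmatter insertion, here is a simplified sketch. It is assumed, not the script's actual implementation, and it does not merge with a pre-existing tags key:

```python
def add_tags_to_frontmatter(text, tags):
    """Insert a YAML tags list into a note, creating the frontmatter block if absent."""
    tag_block = "tags:\n" + "\n".join(f"  - {t}" for t in tags)
    if text.startswith("---\n"):
        # Existing frontmatter: insert just before the closing delimiter.
        end = text.index("\n---", 4)
        return text[:end] + "\n" + tag_block + text[end:]
    # No frontmatter yet: create a new block at the top of the note.
    return f"---\n{tag_block}\n---\n{text}"
```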


πŸ–₯️ Ollama Local LLM (Optional)

Prefer privacy or offline capabilities? Run fully local models with Ollama.

Install Ollama

curl -fsSL https://ollama.com/install.sh | sh

Works on Linux and macOS; see ollama.com for the latest installation guides.

Pull Models

Examples:

ollama pull mistral
ollama pull llama3

Then edit semanticVault.py:

USE_MODEL = "ollama"
OLLAMA_MODEL = "mistral"

This lets you run lightweight, privacy-friendly models entirely on your own machine.
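Under the hood, talking to a local Ollama instance is a plain HTTP call to its /api/generate endpoint. A stdlib-only sketch of building that request (how semanticVault.py actually calls Ollama may differ):

```python
import json
import urllib.request

def ollama_request(prompt, model="mistral", host="http://localhost:11434"):
    """Build a POST request for Ollama's /api/generate endpoint (non-streaming)."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")
    return urllib.request.Request(
        f"{host}/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
    )

# To actually send it (requires a running Ollama server):
#   with urllib.request.urlopen(ollama_request("Summarize my notes")) as resp:
#       answer = json.load(resp)["response"]
```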


🌟 Future Plans

  • Full-featured Obsidian Plugin (Separate project)
  • Persistent chat mode to refine questions without losing context
  • More advanced tag generation modes
  • CLI improvements and advanced filters
  • Custom embedding models and fine-tuning options

πŸ“’ Contributing

Open to contributions β€” PRs, issues, suggestions welcome!


πŸ“œ License

MIT License β€” Free to use, modify, and distribute.


πŸ€– Acknowledgments

  • OpenAI
  • Google Gemini
  • Ollama
  • Sentence Transformers (for semantic search embeddings)
  • Inspired by Obsidian Copilot

✨ Stay in Control of Your Knowledge β€” Search Smarter
