tg-note

Intelligent Knowledge Base Builder - Telegram bot that automatically transforms your messages, reposts, and articles into a structured knowledge base using AI agents.

📑 Table of Contents

Overview
Key Features
Quick Start
Usage
- Bot Commands
- Working with Content
Agent Types
Architecture
Configuration Reference
Development
- Running Tests
- Code Quality
Deployment
Roadmap
Contributing
License

🎯 Overview

tg-note is a Telegram bot that acts as your personal knowledge curator. It receives messages, reposts, and articles through Telegram, analyzes them using AI agent systems, and automatically saves the important information to your GitHub-based knowledge base in structured Markdown format.

Perfect for:

📚 Building a personal knowledge base from Telegram channels
🔬 Organizing research papers and scientific articles
📰 Archiving news and insights from multiple sources
🧠 Creating a searchable second brain

✨ Key Features

🤖 AI-Powered Analysis: Intelligent content categorization and structuring using agent systems
📝 Automatic Markdown Generation: Converts any content into well-formatted Markdown files
🗂️ Smart Organization: Automatic categorization by topics (AI, biology, physics, tech, etc.)
🔄 GitHub Integration: Direct commits to your knowledge base repository
👥 Multi-User Support: Each user can have their own knowledge base, or multiple users can collaborate on a shared KB with automatic synchronization
🔐 Secure Git Credentials: Per-user encrypted GitHub/GitLab tokens via Telegram (NEW!)
📦 Message Grouping: Intelligently combines related messages into single notes
🔍 Deduplication: Tracks processed messages to avoid duplicates
🎯 Flexible Agents: Choose between stub or Qwen Code CLI agents
⚡ Async Architecture: Fast, non-blocking message processing
⚙️ Telegram Settings Management: Configure bot settings directly via Telegram commands
📄 File Format Recognition: Automatic content extraction from various file formats using Docling

🎮 New: Button-Based Interface

All knowledge base management is now done through interactive buttons!

No more typing commands - Just click buttons in the menu
Guided setup - Step-by-step KB creation and configuration
Easy switching - Switch between multiple knowledge bases with one click
Visual feedback - See your current KB and mode at a glance

Start with /start and follow the buttons!

🚀 Quick Start

Prerequisites

Python 3.11 or 3.12 (Python 3.13 is not supported)
Poetry (Python dependency manager)
Git
Telegram Account
Node.js 20+ (optional, for qwen_code_cli agent)
Docker & Docker Compose (optional, for containerized deployment)

Installation

Clone the repository

git clone https://github.com/ArtyomZemlyak/tg-note.git
cd tg-note

Install Poetry (if not already installed)

curl -sSL https://install.python-poetry.org | python3 -
# or using pipx:
# pipx install poetry

Install dependencies

poetry install

This will automatically create a virtual environment and install all dependencies.

Optional features (extras)

# Enable MCP and mem-agent extras (tooling, memory backends)
poetry install -E mcp -E mem-agent

# Vector search capabilities
poetry install -E vector-search

Alternatively with pip/pipx:

# Editable install with extras
pip install -e ".[mcp,mem-agent,vector-search]"

# Or isolate via pipx
pipx install "."

Configuration

Create configuration files

# Copy example configuration
cp config.example.yaml config.yaml

Get Telegram Bot Token

Open @BotFather in Telegram
Send /newbot and follow instructions
Copy the token provided

Create .env file (for sensitive credentials)

cat > .env << EOF
# Required: Telegram Bot Token
TELEGRAM_BOT_TOKEN=your_bot_token_here

# Optional: API keys for agents
# OPENAI_API_KEY=your_openai_key
# OPENAI_BASE_URL=https://api.openai.com/v1  # Optional, for custom endpoints
# QWEN_API_KEY=your_qwen_key
# ANTHROPIC_API_KEY=your_anthropic_key
# GITHUB_TOKEN=your_github_token
EOF

Configure config.yaml (basic settings)

# Knowledge Base Settings
KB_PATH: ./knowledge_base
KB_GIT_ENABLED: true
KB_GIT_AUTO_PUSH: true
KB_GIT_REMOTE: origin
KB_GIT_BRANCH: main

# Agent Configuration
AGENT_TYPE: "stub"  # Options: stub, autonomous, qwen_code_cli

# Processing Settings
MESSAGE_GROUP_TIMEOUT: 30  # seconds

# Logging
LOG_LEVEL: INFO
LOG_FILE: ./logs/bot.log

# User Access Control
# Accepts empty (allow all), comma-separated (e.g., "123,456"), or JSON list (e.g., [123,456])
ALLOWED_USER_IDS: ""

(Optional) Install Qwen Code CLI for advanced AI processing

# Install Node.js 20+ first, then:
npm install -g @qwen-code/qwen-code@latest

# Authenticate (2000 free requests/day)
qwen

# Update config.yaml
AGENT_TYPE: "qwen_code_cli"

Running the Bot

Start the bot

# Recommended: Use the console script
poetry run tg-note

# Or directly with Python
poetry run python main.py

# If installed with pip (outside Poetry)
tg-note

# Or activate the virtual environment first:
poetry shell
tg-note

You should see:

INFO - Starting tg-note bot...
INFO - Configuration validated successfully
INFO - Processing tracker initialized
INFO - Repository manager initialized
INFO - Telegram bot started successfully
INFO - Bot initialization completed
INFO - Press Ctrl+C to stop

Configure your knowledge base in Telegram

Open your bot in Telegram and:

/start                    # Open interactive main menu

Then click ➕ Создать БЗ (Create KB) button and follow the prompts:

Choose 📁 Локальная БЗ for local KB, or
Choose 🌐 GitHub репозиторий for GitHub-based KB
Enter the KB name or GitHub URL when prompted

Alternative (Legacy Command):

/setkb my-notes           # Create local knowledge base
# or
/setkb https://github.com/username/kb-repo  # Use GitHub repository

Start sending messages!

Just forward any message or write text - the bot will automatically process and save it to your knowledge base.

Stop the bot

Press Ctrl+C in the terminal

📖 Usage

Working Modes

The bot has three working modes that you can switch between:

📝 Note Mode (Default) - `/note`

Analyzes and saves your messages to the knowledge base.

Automatic categorization
Structured markdown generation
Git integration

🤔 Ask Mode - `/ask`

Answers questions about your knowledge base content.

Searches through your KB
Provides answers with sources
Russian language support

🤖 Agent Mode - `/agent`

Full autonomous access to your knowledge base.

Can answer questions (like Ask mode)
Can add/edit/delete content
Can restructure and organize KB
Shows detailed file changes

Switch between modes: /note | /ask | /agent

Bot Commands

Main Interactive Menu:

Command	Description	Example
`/start`	Open interactive main menu with buttons	`/start`
	- 📚 Knowledge Base management
	- 🔄 Mode switching (note/ask/agent)
	- ⚙️ Settings configuration
	- 🔧 MCP server management
	- 💬 Context management
	- ❓ Help and documentation

Knowledge Base (Button-Based):

Command	Description	Example
`/kb`	Open KB management menu	`/kb`
	- Create local or GitHub KB
	- Switch between multiple KBs
	- View KB information
	- Setup MkDocs documentation

Working Modes (via buttons or commands):

Command	Description	Example
`/note`	Switch to note creation mode	`/note`
`/ask`	Switch to question mode	`/ask`
`/agent`	Switch to agent mode (full access)	`/agent`
`/setkb <name\|url>`	Setup knowledge base	`/setkb my-notes` or `/setkb https://github.com/user/repo`
`/kb`	Show current KB info	`/kb`
`/status`	Display processing statistics	`/status`
`/settings`	Open interactive settings menu	Navigate through categories and settings
`/viewsettings [category]`	View all or filtered settings	`/viewsettings knowledge_base`
`/resetsetting <name>`	Reset to default	`/resetsetting AGENT_TIMEOUT`
`/kbsettings`	KB settings quick access	View and modify KB configuration
`/agentsettings`	Agent settings quick access	View and modify agent configuration

⚙️ Settings Management (NEW!)

Per-User Configuration

Each user can customize bot behavior via Telegram commands:

# Open interactive settings menu
/settings
# Navigate through categories → Select setting → Enter new value

# View all settings or filter by category
/viewsettings
/viewsettings knowledge_base

# Reset to global defaults
/resetsetting KB_GIT_AUTO_PUSH

# Quick access to categories
/kbsettings       # Knowledge Base settings
/agentsettings    # Agent configuration

How Settings Work:

Use /settings to open the interactive menu
Choose a category (Knowledge Base, Agent, Processing, etc.)
Click on any setting to view its details
For boolean settings: Click Enable/Disable buttons
For other settings: Send the new value as a text message
Settings include descriptions, types, and allowed values

Features:

✅ Interactive UI: Inline keyboards with category navigation
✅ Type Safety: Automatic validation and type conversion
✅ Per-User Overrides: Each user can customize their settings
✅ Auto-Generated: New settings automatically appear in UI
✅ Secure: Credentials cannot be changed via Telegram

Available Settings Categories:

📚 Knowledge Base (Git, paths, auto-push)
🤖 Agent (model, timeout, tools enabled)
⚙️ Processing (message grouping, deduplication)
📝 Logging (level, file path)

See Settings Management guide for details.

🔐 Secure Git Credentials (NEW!)

Per-User Encrypted Tokens

Each user can securely add personal GitHub/GitLab tokens directly via Telegram:

/settoken           # Add your personal token
/listcredentials    # View saved tokens
/removetoken        # Remove tokens

Security Features:

🔐 AES-128 encryption (Fernet)
🗑️ Auto-delete token messages
👤 Per-user credential storage
🌐 Support for GitHub and GitLab

Read full documentation →

Working with Content

Supported Content Types:

✅ Text messages
✅ Forwarded messages from channels
✅ Photos with captions
✅ Documents (PDF, DOCX, PPTX, XLSX, MD, HTML, TXT)
✅ Automatic file format recognition using Docling
✅ Multiple consecutive messages (auto-grouped)

Processing Workflow:

Send or forward content to the bot
Bot analyzes and categorizes the content
Creates a structured Markdown note
Saves to appropriate category in your KB
Commits to Git (if enabled)
Notifies you of completion

Example Output Structure:

knowledge_base/
├── topics/
│   ├── ai/
│   │   └── 2025-10-02-neural-networks-breakthrough.md
│   ├── biology/
│   │   └── 2025-10-01-crispr-advancement.md
│   └── physics/
│       └── 2025-09-30-quantum-computing.md
└── index.md

🤖 Agent Types

The system supports three types of agents for content processing:

Agent Compatibility Matrix

Feature	AutonomousAgent	QwenCodeCLIAgent	StubAgent
Language	Python	Node.js (subprocess)	Python
MCP Tools	✅ Python MCP	✅ Qwen Native MCP	❌ No
Built-in Tools	✅ Yes	✅ Yes	❌ No
Custom Tools	✅ Easy (Python)	✅ Via MCP Config	❌ No
Free Tier	Provider-dependent	2000/day	✅ Always
Setup Complexity	Medium	Medium-High	Low
AI Quality	High	High	Basic

💡 MCP Support Note: Both AutonomousAgent and QwenCodeCLIAgent support MCP, but with different approaches:

AutonomousAgent: Uses Python MCP client (DynamicMCPTool) - MCP servers managed from Python code

QwenCodeCLIAgent: Uses Qwen's native MCP client - MCP servers configured in .qwen/settings.json as standalone processes

Learn more about MCP integration →

1. autonomous (Recommended for MCP) ✅

Python-based autonomous agent with OpenAI-compatible API support and full MCP support.

Features:

✅ MCP Tools Support - Full access to Model Context Protocol tools
✅ OpenAI-compatible API integration
✅ Autonomous planning and decision-making
✅ Built-in tools: web search, git, github, file management
✅ Function calling support
✅ Works with various LLM providers

Setup:

pip install openai  # included in requirements

Configuration:

AGENT_TYPE: "autonomous"
AGENT_MODEL: "gpt-3.5-turbo"  # or any compatible model
AGENT_ENABLE_MCP: true  # Enable MCP tools
AGENT_ENABLE_WEB_SEARCH: true
AGENT_ENABLE_FILE_MANAGEMENT: true

Environment Variables:

OPENAI_API_KEY=sk-...
OPENAI_BASE_URL=https://api.openai.com/v1  # Optional, for custom endpoints

📚 Detailed Documentation →

2. qwen_code_cli (Best for Free Tier) ✅

Uses Qwen Code CLI for advanced AI processing.

Features:

✅ Full integration with Qwen3-Coder models
✅ Automatic TODO planning
✅ Built-in tools: web search, git, github, shell
✅ Free tier: 2000 requests/day
✅ Vision model support
✅ MCP support via qwen native mechanism (requires .qwen/settings.json)

Setup:

npm install -g @qwen-code/qwen-code@latest
qwen  # authenticate

Configuration:

AGENT_TYPE: "qwen_code_cli"
AGENT_QWEN_CLI_PATH: "qwen"
AGENT_ENABLE_WEB_SEARCH: true
AGENT_ENABLE_MCP: true  # Optional - requires .qwen/settings.json configuration

MCP Configuration (optional, for custom tools via MCP):

Create .qwen/settings.json in your KB directory:

{
  "mcpServers": {
    "mcp-hub": {
      "url": "http://127.0.0.1:8765/sse",
      "timeout": 10000,
      "trust": true,
      "description": "MCP Hub - Built-in memory tools and server registry"
    }
  },
  "allowMCPServers": ["mcp-hub"]
}

ℹ️ MCP Integration: Qwen CLI has built-in MCP support! It can connect to MCP servers configured as standalone processes. This is different from AutonomousAgent's Python MCP client. Learn more →

📚 Detailed Documentation →

3. stub (Testing Only)

Simple stub agent for testing and MVP.

Features:

⚡ Fast and lightweight
🔧 No external dependencies
📋 Basic categorization
🧪 Perfect for testing

Configuration:

AGENT_TYPE: "stub"

Best for: Quick testing, MVP demos, development without API keys

Choosing the Right Agent

Use AutonomousAgent when:

✅ You need MCP tools (memory, custom integrations)
✅ You have an OpenAI-compatible API key
✅ You want Python-native integration
✅ You need custom tool development

Use QwenCodeCLIAgent when:

✅ You want free tier (2000 requests/day)
✅ You need vision model support
✅ You prefer official Qwen integration
✅ You can create standalone MCP servers (if using MCP)

Use StubAgent when:

✅ Testing without API keys
✅ MVP development
✅ Quick prototyping

🏗️ Architecture

System Components

┌─────────────────┐
│  Telegram Bot   │  ← User interface
│ (pyTelegramBotAPI) │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│Message Processor│  ← Grouping & parsing
│  (aggregator)   │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│  Agent System   │  ← AI analysis
│ (qwen/stub)     │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│Knowledge Base   │  ← Markdown files
│   Manager       │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│   Git Ops       │  ← Version control
│ (auto commit)   │
└─────────────────┘

Data Flow

Input: User sends message/repost to Telegram bot
Aggregation: Related messages grouped together (30s timeout)
Parsing: Extract text, media, links, generate hash
Deduplication: Check if already processed
Agent Processing: AI analyzes and structures content
KB Storage: Save as Markdown in appropriate category
Git Commit: Auto-commit to repository
Notification: Inform user of completion

Project Structure

tg-note/
├── config/
│   ├── __init__.py
│   └── settings.py              # Pydantic settings
├── src/
│   ├── bot/
│   │   ├── handlers.py          # Telegram event handlers
│   │   ├── telegram_bot.py      # Main bot class
│   │   └── utils.py             # Helper functions
│   ├── processor/
│   │   ├── message_aggregator.py  # Message grouping
│   │   └── content_parser.py      # Content extraction
│   ├── agents/
│   │   ├── base_agent.py        # Abstract base class
│   │   ├── stub_agent.py        # Simple stub
│   │   ├── autonomous_agent.py  # Python agent (OpenAI-compatible)
│   │   ├── qwen_code_cli_agent.py  # CLI integration
│   │   └── agent_factory.py     # Agent factory
│   ├── knowledge_base/
│   │   ├── manager.py           # KB management
│   │   ├── git_ops.py           # Git operations
│   │   ├── repository.py        # Repo manager
│   │   └── user_settings.py     # User preferences
│   └── tracker/
│       └── processing_tracker.py  # Deduplication
├── tests/                       # Unit tests
├── data/                        # Runtime data (auto-created)
├── logs/                        # Log files
├── config.yaml                  # Main configuration
├── .env                         # Credentials (git-ignored)
├── pyproject.toml               # Project metadata and dependencies
├── poetry.lock                  # Locked dependency versions
└── main.py                      # Entry point

⚙️ Configuration Reference

Environment Variables (`.env`)

# Required
TELEGRAM_BOT_TOKEN=your_token_here

# Optional API Keys
OPENAI_API_KEY=sk-...
OPENAI_BASE_URL=https://api.openai.com/v1  # Optional, for custom endpoints
QWEN_API_KEY=your_qwen_key
ANTHROPIC_API_KEY=sk-ant-...
GITHUB_TOKEN=ghp_...

YAML Configuration (`config.yaml`)

# Knowledge Base
KB_PATH: ./knowledge_base
KB_GIT_ENABLED: true
KB_GIT_AUTO_PUSH: true
KB_GIT_REMOTE: origin
KB_GIT_BRANCH: main

# Agent
AGENT_TYPE: "qwen_code_cli"  # stub, autonomous, qwen_code_cli
AGENT_MODEL: "qwen-max"
AGENT_TIMEOUT: 300
AGENT_ENABLE_WEB_SEARCH: true
AGENT_ENABLE_GIT: true
AGENT_ENABLE_GITHUB: true
AGENT_ENABLE_SHELL: false
AGENT_ENABLE_FILE_MANAGEMENT: true
AGENT_ENABLE_FOLDER_MANAGEMENT: true

# Processing
MESSAGE_GROUP_TIMEOUT: 30
PROCESSED_LOG_PATH: ./data/processed.json

# Logging
LOG_LEVEL: INFO  # DEBUG, INFO, WARNING, ERROR, CRITICAL
LOG_FILE: ./logs/bot.log

# Security
# Accepts empty (allow all), comma-separated (e.g., "123,456"), or JSON list (e.g., [123,456])
ALLOWED_USER_IDS: ""  # Empty = all allowed

Priority: Environment Variables > .env file > config.yaml

📚 Full Configuration Guide →

🛠️ Development

Running Tests

# Run all tests
poetry run pytest

# With coverage report
poetry run pytest --cov=src --cov-report=html

# Specific test file
poetry run pytest tests/test_tracker.py -v

# Or activate shell first:
poetry shell
pytest

Code Quality

# Format code
poetry run black src/ tests/

# Lint
poetry run flake8 src/ tests/

# Type checking
poetry run mypy src/

Project Commands

# Check configuration
poetry run python -c "from config import settings; print(settings)"

# View processing stats
poetry run python -c "from src.tracker.processing_tracker import ProcessingTracker; \
           t = ProcessingTracker('./data/processed.json'); \
           print(t.get_stats())"

# Verify structure
poetry run python verify_structure.py

🚀 Deployment

tg-note offers multiple deployment options to suit different needs and infrastructure requirements.

🐳 Docker Deployment (Recommended)

The project provides comprehensive Docker support with multiple deployment configurations:

Prefer the tested vector-search stack used in day-to-day runs: see the recommended compose example based on docker-compose.vector.my.yml with Infinity + Qdrant + Docling + MCP Hub.

Quick Start

Prerequisites:

Docker and Docker Compose
.env file with required tokens (see Configuration)

Quick Start (All Services Enabled)

# Start all services (vLLM, SGLang, Docling, Qdrant, Infinity)
# IMPORTANT: vLLM and SGLang both use port 8001 - comment out one of them in docker-compose.yml!
docker-compose up -d --build

# View logs
docker-compose logs -f

Customization Options:

Simple Deployment (No GPU Required)

# Comment out vllm-server, sglang-server, qdrant, infinity in docker-compose.yml
docker-compose up -d --build

Use SGLang instead of vLLM

# Comment out vllm-server section in docker-compose.yml
docker-compose up -d --build

Disable Vector Search

# Comment out qdrant and infinity sections in docker-compose.yml
docker-compose up -d --build

Services Overview

Service	Description	GPU Required	Port
bot	Main Telegram bot with Qwen CLI	No	-
mcp-hub	MCP Hub server (memory, tools)	No	8765
vllm-server	LLM inference (mem-agent)	Yes	8001
qdrant	Vector database (optional)	No	6333
infinity	Embedding service (optional)	No	7997

Configuration

Environment Variables (.env):

# Required
TELEGRAM_BOT_TOKEN=your_bot_token_here

# Optional API Keys
OPENAI_API_KEY=sk-...
QWEN_API_KEY=your_qwen_key
ANTHROPIC_API_KEY=sk-ant-...

# Vector Search (if using vector stack)
QDRANT_PORT=6333
INFINITY_MODEL=BAAI/bge-small-en-v1.5

# Docker-specific (auto-configured)
MCP_HUB_URL=http://mcp-hub:8765/sse

Configuration Files:

config.yaml - Main application configuration
.env - Environment variables and secrets
docker-compose*.yml - Service orchestration

Docker Volumes:

./knowledge_base → Bot's knowledge base storage
./data → Processed messages, user settings
./logs → Application logs
~/.qwen → Qwen CLI authentication

Qwen CLI Setup (Inside Container)

# Authenticate Qwen CLI
docker exec -it tg-note-bot bash -lc "qwen"

# Set approval mode for automated operation
docker exec -it tg-note-bot bash -lc "qwen <<<'/approval-mode yolo --project'"

Health Checks

# Check service status
docker ps

# Test MCP Hub health
curl http://localhost:8765/health

# Test vector search (if enabled)
curl http://localhost:6333/healthz  # Qdrant
curl http://localhost:7997/health   # Infinity

Management Commands

# Stop all services
docker-compose down

# Rebuild and restart
docker-compose up -d --build --force-recreate

# View specific service logs
docker-compose logs -f bot
docker-compose logs -f mcp-hub

# Execute commands in running container
docker exec -it tg-note-bot bash
docker exec -it tg-note-hub bash

🔄 Automated Deployment

The project includes automated deployment scripts for production environments:

Git-based Auto-Deploy

Setup:

# Copy and configure auto-deploy
cp scripts/auto_deploy.conf.example scripts/auto_deploy.conf
# Edit configuration paths and options
nano scripts/auto_deploy.conf

# Make script executable
chmod +x scripts/auto_deploy.sh

Cron Setup (every 5 minutes):

# Add to crontab
crontab -e

# Add this line:
*/5 * * * * /bin/bash -lc 'cd /path/to/your/repo && scripts/auto_deploy.sh -c scripts/auto_deploy.conf >> logs/cron.tail 2>&1'

Systemd Timer (alternative):

# Create service and timer files
sudo cp scripts/auto_deploy.service /etc/systemd/system/
sudo cp scripts/auto_deploy.timer /etc/systemd/system/

# Enable and start
sudo systemctl daemon-reload
sudo systemctl enable --now auto-deploy.timer

Safety Features:

Blocks deployment if critical files change (.env, docker-compose.yml, etc.)
Prevents overlapping deployments with file locks
Detailed logging and error reporting

🖥️ Systemd Service (Bare Metal)

For direct host deployment without Docker:

# Create service file
sudo nano /etc/systemd/system/tg-note.service

[Unit]
Description=TG-Note Telegram Bot
After=network.target

[Service]
Type=simple
User=youruser
WorkingDirectory=/path/to/tg-note
ExecStart=/usr/local/bin/poetry run python main.py
Restart=always
RestartSec=10

[Install]
WantedBy=multi-user.target

# Enable and start service
sudo systemctl daemon-reload
sudo systemctl enable tg-note
sudo systemctl start tg-note
sudo systemctl status tg-note

📋 Production Checklist

Before Deployment:

Docker and Docker Compose installed
.env file configured with all required tokens
Knowledge base repository prepared (GitHub or local)
Firewall configured (if exposing ports)
SSL certificates ready (if using HTTPS)

Deployment Steps:

Clone repository and configure environment
Start services: docker-compose up -d --build
Authenticate Qwen CLI (if using qwen_code_cli agent)
Verify health: curl http://localhost:8765/health
Test bot functionality in Telegram
Configure automated deployment (optional)

Security Hardening:

Set ALLOWED_USER_IDS to restrict bot access
Use strong, unique API keys
Enable log rotation
Regular security updates
Backup knowledge base and data directories

Monitoring:

Set up log monitoring
Monitor disk space for knowledge base
Track API usage and costs
Set up alerts for service failures

🔧 Troubleshooting

Common Issues:

Bot not responding:

# Check bot logs
docker-compose logs -f bot

# Verify environment variables
docker exec tg-note-bot env | grep TELEGRAM

MCP Hub connection issues:

# Check MCP Hub health
curl http://localhost:8765/health

# Check MCP Hub logs
docker-compose logs -f mcp-hub

Qwen CLI authentication:

# Re-authenticate
docker exec -it tg-note-bot bash -lc "qwen"

# Check authentication status
docker exec tg-note-bot qwen --version

Vector search not working:

# Check Qdrant health
curl http://localhost:6333/healthz

# Check Infinity health
curl http://localhost:7997/health

# Check vector search logs
docker-compose logs -f

Performance Issues:

Monitor resource usage: docker stats
Check disk space: df -h
Review logs for errors: docker-compose logs --tail=100

🗺️ Roadmap

✅ Completed

✅ Core infrastructure and project structure
✅ Telegram bot with async support
✅ Message aggregation and parsing
✅ Agent system (stub, qwen_code_cli)
✅ Knowledge base management with Git
✅ Multi-user support with personal KBs
✅ Deduplication tracking
✅ Comprehensive test suite
✅ Settings management via Telegram
✅ File format recognition with Docling (NEW!)

🚧 In Progress

🚧 Enhanced error handling and recovery
🚧 CI/CD pipeline improvements
🚧 Advanced monitoring and alerting

📋 Planned

📋 Enhanced vision model support for image analysis
📋 Audio and video file processing
📋 Web interface for KB browsing
📋 Vector database for semantic search
📋 PostgreSQL storage option
📋 Backup and restore system
📋 Advanced analytics and metrics
📋 Batch message processing
📋 Real-time streaming results
📋 Custom agent plugins
📋 Multi-language support

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Guidelines

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

Development Setup

# Install all dependencies (including dev)
poetry install

# Activate virtual environment
poetry shell

# Run tests before committing
poetry run pytest

# Format code
poetry run black src/ tests/

📚 Documentation

📖 Complete Documentation

View Full Documentation →

Our comprehensive documentation is hosted on GitHub Pages and includes:

Getting Started - Installation, configuration, and first steps
User Guide - Commands, content management, and settings
Agent System - AI agents, tools, and autonomous processing
Architecture - System design and component details
Development - Contributing, testing, and code quality
Deployment - Production setup, Docker, and CI/CD

Quick Links

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

This project is built on top of amazing open source projects and wouldn't be possible without them. Special thanks to:

🤖 AI & Agent Systems

Qwen Agent - AI agent framework powering intelligent content processing
OpenAI Python - Official Python library for OpenAI API (used for LLM integrations)
Sentence Transformers - State-of-the-art embeddings for semantic search
Hugging Face Hub - Model management and distribution

💬 Telegram Bot Framework

pyTelegramBotAPI - Powerful and simple Telegram Bot API library

📄 Document Processing

Docling - Advanced document parsing and content extraction
PyYAML - YAML parser for configuration management

🔧 Core Infrastructure

Pydantic - Data validation and settings management
GitPython - Git repository automation and version control
Loguru - Modern and elegant logging solution
Cryptography - Secure credentials encryption

🔍 Vector Search & Memory

FAISS - Efficient similarity search and clustering
Qdrant Client - Vector database client for semantic search
FastMCP - Model Context Protocol implementation

🌐 Network & HTTP

aiohttp - Asynchronous HTTP client/server framework
Requests - HTTP library for Python

🧪 Development & Testing

pytest - Testing framework
pytest-asyncio - Async testing support
Black - Code formatting
pre-commit - Git hooks framework

🐳 Deployment

Docker - Containerization platform
Docker Compose - Multi-container orchestration

Thank you to all contributors and maintainers of these projects! 🙌

📞 Support & Contact

Built with ❤️ by Artem Zemliak

⭐ Star this repository if you find it helpful!

Name		Name	Last commit message	Last commit date
Latest commit History 797 Commits
.cursor/.agent-tools		.cursor/.agent-tools
.github		.github
config		config
data		data
docker/docling-mcp		docker/docling-mcp
docs_site		docs_site
examples		examples
logs		logs
scripts		scripts
src		src
tests		tests
.cursorrules		.cursorrules
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.markdownlint.json		.markdownlint.json
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
Dockerfile.bot		Dockerfile.bot
Dockerfile.hub		Dockerfile.hub
LICENSE		LICENSE
README.md		README.md
RELEASE_NOTES_0.2.3.md		RELEASE_NOTES_0.2.3.md
config.example.yaml		config.example.yaml
docker-compose.override.yml.example		docker-compose.override.yml.example
docker-compose.vector.my.yml		docker-compose.vector.my.yml
docker-compose.yml		docker-compose.yml
main.py		main.py
mkdocs.yml		mkdocs.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-docs.txt		requirements-docs.txt

License

ArtyomZemlyak/tg-note

Folders and files

Latest commit

History

Repository files navigation

tg-note

📑 Table of Contents

🎯 Overview

✨ Key Features

🎮 New: Button-Based Interface

🚀 Quick Start

Prerequisites

Installation

Configuration

Running the Bot

📖 Usage

Working Modes

📝 Note Mode (Default) - /note

🤔 Ask Mode - /ask

🤖 Agent Mode - /agent

Bot Commands

⚙️ Settings Management (NEW!)

🔐 Secure Git Credentials (NEW!)

Working with Content

🤖 Agent Types

Agent Compatibility Matrix

1. autonomous (Recommended for MCP) ✅

2. qwen_code_cli (Best for Free Tier) ✅

3. stub (Testing Only)

Choosing the Right Agent

🏗️ Architecture

System Components

Data Flow

Project Structure

⚙️ Configuration Reference

Environment Variables (.env)

YAML Configuration (config.yaml)

🛠️ Development

Running Tests

Code Quality

Project Commands

🚀 Deployment

🐳 Docker Deployment (Recommended)

Quick Start

Services Overview

Configuration

Qwen CLI Setup (Inside Container)

Health Checks

Management Commands

🔄 Automated Deployment

Git-based Auto-Deploy

🖥️ Systemd Service (Bare Metal)

📋 Production Checklist

🔧 Troubleshooting

🗺️ Roadmap

✅ Completed

🚧 In Progress

📋 Planned

🤝 Contributing

Guidelines

Development Setup

📚 Documentation

📖 Complete Documentation

Quick Links

📄 License

🙏 Acknowledgments

🤖 AI & Agent Systems

💬 Telegram Bot Framework

📄 Document Processing

🔧 Core Infrastructure

🔍 Vector Search & Memory

🌐 Network & HTTP

🧪 Development & Testing

🐳 Deployment

📞 Support & Contact

About

Resources

License

Uh oh!

📝 Note Mode (Default) - `/note`

🤔 Ask Mode - `/ask`

🤖 Agent Mode - `/agent`

Environment Variables (`.env`)

YAML Configuration (`config.yaml`)

Packages