Knowledge Graph Synthesis System

A system for transforming unstructured text into structured knowledge through automated graph creation, expansion, and analysis.

Overview

The Knowledge Graph Synthesis System processes text input to extract entities and relationships, builds a knowledge graph, expands it through recursive reasoning, analyzes its structure, creates abstractions, and generates theories and insights. The system supports both Russian and English languages and works with multiple Large Language Model providers.
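As a rough illustration of the "extract, then build a graph" step described above, the sketch below builds a small graph from (subject, relation, object) triples and ranks nodes by degree. It uses only the standard library. The triples, `build_graph`, and `degree` are hypothetical names chosen for this example and are not taken from the project's code.

```python
from collections import defaultdict

# Hypothetical triples, in the shape an extraction step might emit them.
TRIPLES = [
    ("Marie Curie", "discovered", "polonium"),
    ("Marie Curie", "discovered", "radium"),
    ("Marie Curie", "won", "Nobel Prize"),
    ("Pierre Curie", "won", "Nobel Prize"),
]


def build_graph(triples):
    """Return an adjacency map: node -> list of (relation, neighbor) pairs."""
    graph = defaultdict(list)
    for subject, relation, obj in triples:
        graph[subject].append((relation, obj))
        graph[obj]  # touch the key so nodes without outgoing edges still exist
    return dict(graph)


def degree(graph):
    """Count outgoing plus incoming edges for every node."""
    counts = {node: len(edges) for node, edges in graph.items()}
    for edges in graph.values():
        for _, neighbor in edges:
            counts[neighbor] += 1
    return counts


graph = build_graph(TRIPLES)
degrees = degree(graph)
print(max(degrees, key=degrees.get))  # prints "Marie Curie"
```

Degree is only the simplest structural metric; the analysis stage described in this README may well compute others.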

Features

Text Processing: Hierarchical segmentation with contextual summarization
Knowledge Extraction: Entity and relationship extraction with coreference resolution
Graph Management: Creation, storage, and visualization of knowledge graphs
Recursive Reasoning: Autonomous expansion of knowledge graphs through questioning and reasoning
Graph Analysis: Calculation of structural metrics, community detection, and pattern identification
Meta-Graph Creation: Abstraction of concepts into higher-level representations
Theory Formation: Generation and testing of theories and hypotheses
Results Generation: Production of documents, visualizations, and insights
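To make the first feature more concrete, here is a minimal sketch of two-level hierarchical segmentation: text is split into paragraphs, and each paragraph into sentences. The `segment` function and its output shape are assumptions made for illustration; the project's own segmenter may work differently, and the contextual summarization step is left out because it would need an LLM call.

```python
import re


def segment(text):
    """Split text into paragraphs, then split each paragraph into sentences."""
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    tree = []
    for index, paragraph in enumerate(paragraphs):
        # Naive sentence split: break after ., ! or ? when whitespace follows.
        sentences = [s for s in re.split(r"(?<=[.!?])\s+", paragraph) if s]
        tree.append({"id": index, "text": paragraph, "sentences": sentences})
    return tree


sample = "Graphs store entities. Edges store relations.\n\nReasoning adds new edges."
segments = segment(sample)
for node in segments:
    print(node["id"], node["sentences"])
```

The regular-expression split is deliberately naive (it breaks on abbreviations such as "e.g."), so treat it as a placeholder for a real sentence tokenizer.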

Supported LLM Providers

Claude (Anthropic)
GPT (OpenAI)
Gemini (Google)
DeepSeek
Ollama (local models)

Installation

Prerequisites

Python 3.9 or higher
Git

Setup

Clone the repository:

git clone https://github.com/yourusername/knowledge-graph-synthesis.git
cd knowledge-graph-synthesis

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Create a .env file based on the example:

cp .env.example .env
# Edit .env with your API keys and configuration

Usage

Command Line Interface

Process a text file and generate insights:

python src/main.py process --input text.txt --output results/ --language en --provider claude
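For readers wondering how a command of this shape is typically parsed, the following is a sketch using `argparse` from the standard library. It mirrors the flags shown above, but it is not the project's actual `src/main.py`. The `ru` language code and the provider identifiers other than `claude` are assumptions inferred from the lists of supported languages and providers.

```python
import argparse

# Assumed identifiers; only "claude" appears in the command above.
PROVIDERS = ["claude", "gpt", "gemini", "deepseek", "ollama"]


def build_parser():
    """Build a parser with a single 'process' subcommand."""
    parser = argparse.ArgumentParser(prog="main.py")
    subcommands = parser.add_subparsers(dest="command", required=True)
    process = subcommands.add_parser("process", help="Process a text file")
    process.add_argument("--input", required=True, help="Path to the input text file")
    process.add_argument("--output", required=True, help="Directory for results")
    process.add_argument("--language", choices=["en", "ru"], default="en")
    process.add_argument("--provider", choices=PROVIDERS, default="claude")
    return parser


# Parse the same arguments as the command shown above.
args = build_parser().parse_args(
    ["process", "--input", "text.txt", "--output", "results/",
     "--language", "en", "--provider", "claude"]
)
print(args.command, args.input, args.provider)  # prints "process text.txt claude"
```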

Streamlit Interface

Start the interactive Streamlit interface:

python -m streamlit run src/frontend/app.py

Configuration

Configure LLM providers in your .env file:

CLAUDE_API_KEY=your_api_key
GPT_API_KEY=your_api_key
GEMINI_API_KEY=your_api_key
DEEPSEEK_API_KEY=your_api_key
# For Ollama, no API key is needed
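
One way to check which providers are usable given these variables is sketched below. The variable names come from the example above; `KEY_VARIABLES` and `available_providers` are hypothetical helpers, and the sketch assumes the values from .env have already been loaded into the process environment by whatever mechanism the project uses.

```python
import os

# Provider name -> environment variable holding its API key.
KEY_VARIABLES = {
    "claude": "CLAUDE_API_KEY",
    "gpt": "GPT_API_KEY",
    "gemini": "GEMINI_API_KEY",
    "deepseek": "DEEPSEEK_API_KEY",
}


def available_providers(env=None):
    """Return providers with a non-empty key set, plus Ollama (no key needed)."""
    env = os.environ if env is None else env
    ready = [name for name, variable in KEY_VARIABLES.items() if env.get(variable)]
    ready.append("ollama")
    return ready


example_env = {"CLAUDE_API_KEY": "your_api_key", "GPT_API_KEY": ""}
print(available_providers(example_env))  # prints ['claude', 'ollama']
```

Passing a plain dictionary instead of `os.environ` keeps the helper easy to test without touching real credentials.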

Development

Project Structure

src/
├── config/                  # Configuration management
├── text_processing/         # Text processing module
├── knowledge_extraction/    # Knowledge extraction module
├── graph_management/        # Graph management module
├── reasoning/               # Reasoning module
├── analysis/                # Graph analysis module
├── meta_graph/              # Meta-graph module
├── theory_formation/        # Theory formation module
├── results/                 # Results generation module
├── llm/                     # LLM provider integration
├── storage/                 # Storage services
├── utils/                   # Utility services
├── frontend/                # Frontend integration
└── main.py                  # Application entry point

Running Tests

pytest

Contributing

Fork the repository
Create a feature branch: git checkout -b feature/your-feature
Commit your changes: git commit -am 'Add some feature'
Push to the branch: git push origin feature/your-feature
Submit a pull request

Documentation

Detailed documentation is available in the docs/ directory.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

This project was inspired by research in knowledge graph construction and reasoning
Thanks to the developers of all the libraries and tools used in this project

