neo4j-graphrag-python/docs/source/index.rst at 95848ffca4146d2b76b62f05d3cef6024fa08f47 · neo4j/neo4j-graphrag-python

GraphRAG for Python

This package contains the official Neo4j GraphRAG features for Python.

The purpose of this package is to provide a first party package to developers, where Neo4j can guarantee long term commitment and maintenance as well as being fast to ship new features and high performing patterns and methods.

⚠️ This package is a renamed continuation of neo4j-genai. The package neo4j-genai is deprecated and will no longer be maintained. We encourage all users to migrate to this new package to continue receiving updates and support.

Neo4j versions supported:

Neo4j >=5.18.1
Neo4j Aura >=5.18.0
Neo4j 2026.01+ (enables SEARCH clause with in-index filtering)

Python versions supported:

Python 3.14
Python 3.13
Python 3.12
Python 3.11
Python 3.10

.. toctree::
    :maxdepth: 3
    :caption: Contents:
    :hidden:

    Introduction <self>
    user_guide_rag.rst
    user_guide_kg_builder.rst
    user_guide_pipeline.rst
    api.rst
    types.rst

Usage

Installation

This package requires Python (>=3.10).

To install the latest stable version, use:

pip install neo4j-graphrag

Note

It is always recommended to install python packages for user space in a virtual environment.

Optional Dependencies

Extra dependencies can be installed with:

pip install "neo4j-graphrag[openai]"

List of extra dependencies:

LLM providers (at least one is required for RAG and KG Builder Pipeline):
- ollama: LLMs from Ollama
- openai: LLMs from OpenAI (including AzureOpenAI)
- google: LLMs from Vertex AI
- cohere: LLMs from Cohere
- anthropic: LLMs from Anthropic
- mistralai: LLMs from MistralAI
sentence-transformers : to use embeddings from the sentence-transformers Python package
Vector database (to use :ref:`External Retrievers`):
- weaviate: store vectors in Weaviate
- pinecone: store vectors in Pinecone
- qdrant: store vectors in Qdrant
experimental: experimental features mainly from the Knowledge Graph creation pipelines.
nlp: installs spaCy for nlp pipelines, used by SpaCySemanticMatchResolver component from the Knowledge Graph creation pipelines.
fuzzy-matching: installs rapidfuzz to fuzzy matching using string similarity, used by FuzzyMatchResolver component from the Knowledge Graph creation pipelines.

Note

The `nlp` extra (spaCy) is currently not supported on Python 3.14 due to an upstream spaCy import-time issue (see spaCy #13895). Use Python 3.13 or earlier for spaCy-based features until that is resolved upstream.

Examples

Creating a vector index

When creating a vector index, make sure you match the number of dimensions in the index with the number of dimensions the embeddings have.

See :ref:`the API documentation<create-vector-index>` for more details.

from neo4j import GraphDatabase
from neo4j_graphrag.indexes import create_vector_index

URI = "neo4j://localhost:7687"
AUTH = ("neo4j", "password")

INDEX_NAME = "vector-index-name"

# Connect to Neo4j database
driver = GraphDatabase.driver(URI, auth=AUTH)

# Creating the index
create_vector_index(
    driver,
    INDEX_NAME,
    label="Document",
    embedding_property="vectorProperty",
    dimensions=1536,
    similarity_fn="euclidean",
)

Note

Assumed Neo4j is running

On Neo4j 2026.01+, you can also specify filterable_properties to enable in-index filtering with the SEARCH clause. See :ref:`filterable-index-creation` for details.

Populating the Neo4j Vector Index

Note that the below example is not the only way you can upsert data into your Neo4j database. For example, you could also leverage the Neo4j Python driver.

from neo4j import GraphDatabase
from neo4j_graphrag.indexes import upsert_vectors
from neo4j_graphrag.types import EntityType

URI = "neo4j://localhost:7687"
AUTH = ("neo4j", "password")

# Connect to Neo4j database
driver = GraphDatabase.driver(URI, auth=AUTH)

# Upsert the vector
vector = ...
upsert_vectors(
    driver,
    ids=["1234"],
    embedding_property="vectorProperty",
    embeddings=[vector],
    entity_type=EntityType.NODE,
)

Note

Assumed Neo4j is running with a defined vector index

Performing a similarity search

While the library has more retrievers than shown here, the following examples should be able to get you started.

from neo4j import GraphDatabase
from neo4j_graphrag.embeddings.openai import OpenAIEmbeddings
from neo4j_graphrag.retrievers import VectorRetriever

URI = "neo4j://localhost:7687"
AUTH = ("neo4j", "password")

INDEX_NAME = "vector-index-name"

# Connect to Neo4j database
driver = GraphDatabase.driver(URI, auth=AUTH)

# Create Embedder object
# Note: An OPENAI_API_KEY environment variable is required here
embedder = OpenAIEmbeddings(model="text-embedding-3-large")

# Initialize the retriever
retriever = VectorRetriever(driver, INDEX_NAME, embedder)

# Run the similarity search
query_text = "How do I do similarity search in Neo4j?"
response = retriever.search(query_text=query_text, top_k=5)

Note

Assumed Neo4j is running with populated vector index in place.

Limitations

The query over the vector index is an approximate nearest neighbor search and may not give exact results. See this reference for more details.

Development

Install dependencies

uv sync --all-extras

Getting started

Issues

If you have a bug to report or feature to request, first search to see if an issue already exists. If a related issue doesn't exist, please raise a new issue using the relevant issue form.

If you're a Neo4j Enterprise customer, you can also reach out to Customer Support.

If you don't have a bug to report or feature request, but you need a hand with the library; community support is available via Neo4j Online Community and/or Discord.

Make changes

Fork the repository.
Install Python and uv.
Create a working branch from main and start with your changes!

Pull request

When you're finished with your changes, create a pull request, also known as a PR.

Ensure that you have signed the CLA.
Ensure that the base of your PR is set to main.
Don't forget to link your PR to an issue if you are solving one.
Enable the checkbox to allow maintainer edits so that maintainers can make any necessary tweaks and update your branch for merge.
Reviewers may ask for changes to be made before a PR can be merged, either using suggested changes or normal pull request comments. You can apply suggested changes directly through the UI, and any other changes can be made in your fork and committed to the PR branch.
As you update your PR and apply changes, mark each conversation as resolved.

Run tests

Run the tests using uv.

uv run pytest

Unit tests

This should run out of the box once the dependencies are installed.

uv run pytest tests/unit

E2E tests

To run e2e tests you'd need to have some services running locally:

neo4j
weaviate
weaviate-text2vec-transformers

The easiest way to get it up and running is via Docker compose:

docker compose -f tests/e2e/docker-compose.yml up

Note

If you suspect something in the databases are cached, run docker compose -f tests/e2e/docker-compose.yml down to remove them completely

Once the services are running, execute the following command to run the e2e tests.

uv run pytest tests/e2e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GraphRAG for Python

Topics

Usage

Installation

Optional Dependencies

Examples

Creating a vector index

Populating the Neo4j Vector Index

Performing a similarity search

Limitations

Development

Install dependencies

Getting started

Issues

Make changes

Pull request

Run tests

Unit tests

E2E tests

Further information

FilesExpand file tree

index.rst

Latest commit

History

index.rst

File metadata and controls

GraphRAG for Python

Topics

Usage

Installation

Optional Dependencies

Examples

Creating a vector index

Populating the Neo4j Vector Index

Performing a similarity search

Limitations

Development

Install dependencies

Getting started

Issues

Make changes

Pull request

Run tests

Unit tests

E2E tests

Further information