ArXiv Paper Summarizer

A GitHub Actions workflow that fetches recent ArXiv papers, filters them using LLMs, summarizes them, and publishes the results as GitHub Issues.

Features

Smart Filtering: Uses LLMs (e.g., GPT-4.1-mini) to score paper relevance (0-10) based on your keywords.
Concise Summaries: Generates high-quality summaries using capable models (e.g., GPT-4.1).
Flexible LLM Support: Works with GitHub Models (default), OpenAI, Azure OpenAI, or any OpenAI-compatible API.
Incremental Fetching: Only fetches papers published since the last run to avoid duplicates.
Daily Schedule: Runs automatically every day at 07:00 UTC.
Reading List: Mark papers to read later with a checkbox, automatically tracked in a dedicated issue.

Configuration

Edit config.yaml to customize your preferences:

arxiv:
  categories:
    - "cs.AI"
    - "cs.LG"
  keywords:
    - "LLM"
    - "Agent"
  max_results: 20

github:
  usernames:
    - "your-username" # Users to tag in the issue
  issue_label: "arxiv-summary"

llm_service:
  base_url: "https://models.github.ai/inference"
  # API key is read from LLM_API_KEY env var, falling back to GITHUB_TOKEN if not set

models:
  filter: "gpt-5-mini"
  summarize: "gpt-5"

Using Other LLM Providers

The tool supports any OpenAI-compatible API. Configure llm_service in config.yaml:

# OpenAI
llm_service:
  base_url: "https://api.openai.com/v1"

# Azure OpenAI
llm_service:
  base_url: "https://<resource>.openai.azure.com/openai/deployments/<deployment>"

Set your API key via environment variable:

LLM_API_KEY - Your provider's API key (falls back to GITHUB_TOKEN if not set)
LLM_BASE_URL - Optionally override the base URL

GitHub Actions

ArXiv Summarizer

The workflow is defined in .github/workflows/summarize.yml. It is configured to run:

Daily at 07:00 UTC.
Manually via the "Run workflow" button in the Actions tab.

Using Custom LLM Providers in GitHub Actions

To use a different LLM provider in GitHub Actions, add these repository secrets:

LLM_BASE_URL - The API endpoint (e.g., https://api.openai.com/v1)
LLM_API_KEY - Your provider's API key

If these secrets are not set, the workflow defaults to GitHub Models with GITHUB_TOKEN.

Reading List

Each paper summary includes a "📚 Read Later" checkbox. When you check this box:

A GitHub workflow detects the change (.github/workflows/reading-list.yml)
The paper title and ArXiv link are automatically added to a 📚 ArXiv Reading List issue
The reading list issue is created automatically if it doesn't exist (labeled reading-list)

This lets you quickly bookmark interesting papers while reviewing the daily digest, with all your selections tracked in one place.

Local Development & Testing

You can run the tool locally without GitHub Actions.

Prerequisites

Install uv (or use pip).
A GitHub Personal Access Token (PAT) with repo scope (for creating issues) and access to GitHub Models.

Setup

# Install dependencies
uv sync

Running Locally

Create a .env file in the root directory:

GITHUB_TOKEN=your_fine_grained_token
GITHUB_REPOSITORY=owner/repo

# Optional: Use a different LLM provider
# LLM_BASE_URL=https://api.openai.com/v1
# LLM_API_KEY=your_openai_key

Run the summarizer:
```
uv run src/main.py
```

Token Permissions (Fine-grained):

Issues: Read and Write (to create summaries and check last run).
Models: Read (to access GitHub models.)

Note

When running locally, the tool will try to create a real issue in the specified repository. If you just want to test the fetching/summarizing logic without creating an issue, you can modify src/main.py or src/issue_creator.py temporarily.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
.github/workflows		.github/workflows
.vscode		.vscode
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ArXiv Paper Summarizer

Features

Configuration

Using Other LLM Providers

GitHub Actions

ArXiv Summarizer

Using Custom LLM Providers in GitHub Actions

Reading List

Local Development & Testing

Prerequisites

Setup

Running Locally

About

Uh oh!

Contributors

Uh oh!

Languages

License

matouskozak/arxiv-digest

Folders and files

Latest commit

History

Repository files navigation

ArXiv Paper Summarizer

Features

Configuration

Using Other LLM Providers

GitHub Actions

ArXiv Summarizer

Using Custom LLM Providers in GitHub Actions

Reading List

Local Development & Testing

Prerequisites

Setup

Running Locally

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages