Academic Paper Figure Analyzer

A tool that automatically analyzes academic papers, extracting and explaining figures and their connections to the research. This tool uses LLMs to provide detailed analysis of each figure and its relationship to the paper's content.

Features

Extracts paper metadata (title, authors, abstract)
Identifies and counts total figures in the paper
Provides detailed analysis of each figure
Generates comprehensive explanations of how figures relate to the research
Parallel processing for efficient analysis of multiple figures
Structured output with separate files for metadata, background, and figure analysis

Prerequisites

Python 3.10+
OpenAI API key

Installation

Clone the repository:

git clone [your-repo-url]
cd [repo-name]

Run the setup script:

python setup_project.py

Create a .env file in the root directory and add your OpenAI API key:

OPENAI_API_KEY=your_api_key_here

Directory Structure

.
├── papers/             # Place your PDF papers here
├── output/            # Analysis results will be saved here
├── paper_analyzer.py  # Main analysis script
├── utils.py          # Utility functions
├── config.py         # Configuration settings
├── templates.py      # Prompt templates
├── setup.py          # Package setup configuration
└── setup_project.py  # Project setup script

Usage

Place your academic paper (PDF format) in the papers/ directory.
Run the analyzer:

python paper_analyzer.py papers/your_paper.pdf [--output-dir custom/output/path]

The script will:

Extract basic paper details (title, authors, abstract)
Count the total number of figures
Analyze each figure in detail
Generate connections between figures and research content
Analyze background information
Save the analysis in separate files under the output directory

Output Structure

The analysis will be saved in the output directory with the following files:

metadata.txt: Paper details (title, authors, abstract, figure count)
background.txt: Detailed background analysis and prerequisites
figures_analysis.txt: For each figure:
- Initial analysis (Information and Connection)
- Expanded analysis with additional context
- Detailed relationships to research content

Configuration

You can modify the following settings in config.py:

PAPER_DIR: Directory for input PDF papers (default: "papers")
OUTPUT_DIR: Directory for saving analysis results (default: "output")
DEFAULT_MODEL: GPT model to use for analysis (default: "gpt-4o-mini")

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

TODO

Acknowledgments

Built with LangChain
Powered by OpenAI's GPT models

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
tests		tests
.gitignore		.gitignore
README.md		README.md
app.py		app.py
config.py		config.py
models.py		models.py
paper_analyzer.py		paper_analyzer.py
playground.ipynb		playground.ipynb
requirements.txt		requirements.txt
setup.py		setup.py
setup_project.py		setup_project.py
templates.py		templates.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Academic Paper Figure Analyzer

Features

Prerequisites

Installation

Directory Structure

Usage

Output Structure

Configuration

Contributing

TODO

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Languages

JewelsHovan/extract-wisdom

Folders and files

Latest commit

History

Repository files navigation

Academic Paper Figure Analyzer

Features

Prerequisites

Installation

Directory Structure

Usage

Output Structure

Configuration

Contributing

TODO

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages