Research Master MCP supports 28 academic research sources with different pricing models, API requirements, and capabilities.
| Source | Search | Download | Read* | Citations | DOI Lookup | Author Search |
|---|---|---|---|---|---|---|
| arXiv | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Semantic Scholar | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| OpenAlex | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| PubMed | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
| PMC | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
| bioRxiv | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
| medRxiv | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
| HAL | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
| DBLP | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
| CrossRef | ❌ | ❌ | ❌ | ❌ | ✅ | ❌ |
| IACR ePrint | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
| SSRN | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
| CORE | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ |
| EuropePMC | ✅ | ❌ | ✅ | ❌ | ❌ | ❌ |
| Dimensions | ✅ | ❌ | ❌ | ✅ | ❌ | ❌ |
| IEEE Xplore | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
| Zenodo | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ |
| Unpaywall | ❌ | ✅ | ❌ | ❌ | ✅ | ❌ |
| MDPI | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ |
| JSTOR | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
| SciSpace | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ |
| ACM DL | ✅ | ❌ | ❌ | ❌ | ✅ | ❌ |
| Connected Papers | ✅ | ❌ | ❌ | ✅ | ❌ | ❌ |
| DOAJ | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ |
| WorldWideScience | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
| OSF Preprints | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ |
| BASE | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ |
| Springer | ✅ | ❌ | ❌ | ❌ | ✅ | ❌ |
| Google Scholar | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
*PDF text extraction requires poppler/libpoppler to be installed on your system.
| Source | Pricing | API Key Required | Free Tier | Rate Limits (no key) | Notes |
|---|---|---|---|---|---|
| arXiv | Free | No | Unlimited | 1 request/sec | Open access preprints |
| Semantic Scholar | Freemium | Optional | Yes | 1 request/sec | 100 papers/day free; API key increases limits |
| OpenAlex | Free | Optional | Unlimited | 10 requests/sec | Email recommended for polite pool |
| PubMed | Free | No | Unlimited | 3 requests/sec | NIH funded, no API key needed |
| PMC | Free | No | Unlimited | 3 requests/sec | Full-text biomedical literature |
| bioRxiv | Free | No | Unlimited | 1 request/sec | Biology preprints |
| HAL | Free | No | Unlimited | 1 request/sec | French open access repository |
| DBLP | Free | No | Unlimited | 1 request/sec | Computer science bibliography only |
| CrossRef | Free | No | Unlimited | 1 request/sec | DOI lookup only, no full search |
| IACR ePrint | Free | No | Unlimited | 1 request/sec | Cryptology research only |
| SSRN | Free | No | Limited | 1 request/sec | Social sciences preprints |
| CORE | Freemium | Optional | Yes | 1 request/sec | Aggregates open access papers |
| EuropePMC | Free | No | Unlimited | 1 request/sec | Biomedical literature, indexes PubMed/PMC |
| Dimensions | Free | No | Unlimited | 1 request/sec | Research metrics and discovery |
| IEEE Xplore | Paid | Required* | Yes | 3 requests/sec | Free API key from developer.ieee.org |
| Zenodo | Free | No | Unlimited | 1 request/sec | CERN-hosted open science repository |
| Unpaywall | Free | No | Unlimited | 1 request/sec | Find open access versions of papers |
| MDPI | Free | No | Unlimited | 1 request/sec | Open access publisher |
| JSTOR | Paid | Required* | Limited | 1 request/sec | Historical journals, requires institutional access |
| SciSpace | Freemium | Optional | Yes | 1 request/sec | Research discovery platform |
| ACM DL | Paid | Required* | Limited | 3 requests/sec | Computer science, free API key available |
| Connected Papers | Free | No | Unlimited | 1 request/sec | Related paper discovery |
| DOAJ | Free | No | Unlimited | 1 request/sec | Quality open access journals |
| WorldWideScience | Free | No | Unlimited | 1 request/sec | Global science gateway |
| OSF Preprints | Free | No | Unlimited | 1 request/sec | Open Science Framework |
| BASE | Free | No | Unlimited | 1 request/sec | Bielefeld Academic Search Engine |
| Springer | Paid | Required* | Limited | 1 request/sec | Academic publisher, some open access |
| Google Scholar | Free | No | Limited | 1 request/sec | Disabled by default, scraping-based |
*Requires API key for full access. Free API keys are available but have rate limits.
The following sources are completely free to use without any API key requirements.
arXiv is the premier open access preprint repository for physics, mathematics, computer science, and related fields. Operated by Cornell University, it provides unlimited access to millions of preprints without any API key or registration requirements.
PubMed and PMC are NIH-funded databases providing free access to biomedical and life sciences literature. PubMed contains abstracts and citations, while PMC provides full-text articles. Both are completely free and require no API key.
OpenAlex is an open infrastructure funded by various academic organizations, providing a comprehensive catalog of scholarly papers, authors, institutions, and more. While completely free, providing an email address via OPENALEX_EMAIL gives access to the "polite pool" with better rate limits.
HAL is France's national open access repository, providing free access to academic papers from French institutions. It requires no API key and offers unlimited queries.
DBLP specializes in computer science bibliography, providing free access to citations for conference proceedings, journals, and books in CS. While comprehensive for its niche, it only supports search and does not provide full paper downloads.
CrossRef provides DOI (Digital Object Identifier) lookup services for free. It does not support full-text search but is invaluable for looking up paper metadata by DOI.
IACR ePrint focuses exclusively on cryptology research from the International Association for Cryptologic Research. All papers are freely available, and no API key is required.
bioRxiv is the primary preprint server for biology and life sciences, operated by Cold Spring Harbor Laboratory. All preprints are free to access.
Zenodo is CERN-hosted open science repository, providing free access to research data and papers from researchers worldwide. No API key is required for basic access.
Unpaywall helps find open access versions of papers by checking thousands of open access repositories. It's completely free and requires no API key.
DOAJ (Directory of Open Access Journals) indexes only peer-reviewed, open access journals. It provides free access to journal metadata and paper information.
Connected Papers helps researchers discover related papers using a visual graph approach. It offers a free tier with reasonable rate limits.
WorldWideScience is a global science gateway providing access to scientific databases from multiple countries. It's completely free and operated by the scientific community.
OSF Preprints provides access to preprints from the Open Science Framework, covering many disciplines. No API key is required.
BASE (Bielefeld Academic Search Engine) aggregates content from over 4,000 data sources. It provides free access to academic resources.
Dimensions is a research discovery platform funded by Digital Science, offering free access to papers, grants, patents, and clinical trials.
MDPI is an open access publisher that provides free access to all its journal articles. The website is free to use.
Google Scholar is completely free but disabled by default because it uses scraping techniques rather than an official API. See Google Scholar Setup for how to enable it.
Semantic Scholar offers a free tier with approximately 100 papers per day and 1 request per second. Providing a SEMANTIC_SCHOLAR_API_KEY significantly increases these limits. The API key is free to obtain from the Semantic Scholar website.
CORE aggregates millions of open access papers from repositories worldwide. A free API key is available at core.ac.uk and increases rate limits. Without a key, functionality is limited.
SciSpace provides a research discovery platform with both free and paid tiers. An optional API key can be configured for enhanced access.
IEEE Xplore requires an API key for programmatic access. Free API keys are available by registering at developer.ieee.org. The free tier provides limited requests per second, and full access requires institutional subscriptions.
ACM Digital Library requires an API key for its programmatic search API. Free API keys can be obtained from the ACM Developer Portal (developers.acm.org). Without an API key, functionality is severely limited.
Springer may require an API key for full programmatic access. Check the Springer Nature Developer Portal for registration and access details.
JSTOR is primarily a subscription-based service. Institutional access is typically required, and API access may need additional licensing.
Google Scholar is disabled by default both at compile-time and runtime due to its scraping-based approach:
-
Compile with the feature flag:
cargo build --features google_scholar
-
Enable at runtime:
export GOOGLE_SCHOLAR_ENABLED=true
export SEMANTIC_SCHOLAR_API_KEY="your-api-key-here"Get your free API key at: https://www.semanticscholar.org/product/api
export CORE_API_KEY="your-core-api-key"Register for a free API key at: https://core.ac.uk/services/api
export IEEE_XPLORE_API_KEY="your-ieee-api-key"Register for a free API key at: https://developer.ieee.org/
export ACM_API_KEY="your-acm-api-key"Register for a free API key at: https://developers.acm.org/
export OPENALEX_EMAIL="your-email@example.com"Providing an email gives you access to OpenAlex's "polite pool" with better rate limits.
Without API keys, sources enforce rate limits to prevent abuse. You can customize these limits:
# Semantic Scholar rate limit (requests per second)
export SEMANTIC_SCHOLAR_RATE_LIMIT=5
# Global rate limit
export RESEARCH_MASTER_RATE_LIMITS_DEFAULT_REQUESTS_PER_SECOND=10
# Disable rate limiting entirely
export RESEARCH_MASTER_RATE_LIMITS_DEFAULT_REQUESTS_PER_SECOND=0Note: Disabling rate limits or setting them too high may result in your IP being blocked by the source APIs. Use with caution.
- Keyword search with relevance ranking
- Year range filtering:
"2020","2018-2022","2010-","-2015" - Category/subject filtering for categorized sources
- Author-specific search (supported sources)
- DOI-based lookup across multiple sources
- Download papers directly to your local filesystem
- Automatic directory creation
- Configurable download paths
- Optional organization by source
- PDF text extraction for reading paper contents (requires poppler)
- Forward citations: Find papers that cite a given paper
- References: Discover papers referenced by a given paper
- Related papers: Explore similar research
- Citation count tracking
Intelligent duplicate detection across sources:
- DOI matching (exact)
- Title similarity (Jaro-Winkler algorithm, 0.95+ threshold)
- Author verification
- Multiple strategies: keep first, keep last, or mark duplicates
- Auto-disable sources that fail to initialize (e.g., missing API keys)
- Enable specific sources via environment variable
- Graceful degradation when some sources are unavailable
- Google Scholar is disabled by default and requires
GOOGLE_SCHOLAR_ENABLED=trueto activate
- Installation - How to install Research Master MCP
- Usage - CLI commands and options
- Configuration - Environment variables and config file
- Development - Adding new sources