Skip to content

Commit 20becc8

Browse files
committed
add EarthArXiv to README
1 parent 0ce1394 commit 20becc8

File tree

1 file changed

+23
-0
lines changed

1 file changed

+23
-0
lines changed

README.md

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -335,10 +335,33 @@ python manage.py harvest_journals --all --user-email [email protected]
335335
- `essd` - Earth System Science Data (OAI-PMH) ([Issue #59](https://github.com/GeoinformationSystems/optimap/issues/59))
336336
- `agile-giss` - AGILE-GISS conference series (OAI-PMH) ([Issue #60](https://github.com/GeoinformationSystems/optimap/issues/60))
337337
- `geo-leo` - GEO-LEO e-docs repository (OAI-PMH) ([Issue #13](https://github.com/GeoinformationSystems/optimap/issues/13))
338+
- `eartharxiv` - EarthArXiv preprint repository (OAI-PMH, ~6,000+ preprints)
338339
- `scientific-data` - Scientific Data (RSS/Atom) ([Issue #58](https://github.com/GeoinformationSystems/optimap/issues/58))
339340

340341
The command supports both OAI-PMH and RSS/Atom feeds, automatically detecting the feed type for each journal.
341342

343+
**Harvesting EarthArxiv preprints**:
344+
345+
EarthArxiv is a preprint server for Earth Sciences hosted by the California Digital Library. All harvested articles automatically receive metadata enrichment from OpenAlex, including author names, keywords, and topics.
346+
347+
```bash
348+
# Harvest first 100 preprints for testing
349+
python manage.py harvest_journals --journal eartharxiv --max-records 100 --create-sources
350+
351+
# Harvest all EarthArxiv preprints (6,000+)
352+
python manage.py harvest_journals --journal eartharxiv --create-sources
353+
354+
# Harvest EarthArxiv along with other journals
355+
python manage.py harvest_journals --journal eartharxiv --journal essd --journal geo-leo
356+
```
357+
358+
EarthArxiv provides comprehensive coverage of Earth Science preprints via its OAI-PMH API endpoint. Each publication is automatically matched with OpenAlex to retrieve:
359+
- Author information
360+
- Keywords and subject classification
361+
- Citation data
362+
- Open access status
363+
- Publication topics
364+
342365
The command provides detailed progress reporting including:
343366

344367
- Number of publications harvested

0 commit comments

Comments
 (0)