krank

Fetch curated dream reports.

See the online documentation for details.

Installation

pip install --upgrade krank

Usage

import krank

# View a list of available corpora.
krank.list_corpora()
# ['hvdc', 'zhang2019']

# View a list of available versions for a corpus.
krank.list_versions("zhang2019")
# ['1']

# Print metadata about a single corpus.
krank.info("zhang2019")
# Corpus: zhang2019
#   Title: Zhang & Wamsley, 2019
#   Description: Dream reports collected from a laboratory polysomnography study
#   Version: 1
#   Citations: Zhang, J., & Wamsley, E. J. (2019); Wong, W., Herzog, R., ... (2025)

# Load a corpus.
corpus = krank.load("zhang2019")
corpus
# Corpus('zhang2019')

# Print corpus info (same as krank.info()).
print(corpus)
# Corpus: zhang2019
#   Title: Zhang & Wamsley, 2019
#   Description: Dream reports collected from a laboratory polysomnography study
#   Version: 1
#   Citations: Zhang, J., & Wamsley, E. J. (2019); Wong, W., Herzog, R., ... (2025)

# Get number of reports and authors.
corpus.n_reports
# 120
corpus.n_authors
# 16

# Return metadata as a dictionary.
corpus.metadata["title"]
# 'Zhang & Wamsley 2019 Dream Reports'
corpus.metadata["hash"]
# 'md5:a61a3c56f4ee8c14e4a6466044df88f8'
corpus.metadata["brief_description"]
# 'Dream reports collected from a laboratory polysomnography study'

# Return unique authors and their associated metadata in a tidy dataframe.
corpus.authors.head()
#    author  age     sex
# 0      10   23    Male
# 1      26   18    Male
# 2      30   19  Female
# 3      42   20    Male
# 4      83   20    Male

# Return dream reports and their associated metadata in a tidy dataframe.
corpus.reports.head()
#    author      time stage_intended stage_actual  experience                                             report
# 0      10  07:17:27            REM      Morning        True  I umm I was with my parents and we're having a...
# 1      10  02:07:02             N2         NREM        True  Ummm... trying to put um everything together [...
# 2      10  03:50:48            REM          REM        True  Yes I'm ummm... I'm in an argument with a...wi...
# 3      10  00:16:08             N1           SO       False  What? I don't have a dream. What was your ques...
# 4      10  00:54:12             N2           SO        True  Uhuh. Umm let's see.. Running around...(silence).

Contributing

Open an Issue to request a new corpus (or for any other reason).

Adding a new corpus involves the following steps:

Adding a sources/<name>/prepare.ipynb that exports a single CSV file.
Creating a new Zenodo archive with that CSV file.
Add the corpus info to the registry.yaml file (See registry-schema.yaml for what fields are required.)
Add the corpus to mkdocs.yaml.

Versioning

Follows a modified semver structure, where, given a version number MAJOR.MINOR.PATCH, increment the:

MAJOR version when there are incompatible API changes or an existing corpus is removed.
MINOR version when functionality is added in a backward compatible manner or a new corpus is added.
PATCH version when backward compatible bug fixes are added or an existing corpus gets a new release.

Validating Registry Changes

After making changes to the registry, validate them locally using:

python scripts/validate_registry.py

This script checks:

Schema compliance (all required fields present with correct types)
Alphabetical ordering of collections and corpora
Valid references from collections to corpora
Proper hash formats and URLs

The validation runs automatically as part of the test suite in CI.

Credits

This project would not be possible without the work of the Fatiando a Terra Project, namely a heavy dependency on pooch, an inspiration from ensaio, and a structural model from the Fatiando a Terra FAIR data collection.

Uieda, L., V. C. Oliveira Jr, and V. C. F. Barbosa (2013), Modeling the Earth with Fatiando a Terra, Proceedings of the 12th Python in Science Conference, pp. 91-98. doi:10.25080/Majora-8b375195-010

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.github		.github
docs		docs
scripts		scripts
sources		sources
src/krank		src/krank
tests		tests
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

krank

Installation

Usage

Contributing

Versioning

Validating Registry Changes

Credits

About

Uh oh!

Releases 3

Uh oh!

Contributors

Uh oh!

Languages

License

remrama/krank

Folders and files

Latest commit

History

Repository files navigation

krank

Installation

Usage

Contributing

Versioning

Validating Registry Changes

Credits

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Uh oh!

Contributors

Uh oh!

Languages