consistent strategy doc strings by psnairne · Pull Request #447 · P2GX/PhenoXtract

psnairne · 2026-03-10T09:26:41Z

No description provided.

SmartMonkey-git

Please add: #![deny(rustdoc::broken_intra_doc_links)] to the top of our lib.rs file.

Also add this to our ci:

doc-check:
  name: Doc Check
  runs-on: ubuntu-latest
  timeout-minutes: 15
  steps:
    - uses: actions/checkout@v4

    - uses: actions-rust-lang/setup-rust-toolchain@v1.13.0
      with:
        toolchain: stable

    - uses: Swatinem/rust-cache@v2.8.0

    - name: Check documentation
      env:
        RUSTDOCFLAGS: "-D warnings -D rustdoc::broken_intra_doc_links"
      run: cargo doc --no-deps --all-features

SmartMonkey-git · 2026-03-12T08:56:35Z

phenoxtract/src/transform/strategies/age_to_iso8601.rs

+/// # Example
+///
+/// The table
+/// ```text


I think, this is:

Suggested change

/// ```text

/// ```csv

SmartMonkey-git · 2026-03-12T08:59:53Z

phenoxtract/src/transform/strategies/age_to_iso8601.rs

 use std::collections::{HashMap, HashSet};

 #[derive(Debug)]
+/// # Description


The first line of a doc string will be used in different places as a headline for this class on the doc website of crates.io.

For example:
https://docs.rs/securiety/0.2.8/securiety/curie/index.html
https://github.com/SmartMonkey-git/securiety/blob/c6eeb0b7aac212d331b0876dc6e284500ad1c2ad/src/curie.rs#L3

Can use see how the first line is used?

So, if we keep this we will see something like:
AgeToIso8601Strategy Description

So, the first line should be a single sentence summary.

SmartMonkey-git · 2026-03-12T09:10:13Z

phenoxtract/src/transform/strategies/age_to_iso8601.rs

 /// Given a column whose cells contains ages (e.g. subject age, age of death, age of onset)
 /// this strategy converts integer entries to ISO8601 durations: 47 -> P47Y
+///
 /// NOTE: the integers must be between 0 and 150.


I would:

Suggested change

/// NOTE: the integers must be between 0 and 150.

/// ## Note

Integers must be between 0 and 150.

or maybe leave it out here, since its already explained in the error section.

SmartMonkey-git · 2026-03-12T09:10:15Z

phenoxtract/src/transform/strategies/age_to_iso8601.rs

 #[derive(Debug)]
+/// # Description
+///
 /// Given a column whose cells contains ages (e.g. subject age, age of death, age of onset)


Please, refer to our types everywhere.

Suggested change

/// Given a column whose cells contains ages (e.g. subject age, age of death, age of onset)

/// Given a column whose cells contains ages (e.g. [`Context:SubjectAge`], [`Context:AgeOfDeath`], [`Context:AgeOfOnset`])

SmartMonkey-git · 2026-03-12T09:13:26Z

phenoxtract/src/transform/strategies/alias_map.rs

-/// and a ToString AliasMap which converts "M" to "Male" and "F" to "Female"
+/// # Description
+///
+/// Given a collection of `ContextualiseDataframes`, this strategy will apply all the aliases


Also here:

Suggested change

/// Given a collection of `ContextualiseDataframes`, this strategy will apply all the aliases

/// Given a collection of [`ContextualisedDataframe`], this strategy will apply all the aliases

SmartMonkey-git · 2026-03-12T09:23:05Z

phenoxtract/src/transform/strategies/alias_map.rs

+/// P001, 4
+/// P002, 0
+/// ```
+///
+/// # Errors
+///
+/// Errors will be thrown if:
+/// - Any columns to be aliased cannot be cast to String datatype.


Also here: [String] or do you mean [Dtype::String]?

SmartMonkey-git · 2026-03-12T09:24:56Z

phenoxtract/src/transform/strategies/date_to_age.rs

+/// # Errors
+///
+/// An error will be thrown if
+/// 1. A DOB is before to a date for a patient, leading to a negative age.


Here you use numbers on AliasMapStrategy you used dashes.

Suggested change

/// 1. A DOB is before to a date for a patient, leading to a negative age.

/// 1. A date of birth is before to a date for a patient, leading to a negative age.

or might even link the context type.

SmartMonkey-git · 2026-03-12T09:27:42Z

phenoxtract/src/transform/strategies/date_to_age.rs

+///
+/// An error will be thrown if
+/// 1. A DOB is before to a date for a patient, leading to a negative age.
+/// 2. There exists a date which cannot be converted to an age due to missing DOB data.


There exists a date which cannot be converted to an age due to missing DOB data

I didn't know. This will be a major annoyance, as soon as we have large datasets.

Yes. For those situations we have a few options:

Just don't apply the strategy. I decided to be strict

Add a bool to the strategy so it can be strict or otherwise

Make the strategy just allow situations with missing data

Probably 2 is best. But I would point out that if you are creating a cohort of Phenopackets, you probably either want just ages in a field, or just dates. To have a mix is strange. So strictness did make sense. And DOB data probably is less likely to be missing than other data. Miraculously the strategy seemed to work with the i_data.

SmartMonkey-git · 2026-03-12T09:30:03Z

phenoxtract/src/transform/strategies/hpo_disease_splitter.rs

+/// # Fields
+///
+/// * `hpo_bidict_lib` - This should contain BiDictLibrary for the version of HPO that you want to use.
+/// * `disease_bidict_lib` - All non-HPO cells will be processed by this disease BiDictLibrary.


I don't think Fields is a common section. At least not to the standard.

Common sections

Maybe we should drop it.

SmartMonkey-git · 2026-03-12T09:32:05Z

phenoxtract/src/transform/strategies/mapping.rs

 /// ```ignore
 /// let sex_mapping = MappingStrategy::default_sex_mapping_strategy();
 /// // Maps variations like "m", "male", "man" → "MALE"
 /// // and "f", "female", "woman" → "FEMALE"
 /// ```


Example is not concise with the others

new doc strings

de28c45

psnairne linked an issue Mar 10, 2026 that may be closed by this pull request

Consistent strategy doc strings #283

Open

SmartMonkey-git reviewed Mar 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

consistent strategy doc strings#447

consistent strategy doc strings#447
psnairne wants to merge 1 commit intomainfrom
pn/consistent-strategy-doc-strings

psnairne commented Mar 10, 2026

Uh oh!

SmartMonkey-git left a comment •

edited

Loading

Uh oh!

SmartMonkey-git Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026 •

edited

Loading

Uh oh!

psnairne Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026

Uh oh!

SmartMonkey-git Mar 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	/// NOTE: the integers must be between 0 and 150.
	/// ## Note
	Integers must be between 0 and 150.

	/// Given a column whose cells contains ages (e.g. subject age, age of death, age of onset)
	/// Given a column whose cells contains ages (e.g. [`Context:SubjectAge`], [`Context:AgeOfDeath`], [`Context:AgeOfOnset`])

	/// Given a collection of `ContextualiseDataframes`, this strategy will apply all the aliases
	/// Given a collection of [`ContextualisedDataframe`], this strategy will apply all the aliases

	/// 1. A DOB is before to a date for a patient, leading to a negative age.
	/// 1. A date of birth is before to a date for a patient, leading to a negative age.

Conversation

psnairne commented Mar 10, 2026

Uh oh!

SmartMonkey-git left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SmartMonkey-git Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SmartMonkey-git Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SmartMonkey-git left a comment •

edited

Loading

SmartMonkey-git Mar 12, 2026 •

edited

Loading

SmartMonkey-git Mar 12, 2026 •

edited

Loading