Fix MHC typing vs HLA typing naming inconsistency by jonasscheid · Pull Request #801 · bigbio/proteomics-sample-metadata

jonasscheid · 2026-02-23T07:34:52Z

Summary

Renames HLA typing / HLA typing method to MHC typing / MHC typing method in sdrf-terms.tsv to match the immunopeptidomics template README which already uses the species-agnostic MHC typing terminology
Updates all cross-references in quickstart guides, main README, site HTML, and llms.txt
Aligns descriptions and allowed values to be species-agnostic (e.g., supports both human HLA and mouse H-2 nomenclature)
Adds inferred from mass spectrometry as a valid MHC typing method

Fixes #794

Test plan

Verify sdrf-terms.tsv column names match immunopeptidomics README.adoc column names
Verify site HTML renders correctly with updated terms
Confirm no remaining HLA typing references used as column names
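
The last check in the test plan could be scripted roughly as below. This is only a sketch: the helper `find_hla_typing` is hypothetical and assumes that leftover column names contain the literal phrase "HLA typing"; the sample text is made up for illustration.

```python
import re

# Hypothetical helper, not part of the repository. It assumes leftover
# column names contain the literal phrase "HLA typing" (case-insensitive).
PATTERN = re.compile(r"HLA typing", re.IGNORECASE)


def find_hla_typing(text):
    """Return (line number, line) pairs that still mention 'HLA typing'."""
    return [
        (number, line)
        for number, line in enumerate(text.splitlines(), start=1)
        if PATTERN.search(line)
    ]


# Made-up sample content standing in for a file such as sdrf-terms.tsv.
sample = "MHC typing\tallele list\nHLA typing method\tassay used\n"
print(find_hla_typing(sample))
```

An empty result for every checked file would indicate that no such references remain.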

coderabbitai · 2026-02-23T07:35:16Z

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

ypriverol · 2026-02-23T07:37:51Z

@jonasscheid Can you also review this folder for consistency: https://github.com/bigbio/proteomics-sample-metadata/tree/dev/sdrf-proteomics/templates/immunopeptidomics, together with the corresponding template: https://github.com/bigbio/sdrf-templates/blob/main/immunopeptidomics/1.0.0-dev/immunopeptidomics.yaml

ypriverol · 2026-02-24T06:49:02Z

@jonasscheid Should we also clearly explain how to write the enzyme for experiments without enzymatic cleavage? https://www.ebi.ac.uk/ols4/ontologies/ms/classes/http%253A%252F%252Fpurl.obolibrary.org%252Fobo%252FMS_1001956?lang=en

jonasscheid · 2026-02-24T14:55:04Z

Theoretically there is no need for it, since all immunopeptidomics experiments use unspecific cleavage. Or how do you normally handle this kind of "default" metadata?

ypriverol · 2026-02-24T18:25:10Z

I think it is not needed. However, we still have one issue that I haven't solved well. When we combine two templates, say DDA + immunopeptidomics, we will have conflicting rules: the DDA template requires the enzyme and the immunopeptidomics template does not. @noatgnu, I have to make sure we have a mechanism to represent the case where a property is required in one template and optional or not required in the other: what to do and, more importantly, how to detect this and make it clear to users. @jpfeuffer @timosachsenberg Can you also give us your opinion on this?

noatgnu · 2026-02-24T18:44:49Z

We currently have a mechanism for merging templates, but we need more test cases for it. You can see it in sdrf-pipelines here: https://github.com/bigbio/sdrf-pipelines/blob/dev/src/sdrf_pipelines/sdrf/schemas/utils.py

There you will find a utility for composing templates from different schemas into a new schema with all the rules. For this purpose, the merge strategy would be _merge_fields_combine_strategy: if two or more templates have the same column and a requirement is defined for that column, then only the requirement at the highest level will persist.

utils.py can also write out a TSV file with all the template column headers in the correct order, using the schema_to_tsv function, which takes a schema.

We might need an update so that it writes out the correct references to the original schemas that the composite schema was created from, including their names and versions.
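
The "highest level persists" behaviour described above can be illustrated roughly as follows. This is a minimal sketch and not the code in utils.py: the names `LEVEL_RANK` and `merge_requirements`, the dictionary representation of a template, the ranking of levels, and the example levels are all assumptions made for illustration.

```python
# Hypothetical sketch: NOT the sdrf-pipelines implementation.
# A template is modelled as {column name: requirement level}.
LEVEL_RANK = {"optional": 0, "recommended": 1, "required": 2}  # assumed ordering


def merge_requirements(*templates):
    """Combine templates; for shared columns keep the strictest level."""
    merged = {}
    for template in templates:
        for column, level in template.items():
            current = merged.get(column)
            if current is None or LEVEL_RANK[level] > LEVEL_RANK[current]:
                merged[column] = level
    return merged


# Illustrative levels only; the real templates may define them differently.
dda = {"enzyme": "required"}
immuno = {"enzyme": "optional", "MHC typing": "required"}

combined = merge_requirements(dda, immuno)
print(combined["enzyme"])  # required
```

With a rule like this the result does not depend on the order in which templates are passed, but a template can never relax a requirement set by another one.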

jpfeuffer · 2026-02-24T18:54:36Z

@noatgnu The problem with this is:

a) how is highest level defined
b) the immunopeptidomics template will by default say nothing about enzyme, so it won't overwrite any field whatsoever

jpfeuffer · 2026-02-24T19:00:29Z

But to be honest, immunopeptidomics can just require an enzyme column, and even more strictly require one specific enzyme only, called "No enzyme", which should be part of the ontology.
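
A required, term-restricted enzyme column could be checked along these lines. This is a sketch under stated assumptions: `validate_enzyme` and `ALLOWED_ENZYME_TERMS` are hypothetical names, and the label "No enzyme" is simply taken from the comment above without being checked against any ontology.

```python
# Hypothetical sketch; names and the allowed label are illustrative only.
ALLOWED_ENZYME_TERMS = {"no enzyme"}  # label taken from the comment above


def validate_enzyme(values):
    """Return a list of error messages for an enzyme column.

    The column is required (no empty cells) and term-restricted
    (every cell must be one of ALLOWED_ENZYME_TERMS, case-insensitive).
    """
    errors = []
    for row, value in enumerate(values, start=1):
        cleaned = value.strip().lower()
        if not cleaned:
            errors.append(f"row {row}: enzyme is required")
        elif cleaned not in ALLOWED_ENZYME_TERMS:
            errors.append(f"row {row}: '{value}' is not an allowed enzyme term")
    return errors


print(validate_enzyme(["No enzyme", "Trypsin", ""]))
```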

jonasscheid · 2026-02-25T07:17:05Z

> a) how is highest level defined

@noatgnu I would expect mandatory > optional > custom for merging templates, right?

I agree with @jpfeuffer: it will most likely be combined with the proteomics-ms template, and therefore an enzyme specification will be required. I would still make it optional in the immunopeptidomics template, since it is not a concrete batch effect of the data, unlike MHC type or enrichment method. But then we have it documented in case other templates require an enzyme.

The same should then be done for other templates, e.g. metaproteomics.

nithujohn · 2026-02-25T07:32:48Z

I would suggest the name MHC class, since the example in the template is "HLA-A*02:01, HLA-B*07:02, HLA-C*07:02, H-2Kb, H-2Db", which is the type of MHC molecule presenting peptides. MHC typing, in contrast, would be the laboratory method used to determine which HLA alleles a donor has.

jpfeuffer · 2026-02-25T07:35:37Z

Yes, but even with optional, this would not be enough to overwrite the requiredness of this column that comes from the MS template (with the reasonable order that you mentioned).

Therefore you either have to define overwriting strictly by combination order (i.e. MS + immuno is different from immuno + MS), or, as I said, make immuno even stricter about enzymes and make it a required and term-restricted column.
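
The order-dependent option can be sketched as a "last template wins" merge. The function `merge_last_wins` and the dictionary representation of a template are hypothetical and only meant to show why MS + immuno would differ from immuno + MS.

```python
# Hypothetical sketch of "later template wins" merging; not an existing API.
def merge_last_wins(*templates):
    """Combine templates; a later template overrides earlier ones per column."""
    merged = {}
    for template in templates:
        merged.update(template)
    return merged


# Illustrative requirement levels only.
ms = {"enzyme": "required"}
immuno = {"enzyme": "optional", "MHC typing": "required"}

print(merge_last_wins(ms, immuno)["enzyme"])  # optional
print(merge_last_wins(immuno, ms)["enzyme"])  # required
```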

jonasscheid · 2026-02-25T07:39:55Z

What about a default enzyme value in the YAML for immunopeptidomics? If this field is required by another template, it would just be populated. I guess this behaviour could be relevant for other templates as well.
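
As a rough illustration of this idea, a template-level default could be applied only when another template requires the column. Everything here is an assumption for the sake of the example: the helper `apply_defaults`, the way defaults and required columns are represented, and the default value itself.

```python
# Hypothetical sketch; the data layout and key names are assumptions.
def apply_defaults(required_columns, defaults, row):
    """Fill missing required columns of one SDRF row from template defaults."""
    filled = dict(row)
    for column in required_columns:
        if not filled.get(column) and column in defaults:
            filled[column] = defaults[column]
    return filled


required_by_ms = ["enzyme"]  # required by another template
immuno_defaults = {"enzyme": "unspecific cleavage"}  # assumed default value

row = {"MHC typing": "HLA-A*02:01"}
print(apply_defaults(required_by_ms, immuno_defaults, row))
```

A value already present in the row is left untouched, so the default only fills gaps.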

ypriverol · 2026-02-25T07:48:59Z

A couple of points:

Immunopeptidomics can be combined with both DDA and DIA, not only DDA.

I agree with @jpfeuffer that we can reference the enzyme using MS:1001956 (unspecific cleavage).

This discussion highlights a more general issue around combining templates. What happens when one template defines a column as REQUIRED, while another defines it as OPTIONAL or does not define it at all — especially when the column may not even make sense in that context? For example, the immunopeptidomics template may not define an enzyme column, while the DDA template defines enzyme as REQUIRED.

The real issue may be how we designed DDA/DIA: we marked enzyme as REQUIRED in DDA without considering valid DDA experiments (e.g., immunopeptidomics or top-down) where no enzyme applies. The key question is whether column requirements (REQUIRED / RECOMMENDED / OPTIONAL) should be context-dependent rather than globally enforced at the acquisition level.

jpfeuffer · 2026-02-25T08:28:58Z

Yes, @jonasscheid @ypriverol both your options are also valid! We have to decide which one is the most maintainable or least complex.

Align MHC typing column names across sdrf-terms.tsv, adoc templates, and site (fixes bigbio#794)

f012b14

ypriverol approved these changes Feb 26, 2026

ypriverol merged commit dcf0f6c into bigbio:dev Feb 26, 2026
2 of 3 checks passed
