Skip to content

instadeepai/tcr-dataset-size-estimation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Assessing Data Size Requirements For Training Generalizable Sequence-based TCR Specificity Models Via Pan-allelic MHC-I Non-self Ligandome Evaluation

This repository contains the companion code for the paper Assessing Data Size Requirements For Training Generalizable Sequence-based TCR Specificity Models Via Pan-allelic MHC-I Non-self Ligandome Evaluation by Delaunay A., McGibbon M et al.

Usage

The repository follows the structure of the paper:

  • The models_benchmark folder contains all code relevant for preparing the benchmark dataset, calculating bootstrapped ROC-AUC of models' specificity prediction and reproducing the paper's visualization plots.

  • The ligandome folder contains the code for reproducing the ligandome computation workflow on public data.

Please refer to each folder's README file for environment setup and specific usage instructions.

License and Copyright

This code is copyright of BioNTech SE, 2022-2025.

The code is available under the GPL v3 license terms. Due to licensing restrictions regarding redistribution for some of the external datasets, we provide instructions within this repository to access and prepare all datasets used in this work from their original sources.

Citing this work

Please refer to the following:

@article{Data set size requirements for generalizable TCR-antigen specificity prediction,
  title={Assessing Data Size Requirements For Training Generalizable Sequence-based TCR Specificity Models Via Pan-allelic MHC-I Non-self Ligandome Evaluation},
  author={Delaunay, A. and McGibbon, M. and Djermani, B. and Gorbushin, N. and Chaves García-Mascaraque, S. and Rayment, I. and Kizhvatov, I. and Petit, C. and Lang, M. and Rooney, M. and Beguir, K. and Sahin, U. and Copoiu, L. and Lopez Carranza, N. and Tovchigrechko, A.},
  journal={},
  volume={},
  year={2025},
  publisher={}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 5