Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Generation of structure-guided pMHC-I libraries using Diffusion Models

Created by
  • Haebom

Author

Sergio Mares, Ariel Espinoza Weinberger, Nilah M. Ioannidis

Outline

To address the limitations of personalized vaccines and T-cell immunotherapies that rely on identifying peptide-MHC class I (pMHC-I) interactions capable of eliciting effective immune responses, this paper presents a structure-based benchmark of pMHC-I peptides designed based on crystal structure interaction distances using a diffusion model. This benchmark, encompassing 20 prioritized HLA alleles and independent of previously characterized peptides, demonstrates structural generalization without experimental dataset bias by reproducing canonical anchor residue notations. Using this resource, we demonstrate that state-of-the-art sequence-based predictors underperform in recognizing binding potential for these structurally stable designs, revealing allele-specific limitations not seen in existing evaluations. Our geometry-aware design pipeline generates peptides with high predicted structural integrity and higher residue diversity than existing datasets, providing a valuable resource for unbiased model training and evaluation. Code and data are available at https://github.com/sermare/struct-mhc-dev .

Takeaways, Limitations

Takeaways:
A new pMHC-I peptide benchmark is presented that overcomes the bias of existing benchmarks.
Generation of peptides with high structural integrity and residue diversity through structure-based design.
By revealing the limitations of state-of-the-art sequence-based predictors, we suggest directions for future model improvement.
Providing new resources to contribute to the development of personalized vaccines and T cell immunotherapies.
Limitations:
Further validation is needed to determine whether this benchmark fully reflects actual in vivo immune responses.
Further research is needed on the interpretability and predictability of design processes using diffusion models.
Since only 20 HLA alleles were included, generalizability to other alleles needs to be examined.
👍