Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

General Demographic Foundation Models for Enhancing Predictive Performance Across Diseases and Populations

Created by
  • Haebom

Author

Li-Chin Chen, Ji-Tian Sheu, Yuh-Jue Chuang

Outline

This study explores the development of the General Demographic Pre-trained (GDP) model, a foundational model specialized for demographic attributes such as age and gender. We pretrained and evaluated the model using datasets with diverse diseases and population compositions. We explored how to transform tabular demographic inputs into effective latent embeddings by combining ordering and encoding methods to construct the GDP architecture. Consequently, GDP demonstrated generalizability across tasks, diseases, and populations. Specifically, the sequential ordering approach improved model performance in diseases where age and gender are important for risk stratification, and it also enhanced representational significance in datasets where demographic attributes have relatively low predictive value.

Takeaways, Limitations

Takeaways:
A baseline model for demographic attributes in tabular form presents a promising direction for improving predictive performance in healthcare.
The GDP model demonstrates generalizability across occupations, diseases, and populations.
Sequential sorting improves model performance in diseases where age and gender are important for risk stratification.
GDP increases the expressive importance of demographic attributes, thereby increasing their influence within the predictive model.
Limitations:
There is no specific mention of Limitations in the paper (it does not appear in the abstract).
👍