Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

A Weighted U Statistic for Genetic Association Analyzes of Sequencing Data

Created by
  • Haebom

Author

Changshuai Wei, Ming Li, Zihuai He, Olga Vsevolozhskaya, Daniel J. Schaid, Qing Lu

Outline

This paper presents WU-seq, a novel methodology for effectively analyzing the massive amounts of genome sequence data generated by the advancement of next-generation sequencing technology. Existing statistical methods suffer from significant limitations in analytical power due to the low frequency of rare variants and the high dimensionality of the data. Based on the nonparametric U statistic, WU-seq can be applied to a variety of phenotypes without making assumptions about disease models or phenotypic distributions. Through simulation and empirical studies, we demonstrate that WU-seq outperforms the existing SKAT method, especially when phenotypes follow long-tailed distributions, and performs similarly to SKAT even when the assumptions are met. Finally, we apply WU-seq to the Dallas Heart Study (DHS) data and discover an association between ANGPTL 4 and very low-density lipoprotein cholesterol.

Takeaways, Limitations

Takeaways:
Introducing WU-seq, a powerful and flexible new methodology for analyzing high-dimensional sequence data.
It shows excellent performance even when the assumptions of the existing method (SKAT) are violated.
Applicable to various phenotypes
Discovering genetic associations through real-world data analysis
Limitations:
Further verification of the generalizability of the simulation and empirical studies presented in this study is needed.
A more comprehensive comparative study with other high-dimensional data analysis methods is needed.
Further analysis of the computational complexity and efficiency of WU-seq is needed.
👍