Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Reconstructing Sepsis Trajectories from Clinical Case Reports using LLMs: the Textual Time Series Corpus for Sepsis

Created by
  • Haebom

Author

Shahriar Noroozizadeh, Jeremy C. Weiss

Outline

This paper presents a pipeline for extracting and annotating clinical findings from patient records (case reports) over time using large-scale language models (LLMs) for time-series analysis of healthcare data. We construct a text time-series corpus of 2,139 sepsis (Sepsis-3) patient records from the PubMed Open Access (PMOA) subset and validate the system through comparison with the I2B2/MIMIC-IV dataset. Using the O1-preview and Llama 3.3 70B Instruct models, we demonstrate a high event match rate (~0.75) and strong concordance (~0.93). However, we highlight the limitations of LLMs in temporal reconstruction, suggesting potential improvements through multimodal integration.

Takeaways, Limitations

Takeaways:
We demonstrate that large-scale language models can be used to effectively extract and annotate clinical findings over time from medical records.
Automated medical data time series generation offers the potential for developing more sophisticated medical analytics and predictive models.
Contributing to the activation of medical research by providing a text time series corpus according to open access time.
Limitations:
There are limitations to the temporal reconstruction capabilities of LLM.
The need for improved performance through multi-modal data integration is raised.
Further review of the generalizability of the dataset used to evaluate the model's performance is needed.
👍