Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

PASS: Probabilistic Agentic Supernet Sampling for Interpretable and Adaptive Chest X-Ray Reasoning

Created by
  • Haebom

Author

Yushi Feng, Junye Du, Yingying Hong, Qifan Wang, Lequan Yu

Outline

PASS (Probabilistic Agentic Supernet Sampling) is a novel multimodal agent framework applicable to medical image analysis, such as chest X-ray (CXR) inference. It is proposed to address the black-box decision-making, poor multimodal integration, and inefficient pipeline problems inherent in existing tool-based agent systems. PASS adaptively samples agent workflows on a multi-tool graph to generate interpretable, probabilistically annotated decision paths. It learns task-conditional distributions to select the most appropriate tool at each layer and provides probabilistically annotated paths for subsequent audits, improving the safety of medical AI. Furthermore, it continuously condenses key findings into an evolving personalized memory and dynamically determines whether to deepen the inference path or terminate it early for efficiency. To achieve Pareto optimization between performance and cost, we designed a three-stage training procedure that includes expert knowledge pretraining, contrastive path ranking, and cost-aware reinforcement learning. We introduce CAB-E, a comprehensive benchmark for multi-level safety-critical free-form CXR inference, and perform rigorous evaluations, demonstrating that our methods significantly outperform existing methods on various benchmarks.

Takeaways, Limitations

Takeaways:
We present a new agent system paradigm that is interpretable, adaptive, and multimodal in the field of medical image analysis.
Solving the black box problem can improve the safety of medical AI.
We present an efficient method to improve performance while taking computational cost into account.
It has proven to perform well in various benchmarks.
We present a new benchmark, CAB-E, which lays the foundation for future research.
Limitations:
Currently, it has only been applied to the analysis of thoracic X line images, and its generalizability to other medical images or tasks requires further study.
Further validation of the versatility and generalizability of the CAB-E benchmark may be required.
Further research may be needed to optimize parameter settings for the three-step training procedure.
Application and validation in actual clinical settings are required.
👍