Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

CellForge: Agentic Design of Virtual Cell Models

Created by
  • Haebom

Author

Xiangru Tang, Zhuoyun Yu, Jiapeng Chen, Yan Cui, Daniel Shao, Weixu Wang, Fang Wu, Yuchen Zhuang, Wenqi Shi, Zhi Huang, Arman Cohan, Xihong Lin, Fabian Theis, Smita Krishnaswamy, Mark Gerstein

Outline

CellForge is a virtual cell modeling system that aims to predict quantitative responses to various stimuli through the convergence of artificial intelligence and biology. To overcome challenges such as the complexity of biological systems, heterogeneity of data types, and the need for expertise across various fields, CellForge leverages a multi-agent framework to directly transform provided biological datasets and research objectives into optimized virtual cell models. Using only single-cell multi-omics data and task descriptions, CellForge generates optimized model architectures and executable code for virtual cell model training and inference. It consists of three core modules: Task Analysis for dataset feature analysis and literature search; Method Design for developing optimal modeling strategies through collaboration among expert agents; and Experiment Execution for automated code generation. Agents in the Method Design module, comprised of experts and mediators with different perspectives, collaborate until consensus is reached. Experiments on single-cell perturbation prediction using six diverse datasets (including gene knockout, drug treatment, and cytokine stimulation) demonstrate CellForge's superior performance compared to existing state-of-the-art methods. This demonstrates that iterative interactions among LLM agents with different perspectives yield superior solutions than directly addressing modeling tasks.

Takeaways, Limitations

Takeaways:
We demonstrate that CellForge, a multi-agent-based virtual cell modeling system, can automatically generate optimized virtual cell models from single-cell multi-omics data.
Achieves superior performance over existing state-of-the-art methods on a variety of datasets and tasks.
Demonstrating the utility of a modeling approach through collaboration between LLM agents from different perspectives.
Publicly available code provided ( https://github.com/gersteinlab/CellForge ).
Limitations:
Further research is needed to explore CellForge's generalization performance and applicability to various biological systems.
There is a need to improve the efficiency and transparency of the collaboration process between agents.
The difficulty of creating models that fully reflect complex biological processes.
Potential overfitting to a specific dataset.
👍