Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

LLM-Based Agents for Competitive Landscape Mapping in Drug Asset Due Diligence

Created by
  • Haebom

Author

Alisa Vinogradova (Optic Inc), Vlad Vinogradov (Optic Inc), Dmitrii Radkevich (Optic Inc), Ilya Yasny (Optic Inc), Dmitry Kobyzev (Optic Inc), Ivan Izmailov (Optic Inc), Katsiaryna Yanchanka (Optic Inc), Roman Doronin (Optic Inc), Andrey Doronichev (Optic Inc)

Outline

This paper describes and benchmarks the competitor discovery component used within an agent-based AI system for rapid pharmaceutical asset due diligence. Given a specific indication, the competitor discovery AI agent searches for all drugs that comprise the competitive landscape for that indication and extracts standardized properties of these drugs. Competitor definitions vary across investors, data is paid/licensed, and fragmented across registries. Ontologies are inconsistent across indications, drug names have numerous aliases, are multimodal, and are rapidly evolving. While currently considered the best tool for this problem, LLM-based AI systems cannot reliably retrieve all competitor drug names, and there are no accepted public benchmarks for this task. To address this lack of evaluation, this paper uses an LLM-based agent to transform five years of multimodal unstructured due diligence notes from a private biotech VC fund into a structured evaluation corpus that maps indications to competitor drugs with standardized properties. Furthermore, we introduce a competitor validation LLM-as-a-judge agent that filters out false positives from the predicted competitor list to maximize precision and prevent hallucinations. In this benchmark, the Competitor Discovery agent achieved 83% recall, outperforming OpenAI Deep Research (65%) and Perplexity Labs (60%). This system is targeted at enterprise users; in a case study targeting a biotech VC fund, analyst processing time for competitive analysis was reduced from 2.5 days to approximately 3 hours (approximately 20x).

Takeaways, Limitations

Takeaways: This study presents an effective method for transforming multimodal unstructured data into structured data using an LLM-based agent, thereby improving the accuracy of competitive drug discovery. Through application in a real-world investment environment, this method demonstrates practical results, dramatically reducing analysis time. It also contributes to the advancement of future research by presenting a new benchmark dataset.
Limitations: The data used was limited to a specific biotech VC fund, requiring verification of generalizability. Further analysis and validation of the LLM-as-a-judge agent's performance are needed. The fact that competitor definitions vary across investors may limit the system's generalizability. Further research is needed to determine the system's adaptability to continuous data changes.
👍