Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

FGBench: A Dataset and Benchmark for Molecular Property Reasoning at Functional Group-Level in Large Language Models

Created by
  • Haebom

Author

Xuan Liu, Siru Ouyang, Xianrui Zhong, Jiawei Han, Huimin Zhao

Outline

This paper introduces FGBench, a novel dataset containing 625,000 molecular feature inference problems, aimed at improving the performance of large-scale language models (LLMs) leveraging functional group (FG) information in chemistry. FGBench accurately annotates and localizes functional groups within molecules, strengthening the connection between molecular structures and textual descriptions and facilitating the development of more interpretable and structure-aware LLMs. It encompasses regression and classification tasks for 245 different functional groups across three categories (single-functional group influence, multi-functional group interactions, and direct molecular comparisons). Benchmark results from state-of-the-art LLMs demonstrate that current LLMs struggle with feature inference at the functional group level. The FGBench methodology is expected to serve as a foundation for generating novel question-answer pairs with functional group-level information, enabling LLMs to better understand fine-grained molecular structure-property relationships. The dataset and evaluation code are publicly available on GitHub.

Takeaways, Limitations

Takeaways:
We present FGBench, a new dataset that can contribute to improving the chemical inference capability of LLM by utilizing fine-grained information at the functional group level.
Contribute to the development of new drugs and advancements in molecular design by improving understanding of the relationship between molecular structure and properties.
Provides regression and classification tasks for various functional groups to help evaluate and improve the performance of LLM.
FGBench's methodology provides a foundation for building other chemical-related datasets.
Limitations:
Current LLMs struggle with the functional-level inference problems presented in FGBench, suggesting the need for performance improvements in LLMs.
Further research may be needed on the size and diversity of the dataset.
There is a possibility of bias or data imbalance issues for certain functional groups.
👍