Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

A Highly Clean Recipe Dataset with Ingredient States Annotation for State Probing Task

Created by
  • Haebom

Author

Mashiro Toyooka, Kiyoharu Aizawa, Yoko Yamakata

Outline

This paper presents a study that applies state probing techniques to evaluate the ability of large-scale language models (LLMs) to understand the cooking process. LLMs are trained on vast amounts of procedural text, but because they lack direct observation of real-world phenomena, they struggle to accurately understand intermediate steps in cooking recipes. To address this, we construct a novel Japanese recipe dataset with clear and accurate annotations of ingredient state changes. Based on this dataset, we present three new tasks to evaluate the ability of LLMs to track ingredient state changes during the cooking process and identify ingredients present at intermediate steps. Experimental results using widely used LLMs, such as Llama3.1-70B and Qwen2.5-72B, demonstrate that learning ingredient state knowledge improves understanding of the cooking process, achieving performance comparable to that of commercial LLMs. The dataset is publicly available on Hugging Face.

Takeaways, Limitations

Takeaways:
We present a new methodology and dataset for assessing the comprehension of cooking process in LLM.
We experimentally demonstrated that learning material condition knowledge is effective in improving LLM students' understanding of the cooking process.
The published dataset provides a foundation for future related research.
Limitations:
The current dataset is limited to Japanese recipes. Further research is needed to expand it to other languages.
Because the LLMs used in the experiment were limited, further experiments on various LLMs are needed.
It may not encompass all aspects of the cooking process. For example, it may lack consideration for sensory factors (taste, aroma, etc.).
👍