Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

ITBLS: A Dataset of Interactive Conversations Over Tabular Information

Created by
  • Haebom

Author

Anirudh Sundar, Christopher Richardson, Adar Avsian, Larry Heck

Outline

This paper introduces iTBLS, an interactive tabular dataset based on data from ArXiv, a pre-release academic paper website. iTBLS consists of three types of tabular tasks: interpretation, modification, and generation. Interpretation focuses on understanding tables, modification focuses on manipulating table information, and generation focuses on adding new natural language evidence. Furthermore, we present a novel framework for reconstructing tabular tasks as question-answering. This framework formulates appropriate questions based on the characteristics of user interactions and uses user requests as evidence to answer them. This approach improves performance across all tasks in iTBLS compared to sequence-to-sequence modeling baselines. Furthermore, the proposed question-answering-based reconstruction method is applied to an existing text-to-table task dataset that summarizes text into tables, improving accuracy (Exact-Match) by up to 13% and BERTScore by up to 16% compared to the state-of-the-art.

Takeaways, Limitations

Takeaways:
We present iTBLS, a new interactive tabular dataset based on ArXiv data.
We present a novel framework that reconstructs table tasks into question-answering tasks and demonstrates improved performance over existing methods.
Demonstrates performance improvements for various table operations (interpretation, modification, generation).
We also achieved performance improvements in existing text-to-table operations.
Limitations:
There is a lack of specific mention of the size and diversity of the iTBLS dataset.
Further research is needed to evaluate the generalization performance and applicability of the proposed framework to other domains.
Additional analysis using evaluation metrics other than those used (Exact-Match, BERTScore) may be required.
👍