ODTQA-FoRe: An Open-Domain Tabular Question Answering Dataset for Future Data Forecasting and Reasoning

๐Ÿ“… 2026-06-01
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

165K/year
๐Ÿค– AI Summary
This study addresses the limitation of current large language models in supporting future-oriented numerical forecasting within open-domain table question answering, a gap exacerbated by the absence of dedicated datasets and methodologies. To bridge this gap, the work introduces the first open-domain table QA task specifically designed for future prediction and constructs the inaugural real-world time-series table QA benchmark dataset grounded in the real estate domain. Furthermore, it proposes TimeFore, a multi-agent framework that integrates SQL-based retrieval, external time-series forecasting models, and large language model reasoning through three synergistic modulesโ€”retrieval, prediction, and analysis. Experimental results demonstrate that TimeFore significantly enhances both prediction accuracy and answer consistency, offering an effective solution for future-oriented table question answering.
๐Ÿ“ Abstract
The rapid development of LLMs has significantly advanced tabular question answering, but most systems cannot perform future-oriented numerical prediction. To address this gap, we introduce a novel task, Open-Domain Tabular Question Answering for Future Data Forecasting and Reasoning, and propose the first dataset to cover time-series forecasting and forecast-based reasoning scenarios using real estate data. This task poses challenges in retrieving precise historical data, overcoming the forecasting limitations of LLMs, and standardizing responses for diverse queries. To solve the above challenges, we propose TimeFore, an LLM agent-based framework that decomposes the problem into three collaborative roles: a Retriever autonomously generates SQL to fetch data, a Forecaster invokes external time-series models for higher accuracy, and an Analyzer synthesizes the results to construct a precise and consistent final answer. Extensive experiments demonstrate the effectiveness of our TimeFore.
Problem

Research questions and friction points this paper is trying to address.

Open-Domain Tabular Question Answering
Future Data Forecasting
Time-Series Forecasting
Forecast-Based Reasoning
LLM Limitations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Open-Domain Tabular QA
Future Data Forecasting
LLM Agent Framework
Time Series Reasoning
TimeFore
Z
Zhensheng Wang
School of Artificial Intelligence, Beijing Normal University, Beijing, PR China; Institute of Artificial Intelligence and Future Networks, Beijing Normal University, Zhuhai, PR China
X
Xiaole Liu
Faculty of Arts and Sciences, Beijing Normal University, Zhuhai, PR China
Wenmian Yang
Wenmian Yang
Specially Appointed Associate Professor, Beijing Normal University at Zhuhai
Data MiningMachine LearningNatural Language ProcessingTime series
K
Kun Zhou
School of Artificial Intelligence, Beijing Normal University, Beijing, PR China; Institute of Artificial Intelligence and Future Networks, Beijing Normal University, Zhuhai, PR China
Y
Yiquan Zhang
Institute of Artificial Intelligence and Future Networks, Beijing Normal University, Zhuhai, PR China
Weijia Jia
Weijia Jia
FIEEE, Chair Professor, Beijing Normal University and UIC
Cyber Intelligent ComputingNetworking