Content not yet available
This lecture has no active video or poster.
Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
Temporal knowledge graph question answering (TKGQA) involves multi-hop reasoning over temporally constrained entity relationships in knowledge graph to answer a given question. However, at each hop, large language models (LLMs) retrieve subgraphs with numerous temporally similar and semantically complex relations, increasing the risk of suboptimal decisions and error propagation. To address these challenges, we propose the multi-hop reasoning enhanced (MRE) framework, which enhances both forward and backward reasoning to improve the identification of globally optimal reasoning trajectories. Specifically, MRE begins with prompt engineering to guide LLM in generating diverse reasoning trajectories for the given question. Valid reasoning trajectories are then selected to supervise fine-tuning, serving as a cold-start strategy. Finally, we introduce tree-group relative policy optimization (T-GRPO)—a recursive, tree-structured learning-by-exploration approach. At each hop, exploration establishes strong causal dependencies on previous hop, while evaluation is informed by multi-path exploration feedback from subsequent hops. Experiments on two TKGQA benchmarks show that our MRE-based model outperforms state-of-the-art(SOTA) methods by 5.2% in accuracy on complex multi-hop queries. Further analysis highlights improved interpretability and robustness to noisy temporal annotations.