SpartQA: A Textual Question Answering Benchmark for Spatial Reasoning

2104.05832

Authors

Roshanak Mirzaee, Hossein Rajaby Faghihi, Qiang Ning, Parisa Kordjamshidi

Abstract

This paper proposes a question-answering (QA) benchmark for spatial reasoning on natural language text which contains more realistic spatial phenomena not covered by prior work and is challenging for state-of-the-art language models (LMs). We propose a distant supervision method to improve on this task.

Specifically, we design grammar and reasoning rules to automatically generate a spatial description of visual scenes and corresponding QA pairs. Experiments show that further pretraining LMs on these automatically generated data significantly improves LMs' capability on spatial understanding, which in turn helps to better solve two external datasets, bAbI and BoolQ.
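
To make the idea of rule-based generation concrete, here is a minimal toy sketch, not the authors' pipeline. It assumes a hypothetical scene given as object coordinates, a single sentence template standing in for the grammar, and a single coordinate comparison standing in for the reasoning rules. All names (`SCENE`, `relation`, `describe`, `make_qa`) are illustrative assumptions. Questions are asked only about object pairs the story does not state directly, so answering from text alone requires chaining the stated relations.

```python
from itertools import permutations

# Hypothetical toy scene: object name -> (x, y) position.
SCENE = {"circle": (0, 2), "square": (1, 1), "triangle": (2, 0)}


def relation(a, b, scene):
    """Return the horizontal relation of object a with respect to object b."""
    ax, bx = scene[a][0], scene[b][0]
    if ax < bx:
        return "left of"
    if ax > bx:
        return "right of"
    return "aligned with"


def describe(scene, pairs):
    """Template 'grammar' (an assumption): one sentence per stated pair."""
    return [f"The {a} is {relation(a, b, scene)} the {b}." for a, b in pairs]


def make_qa(scene, stated):
    """Build yes/no questions only for pairs the story never mentions."""
    qa = []
    for a, b in permutations(scene, 2):
        if (a, b) in stated or (b, a) in stated:
            continue  # skip pairs whose relation is given in the story
        question = f"Is the {a} left of the {b}?"
        answer = "Yes" if relation(a, b, scene) == "left of" else "No"
        qa.append((question, answer))
    return qa


stated_pairs = [("circle", "square"), ("square", "triangle")]
story = describe(SCENE, stated_pairs)
qa_pairs = make_qa(SCENE, set(stated_pairs))
```

Because the answers are computed from the scene rather than written by hand, every generated pair is labeled automatically, which is the property that makes such data usable as distant supervision.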

We hope that this work can foster investigations into more sophisticated models for spatial reasoning over text.
