DS1 spectrogram: MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation
  through Question Complexity

MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity

2412.01572

Authors

Nan Du,Qi Li,Sihong Xie,Xiaqiang Tang,Qiang Gao

Abstract

Retrieval Augmented Generation (RAG) has proven to be highly effective in boosting the generative performance of language model in knowledge-intensive tasks. However, existing RAG framework either indiscriminately perform retrieval or rely on rigid single-class classifiers to select retrieval methods, leading to inefficiencies and suboptimal performance across queries of varying complexity.

To address these challenges, we propose a reinforcement learning-based framework that dynamically selects the most suitable retrieval strategy based on query complexity. % our solution Our approach leverages a multi-armed bandit algorithm, which treats each retrieval method as a distinct "arm" and adapts the selection process by balancing exploration and exploitation.

Additionally, we introduce a dynamic reward function that balances accuracy and efficiency, penalizing methods that require more retrieval steps, even if they lead to a correct result. Our method achieves new state of the art results on multiple single-hop and multi-hop datasets while reducing retrieval costs.

Our code are available at https://github.com/FUTUREEEEEE/MBA .

Resources

Stay in the loop

Every AI paper that matters, free in your inbox daily.

Details

  • takara.ai
  • Custom AI and machine learning from the Frontier Research Team.
  • © 2026 takara.ai Ltd
  • Content is sourced from third-party publications.