
To Search or Not to Search: Aligning the Decision Boundary of Deep Search Agents via Causal Intervention

  • Wenlin Zhang
  • Kuicai Dong
  • Junyi Li
  • Yingyi Zhang
  • Xiaopeng Li
  • Pengyue Jia
  • Yi Wen
  • Derong Xu
  • Maolin Wang
  • Yichao Wang*
  • Yong Liu
  • Xiangyu Zhao*
  • *Corresponding author for this work
  • City University of Hong Kong
  • Huawei Technologies Co., Ltd.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Deep search agents, which autonomously iterate through multi-turn web-based reasoning, represent a promising paradigm for complex information-seeking tasks. However, current agents suffer from a critical inefficiency: they conduct excessive searches because they cannot accurately judge when to stop searching and start answering. This stems from outcome-centric training that prioritizes final results over the search process itself. We identify the root cause as misaligned decision boundaries, the threshold determining when accumulated information suffices to answer. This causes over-search (redundant searching despite sufficient knowledge) and under-search (premature termination yielding incorrect answers). To address these errors, we propose a comprehensive framework comprising two key components. First, we introduce causal intervention-based diagnosis that identifies boundary errors by comparing factual and counterfactual trajectories at each decision point. Second, we develop Decision Boundary Alignment for Deep Search agents (DAS), which constructs preference datasets from causal feedback and aligns policies via preference optimization. Experiments on public datasets demonstrate that decision boundary errors are pervasive across state-of-the-art agents. Our DAS method effectively calibrates these boundaries, mitigating both over-search and under-search to achieve substantial gains in accuracy and efficiency. Our code and data are publicly available at: https://github.com/Applied-Machine-Learning-Lab/WWW2026-DAS.
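
The abstract does not spell out how the diagnosis or the preference data are built, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that each decision point records the accumulated context and the action taken ("search" or "answer"), and that two hypothetical oracles, `answer_now` and `answer_after_search`, report whether a forced counterfactual rollout from that context would produce a correct answer. All names here (`Step`, `PreferencePair`, `diagnose`, `build_pairs`) are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    context: str  # information accumulated before this decision
    action: str   # factual action taken: "search" or "answer"


@dataclass
class PreferencePair:
    context: str
    chosen: str    # action preferred according to the counterfactual outcome
    rejected: str  # factual action judged to be a boundary error


def diagnose(
    steps: List[Step],
    final_correct: bool,
    answer_now: Callable[[str], bool],           # hypothetical oracle
    answer_after_search: Callable[[str], bool],  # hypothetical oracle
) -> List[str]:
    """Label each decision point by intervening on the action taken there."""
    labels = []
    for step in steps:
        if step.action == "search":
            # Counterfactual: force an answer instead of searching.
            # If that answer would already be correct, the search was redundant.
            labels.append("over-search" if answer_now(step.context) else "ok")
        else:
            # Counterfactual: force one more search before answering.
            # Only relevant when the factual answer turned out wrong.
            fixed = (not final_correct) and answer_after_search(step.context)
            labels.append("under-search" if fixed else "ok")
    return labels


def build_pairs(steps: List[Step], labels: List[str]) -> List[PreferencePair]:
    """Turn diagnosed boundary errors into (chosen, rejected) action pairs."""
    pairs = []
    for step, label in zip(steps, labels):
        if label == "over-search":
            pairs.append(PreferencePair(step.context, "answer", "search"))
        elif label == "under-search":
            pairs.append(PreferencePair(step.context, "search", "answer"))
    return pairs
```

In this sketch, a trajectory that keeps searching after the needed evidence is already in context yields an "answer over search" pair at that step, while a wrong early answer that one more search would have fixed yields a "search over answer" pair. Pairs of this shape could then feed a preference-optimization objective, although the specific objective used by DAS is not stated in the abstract.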

Original language: English
Title of host publication: WWW 2026 - Proceedings of the ACM Web Conference 2026
Publisher: Association for Computing Machinery, Inc
Pages: 2049-2059
Number of pages: 11
ISBN (Electronic): 9798400723070
DOI
Publication status: Published - 12 Apr 2026
Externally published
Event: 35th ACM Web Conference, WWW 2026 - Dubai, United Arab Emirates
Duration: 29 Jun 2026 to 3 Jul 2026

Publication series

Name: WWW 2026 - Proceedings of the ACM Web Conference 2026

Conference

Conference: 35th ACM Web Conference, WWW 2026
Country/Territory: United Arab Emirates
City: Dubai
Period: 29/06/26 to 3/07/26
