跳到主要导航 跳到搜索 跳到主要内容

Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning

  • Huidong Wu
  • , Haojia Xiang
  • , Jingtong Gao
  • , Xiangyu Zhao*
  • , Dengsheng Wu*
  • , Jianping Li
  • *此作品的通讯作者
  • City University of Hong Kong
  • University of Chinese Academy of Sciences
  • Shenzhen University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Scholarly web is a vast network of knowledge connected by citations. However, this system is increasingly compromised by miscitation, where references do not support or even contradict the claims they are cited for. Current miscitation detection methods, which primarily rely on semantic similarity or network anomalies, struggle to capture the nuanced relationship between a citation's context and its place in the wider network. While large language models (LLMs) offer powerful capabilities in semantic reasoning for this task, their deployment is hindered by hallucination risks and high computational costs. In this work, we introduce LLM-Augmented Graph Learning-based Miscitation Detector (LAGMiD), a novel framework that leverages LLMs for deep semantic reasoning over citation graphs and distills this knowledge into graph neural networks (GNNs) for efficient and scalable miscitation detection. Specifically, LAGMiD introduces an evidence-chain reasoning mechanism, which uses chain-of-thought prompting, to perform multi-hop citation tracing and assess semantic fidelity. To reduce LLM inference costs, we design a knowledge distillation method aligning GNN embeddings with intermediate LLM reasoning states. A collaborative learning strategy further routes complex cases to the LLM while optimizing the GNN for structure-based generalization. Experiments on three real-world benchmarks show that LAGMiD achieves state-of-the-art miscitation detection with significantly reduced inference cost.

源语言英语
主期刊名WWW 2026 - Proceedings of the ACM Web Conference 2026
出版商Association for Computing Machinery, Inc
4208-4219
页数12
ISBN(电子版)9798400723070
DOI
出版状态已出版 - 12 4月 2026
已对外发布
活动35th ACM Web Conference, WWW 2026 - Dubai, 阿拉伯联合酋长国
期限: 29 6月 20263 7月 2026

出版系列

姓名WWW 2026 - Proceedings of the ACM Web Conference 2026

会议

会议35th ACM Web Conference, WWW 2026
国家/地区阿拉伯联合酋长国
Dubai
时期29/06/263/07/26

指纹

探究 'Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此