跳到主要导航 跳到搜索 跳到主要内容

AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration

  • Minjiang Huang
  • , Jipeng Qiang*
  • , Yi Zhu
  • , Chaowei Zhang
  • , Xiangyu Zhao
  • , Kui Yu
  • *此作品的通讯作者
  • Yangzhou University
  • City University of Hong Kong
  • Hefei University of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Audiobook interpretations are attracting increasing attention, as they provide accessible and in-depth analyses of books that offer readers practical insights and intellectual inspiration. However, their manual creation process remains time-consuming and resource-intensive. To address this challenge, we propose AI4Reading, a multi-agent collaboration system leveraging large language models (LLMs) and speech synthesis technology to generate podcast-like audiobook interpretations. The system is designed to meet three key objectives: accurate content preservation, enhanced comprehensibility, and a logical narrative structure. To achieve these goals, we develop a framework composed of 11 specialized agents—including topic analysts, case analysts, editors, a narrator, and proofreaders—that work in concert to explore themes, extract real-world cases, refine content organization, and synthesize natural spoken language. By comparing expert interpretations with our system’s output, the results show that although AI4Reading still has a gap in speech generation quality, the generated interpretative scripts are simpler and more accurate.

源语言英语
主期刊名System Demonstrations
编辑Pushkar Mishra, Smaranda Muresan, Tao Yu
出版商Association for Computational Linguistics (ACL)
211-220
页数10
ISBN(电子版)9798891762534
DOI
出版状态已出版 - 2025
已对外发布
活动63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 - Vienna, 奥地利
期限: 27 7月 20251 8月 2025

出版系列

姓名Proceedings of the Annual Meeting of the Association for Computational Linguistics
3
ISSN(印刷版)0736-587X

会议

会议63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025
国家/地区奥地利
Vienna
时期27/07/251/08/25

指纹

探究 'AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration' 的科研主题。它们共同构成独一无二的指纹。

引用此