Daily AI Research Paper Picks

A daily selection of AI research papers, with translations

Thus Spake Long-Context Large Language Model

Xiaoran Liu, Ruixiao Li, Mianqiu Huang, Zhigeng Liu, Yuerong Song, Qipeng Guo, Siyang He, Qiqi Wang, Linlin Li, Qun Liu, Yaqian Zhou, Xuanjing Huang, Xipeng Qiu · Feb 24, 2025 · 686

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Canyu Zhao, Mingyu Liu, Huanyi Zheng, Muzhi Zhu, Zhiyue Zhao, Hao Chen, Tong He, Chunhua Shen · Feb 24, 2025 · 513

Audio-FLAN: A Preliminary Release

Liumeng Xue, Ziya Zhou, Jiahao Pan, Zixuan Li, Shuai Fan, Yinghao Ma, Sitong Cheng, Dongchao Yang, Haohan Guo, Yujia Xiao, Xinsheng Wang, Zixuan Shen, Chuanbo Zhu, Xinshen Zhang, Tianchi Liu, Ruibin Yuan, Zeyue Tian, Haohe Liu, Emmanouil Benetos, Ge Zhang, Yike Guo, Wei Xue · Feb 23, 2025 · 342

GCC: Generative Color Constancy via Diffusing a Color Checker

Chen-Wei Chang, Cheng-De Fan, Chia-Che Chang, Yi-Chen Lo, Yu-Chee Tseng, Jiun-Long Huang, Yu-Lun Liu · Feb 24, 2025 · 272

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Alexander Zhang, Marcus Dong, Jiaheng Liu, Wei Zhang, Yejie Wang, Jian Yang, Ge Zhang, Tianyu Liu, Zhongyuan Peng, Yingshui Tan, Yuanxing Zhang, Zhexu Wang, Weixun Wang, Yancheng He, Ken Deng, Wangchunshu Zhou, Wenhao Huang, Zhaoxiang Zhang · Feb 23, 2025 · 243

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam

Tianjin Huang, Haotian Hu, Zhenyu Zhang, Gaojie Jin, Xiang Li, Li Shen, Tianlong Chen, Lu Liu, Qingsong Wen, Zhangyang Wang, Shiwei Liu · Feb 24, 2025 · 162

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

Qianqi Yan, Yue Fan, Hongquan Li, Shan Jiang, Yang Zhao, Xinze Guan, Ching-Chen Kuo, Xin Eric Wang · Feb 22, 2025 · 162

Beyond Release: Access Considerations for Generative AI Systems

Irene Solaiman, Rishi Bommasani, Dan Hendrycks, Ariel Herbert-Voss, Yacine Jernite, Aviya Skowron, Andrew Trask · Feb 23, 2025 · 122

X-Dancer: Expressive Music to Human Dance Video Generation

Zeyuan Chen, Hongyi Xu, Guoxian Song, You Xie, Chenxu Zhang, Xin Chen, Chao Wang, Di Chang, Linjie Luo · Feb 24, 2025 · 113

Grounded Persuasive Language Generation for Automated Marketing

Jibang Wu, Chenghao Yang, Simon Mahns, Chaoqi Wang, Hao Zhu, Fei Fang, Haifeng Xu · Feb 24, 2025 · 103

TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning

Giuseppe Paolo, Abdelhakim Benechehab, Hamza Cherkaoui, Albert Thomas, Balázs Kégl · Feb 21, 2025 · 82

Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties

Zhenglin Wang, Jialong Wu, Pengfei LI, Yong Jiang, Deyu Zhou · Feb 24, 2025 · 74

InductionBench: LLMs Fail in the Simplest Complexity Class

Wenyue Hua, Tyler Wong, Sun Fei, Liangming Pan, Adam Jardine, William Yang Wang · Feb 20, 2025 · 62

Investigating the Impact of Quantization Methods on the Safety and Reliability of Large Language Models

Artyom Kharinaev, Viktor Moskvoretskii, Egor Shvetsov, Kseniia Studenikina, Bykov Mikhail, Evgeny Burnaev · Feb 18, 2025 · 62

Pandora3D: A Comprehensive Framework for High-Quality 3D Shape and Texture Generation

Jiayu Yang, Taizhang Shang, Weixuan Sun, Xibin Song, Ziang Cheng, Senbo Wang, Shenzhou Chen, Weizhe Liu, Hongdong Li, Pan Ji · Feb 20, 2025 · 52

Can Community Notes Replace Professional Fact-Checkers?

Nadav Borenstein, Greta Warren, Desmond Elliott, Isabelle Augenstein · Feb 19, 2025 · 52

MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use

Zaid Khan, Ali Farhadi, Ranjay Krishna, Luca Weihs, Mohit Bansal, Tanmay Gupta · Feb 21, 2025 · 42

Mind the Gap! Static and Interactive Evaluations of Large Audio Models

Minzhi Li, William Barr Held, Michael J Ryan, Kunat Pipatanakul, Potsawee Manakul, Hao Zhu, Diyi Yang · Feb 21, 2025 · 32

Early-Exit and Instant Confidence Translation Quality Estimation

Vilém Zouhar, Maike Züfle, Beni Egressy, Julius Cheng, Jan Niehues · Feb 20, 2025 · 32

Self-Taught Agentic Long Context Understanding

Yufan Zhuang, Xiaodong Yu, Jialian Wu, Ximeng Sun, Ze Wang, Jiang Liu, Yusheng Su, Jingbo Shang, Zicheng Liu, Emad Barsoum · Feb 21, 2025 · 22

MONSTER: Monash Scalable Time Series Evaluation Repository

Angus Dempster, Navid Mohammadi Foumani, Chang Wei Tan, Lynn Miller, Amish Mishra, Mahsa Salehi, Charlotte Pelletier, Daniel F. Schmidt, Geoffrey I. Webb · Feb 21, 2025 · 22

MegaLoc: One Retrieval to Place Them All

Gabriele Berton, Carlo Masone · Feb 24, 2025 · 12

The snake in the Brownian sphere

Omer Angel, Emmanuel Jacob, Brett Kolesnik, Grégory Miermont · Feb 18, 2025 · 12