ChatPaper.ai
打开菜单
首页
每日论文
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
January 14th, 2025
在数学推理中开发过程奖励模型的教训
The Lessons of Developing Process Reward Models in Mathematical Reasoning
Zhenru Zhang, Chujie Zheng, Yangzhen Wu, Beichen Zhang, Runji Lin, Bowen Yu, Dayiheng Liu, Jingren Zhou, Junyang Lin
•
Jan 13, 2025
•
77
8
张量积注意力就是你所需要的
Tensor Product Attention Is All You Need
Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Zhen Qin, Yang Yuan, Quanquan Gu, Andrew Chi-Chih Yao
•
Jan 11, 2025
•
66
4
Transformer^2:自适应语言模型
Transformer^2: Self-adaptive LLMs
Qi Sun, Edoardo Cetin, Yujin Tang
•
Jan 9, 2025
•
46
6
BIOMEDICA:一个源自科学文献的开放生物医学图像描述存档、数据集和视觉-语言模型
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
Alejandro Lozano, Min Woo Sun, James Burgess, Liangyu Chen, Jeffrey J Nirschl, Jeffrey Gu, Ivan Lopez, Josiah Aklilu, Austin Wolfgang Katzer, Collin Chiu, Anita Rau, Xiaohan Wang, Yuhui Zhang, Alfred Seunghoon Song, Robert Tibshirani, Serena Yeung-Levy
•
Jan 13, 2025
•
45
2
MinMo:一个用于无缝语音交互的多模态大型语言模型
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Qian Chen, Yafeng Chen, Yanni Chen, Mengzhe Chen, Yingda Chen, Chong Deng, Zhihao Du, Ruize Gao, Changfeng Gao, Zhifu Gao, Yabin Li, Xiang Lv, Jiaqing Liu, Haoneng Luo, Bin Ma, Chongjia Ni, Xian Shi, Jialong Tang, Hui Wang, Hao Wang, Wen Wang, Yuxuan Wang, Yunlan Xu, Fan Yu, Zhijie Yan, Yexin Yang, Baosong Yang, Xian Yang, Guanrou Yang, Tianyu Zhao, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Pei Zhang, Chong Zhang, Jinren Zhou
•
Jan 10, 2025
•
32
5
视频作者:迈向长篇故事视频生成
VideoAuteur: Towards Long Narrative Video Generation
Junfei Xiao, Feng Cheng, Lu Qi, Liangke Gui, Jiepeng Cen, Zhibei Ma, Alan Yuille, Lu Jiang
•
Jan 10, 2025
•
30
3
O1 复制之旅 -- 第3部分:用于医学推理的推理时间缩放
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning
Zhongzhen Huang, Gui Geng, Shengyi Hua, Zhen Huang, Haoyang Zou, Shaoting Zhang, Pengfei Liu, Xiaofan Zhang
•
Jan 11, 2025
•
29
2
WebWalker:在网络遍历中对LLM进行基准测试
WebWalker: Benchmarking LLMs in Web Traversal
Jialong Wu, Wenbiao Yin, Yong Jiang, Zhenglin Wang, Zekun Xi, Runnan Fang, Deyu Zhou, Pengjun Xie, Fei Huang
•
Jan 13, 2025
•
18
3
SPAM:具有动量重置的Spike-Aware Adam,用于稳定的LLM训练
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Tianjin Huang, Ziquan Zhu, Gaojie Jin, Lu Liu, Zhangyang Wang, Shiwei Liu
•
Jan 12, 2025
•
14
2
3D中的非常见物体
UnCommon Objects in 3D
Xingchen Liu, Piyush Tayal, Jianyuan Wang, Jesus Zarzar, Tom Monnier, Konstantinos Tertikas, Jiali Duan, Antoine Toisoul, Jason Y. Zhang, Natalia Neverova, Andrea Vedaldi, Roman Shapovalov, David Novotny
•
Jan 13, 2025
•
12
2
ChemAgent:大型语言模型中的自更新库改进化学推理
ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning
Xiangru Tang, Tianyu Hu, Muyang Ye, Yanjun Shao, Xunjian Yin, Siru Ouyang, Wangchunshu Zhou, Pan Lu, Zhuosheng Zhang, Yilun Zhao, Arman Cohan, Mark Gerstein
•
Jan 11, 2025
•
7
2
通过模拟模型权重来评估数据选择的样本效用
Evaluating Sample Utility for Data Selection by Mimicking Model Weights
Tzu-Heng Huang, Manjot Bilkhu, Frederic Sala, Javier Movellan
•
Jan 12, 2025
•
5
2