ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
February 11th, 2025
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling
Runze Liu, Junqi Gao, Jian Zhao, Kaiyan Zhang, Xiu Li, Biqing Qi, Wanli Ouyang, Bowen Zhou
Feb 10, 2025 • 142 • 6
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators
Daniil Moskovskiy, Nikita Sushko, Sergey Pletenev, Elena Tutubalina, Alexander Panchenko
Feb 10, 2025 • 86 • 2
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Chengqi Lyu, Songyang Gao, Yuzhe Gu, Wenwei Zhang, Jianfei Gao, Kuikun Liu, Ziyi Wang, Shuaibin Li, Qian Zhao, Haian Huang, Weihan Cao, Jiangning Liu, Hongwei Liu, Junnan Liu, Songyang Zhang, Dahua Lin, Kai Chen
Feb 10, 2025 • 60 • 6
The Curse of Depth in Large Language Models
Wenfang Sun, Xinyuan Song, Pengxiang Li, Lu Yin, Yefeng Zheng, Shiwei Liu
Feb 9, 2025 • 37 • 5
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Bidipta Sarkar, Warren Xia, C. Karen Liu, Dorsa Sadigh
Feb 9, 2025 • 34 • 3
LM2: Large Memory Models
Jikun Kang, Wenqi Wu, Filippos Christianos, Alex J. Chan, Fraser Greenlee, George Thomas, Marvin Purtorab, Andy Toulis
Feb 9, 2025 • 30 • 7
Matryoshka Quantization
Pranav Nair, Puranjay Datta, Jeff Dean, Prateek Jain, Aditya Kusupati
Feb 10, 2025 • 29 • 4
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging
Md. Ashraful Islam, Mohammed Eunus Ali, Md Rizwan Parvez
Feb 8, 2025 • 23 • 3
Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation
Chenkai Xu, Xu Wang, Zhenyi Liao, Yishun Li, Tianqi Hou, Zhijie Deng
Feb 8, 2025 • 22 • 2
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates
Ling Yang, Zhaochen Yu, Bin Cui, Mengdi Wang
Feb 10, 2025 • 21 • 3
Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding
Sukmin Cho, Sangjin Choi, Taeho Hwang, Jeongyeon Seo, Soyeong Jeong, Huije Lee, Hoyun Song, Jong C. Park, Youngjin Kwon
Feb 8, 2025 • 18 • 3
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents
Jiabin Tang, Tianyu Fan, Chao Huang
Feb 9, 2025 • 16 • 2
Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT
Dongyang Liu, Shicheng Li, Yutong Liu, Zhen Li, Kai Wang, Xinyue Li, Qi Qin, Yufei Liu, Yi Xin, Zhongyu Li, Bin Fu, Chenyang Si, Yuewen Cao, Conghui He, Ziwei Liu, Yu Qiao, Qibin Hou, Hongsheng Li, Peng Gao
Feb 10, 2025 • 13 • 2
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models
Haiwen Diao, Xiaotong Li, Yufeng Cui, Yueze Wang, Haoge Deng, Ting Pan, Wenxuan Wang, Huchuan Lu, Xinlong Wang
Feb 10, 2025 • 12 • 2
History-Guided Video Diffusion
Kiwhan Song, Boyuan Chen, Max Simchowitz, Yilun Du, Russ Tedrake, Vincent Sitzmann
Feb 10, 2025 • 12 • 2
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Zhuowei Li, Haizhou Shi, Yunhe Gao, Di Liu, Zhenting Wang, Yuxiao Chen, Ting Liu, Long Zhao, Hao Wang, Dimitris N. Metaxas
Feb 5, 2025 • 12 • 3
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers
D. She, Mushui Liu, Jingxuan Pang, Jin Wang, Zhen Yang, Wanggui He, Guanghao Zhang, Yi Wang, Qihan Huang, Haobin Tang, Yunlong Yu, Siming Fu
Feb 10, 2025 • 11 • 2
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Hangliang Ding, Dacheng Li, Runlong Su, Peiyuan Zhang, Zhijie Deng, Ion Stoica, Hao Zhang
Feb 10, 2025 • 9 • 2
Dual Caption Preference Optimization for Diffusion Models
Amir Saeidi, Yiran Luo, Agneet Chatterjee, Shamanthak Hegde, Bimsara Pathiraja, Yezhou Yang, Chitta Baral
Feb 9, 2025 • 9 • 2
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
Zhenglin Zhou, Xiaobo Xia, Fan Ma, Hehe Fan, Yi Yang, Tat-Seng Chua
Feb 5, 2025 • 7 • 2
Towards Internet-Scale Training For Agents
Brandon Trabucco, Gunnar Sigurdsson, Robinson Piramuthu, Ruslan Salakhutdinov
Feb 10, 2025 • 6 • 2
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding
Xinyu Yang, Tianqi Chen, Beidi Chen
Feb 8, 2025 • 6 • 4
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE
Haiduo Huang, Fuwei Yang, Zhenhua Liu, Yixing Xu, Jinze Li, Yang Liu, Xuanwu Yin, Dong Li, Pengju Ren, Emad Barsoum
Feb 10, 2025 • 5 • 2
Steel-LLM: From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM
Qingshui Gu, Shu Li, Tianyu Zheng, Zhaoxiang Zhang
Feb 10, 2025 • 4 • 2
Forbidden Science: Dual-Use AI Challenge Benchmark and Scientific Refusal Tests
David Noever, Forrest McKee
Feb 8, 2025 • 1 • 2
Embodied Red Teaming for Auditing Robotic Foundation Models
Sathwik Karnik, Zhang-Wei Hong, Nishant Abhangi, Yen-Chen Lin, Tsun-Hsuan Wang, Christophe Dupuy, Rahul Gupta, Pulkit Agrawal
Nov 27, 2024 • 1 • 2