ChatPaper.ai
打开菜单
首页
每日论文
arXiv
HuggingFace
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
April 16th, 2025
xVerify:推理模型评估中的高效答案验证器
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Ding Chen, Qingchen Yu, Pengyuan Wang, Wentao Zhang, Bo Tang, Feiyu Xiong, Xinchi Li, Minchuan Yang, Zhiyu Li
•
Apr 14, 2025
•
84
2
天才:一种通用且纯无监督的自训练框架 面向高级推理
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Fangzhi Xu, Hang Yan, Chang Ma, Haiteng Zhao, Qiushi Sun, Kanzhi Cheng, Junxian He, Jun Liu, Zhiyong Wu
•
Apr 11, 2025
•
53
2
Seedream 3.0 技术报告
Seedream 3.0 Technical Report
Yu Gao, Lixue Gong, Qiushan Guo, Xiaoxia Hou, Zhichao Lai, Fanshi Li, Liang Li, Xiaochen Lian, Chao Liao, Liyang Liu, Wei Liu, Yichun Shi, Shiqi Sun, Yu Tian, Zhi Tian, Peng Wang, Rui Wang, Xuanda Wang, Xun Wang, Ye Wang, Guofeng Wu, Jie Wu, Xin Xia, Xuefeng Xiao, Zhonghua Zhai, Xinyu Zhang, Qi Zhang, Yuwei Zhang, Shijia Zhao, Jianchao Yang, Weilin Huang
•
Apr 15, 2025
•
46
5
指令与推理数据如何塑造后训练:从层级梯度视角看数据质量
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
Ming Li, Yanhong Li, Ziyue Li, Tianyi Zhou
•
Apr 14, 2025
•
39
2
Heimdall:生成式验证中的测试时缩放
Heimdall: test-time scaling on the generative verification
Wenlei Shi, Xing Jin
•
Apr 14, 2025
•
32
2
Pixel-SAIL:面向像素级理解的单一Transformer模型
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Tao Zhang, Xiangtai Li, Zilong Huang, Yanwei Li, Weixian Lei, Xueqing Deng, Shihao Chen, Shunping Ji, Jiashi Feng
•
Apr 14, 2025
•
28
3
文本竞技场
TextArena
Leon Guertler, Bobby Cheng, Simon Yu, Bo Liu, Leshem Choshen, Cheston Tan
•
Apr 15, 2025
•
27
3
高效推理模型:综述
Efficient Reasoning Models: A Survey
Sicheng Feng, Gongfan Fang, Xinyin Ma, Xinchao Wang
•
Apr 15, 2025
•
18
4
NormalCrafter:基于扩散先验从视频中学习时序一致的法线
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Yanrui Bin, Wenbo Hu, Haoyuan Wang, Xinya Chen, Bing Wang
•
Apr 15, 2025
•
17
2
DataDecide:如何通过小型实验预测最佳预训练数据
DataDecide: How to Predict Best Pretraining Data with Small Experiments
Ian Magnusson, Nguyen Tai, Ben Bogin, David Heineman, Jena D. Hwang, Luca Soldaini, Akshita Bhagia, Jiacheng Liu, Dirk Groeneveld, Oyvind Tafjord, Noah A. Smith, Pang Wei Koh, Jesse Dodge
•
Apr 15, 2025
•
16
2
简约的可扩展性:基于单一Transformer的视觉-语言学习实证分析
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
Weixian Lei, Jiacong Wang, Haochen Wang, Xiangtai Li, Jun Hao Liew, Jiashi Feng, Zilong Huang
•
Apr 14, 2025
•
15
3
极简主义视角下的LLM推理:从拒绝采样到强化学习
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
Wei Xiong, Jiarui Yao, Yuhui Xu, Bo Pang, Lei Wang, Doyen Sahoo, Junnan Li, Nan Jiang, Tong Zhang, Caiming Xiong, Hanze Dong
•
Apr 15, 2025
•
14
4
ReZero:通过“再试一次”提升大语言模型的搜索能力
ReZero: Enhancing LLM search ability by trying one-more-time
Alan Dao, Thinh Le
•
Apr 15, 2025
•
14
2
通过主动学习实现高效过程奖励模型训练
Efficient Process Reward Model Training via Active Learning
Keyu Duan, Zichen Liu, Xin Mao, Tianyu Pang, Changyu Chen, Qiguang Chen, Michael Qizhe Shieh, Longxu Dou
•
Apr 14, 2025
•
13
2
SimpleAR:通过预训练、监督微调与强化学习推进自回归视觉生成的前沿
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL
Junke Wang, Zhi Tian, Xun Wang, Xinyu Zhang, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang
•
Apr 15, 2025
•
12
1
通过嵌入表示预热实现高效生成模型训练
Efficient Generative Model Training via Embedded Representation Warmup
Deyuan Liu, Peng Sun, Xufeng Li, Tao Lin
•
Apr 14, 2025
•
12
2
D^2iT:动态扩散Transformer,实现精准图像生成
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation
Weinan Jia, Mengqi Huang, Nan Chen, Lei Zhang, Zhendong Mao
•
Apr 13, 2025
•
12
2
DeepMath-103K:一个大规模、具挑战性、去污染且可验证的数学数据集,用于推进推理研究
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
Zhiwei He, Tian Liang, Jiahao Xu, Qiuzhi Liu, Xingyu Chen, Yue Wang, Linfeng Song, Dian Yu, Zhenwen Liang, Wenxuan Wang, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu
•
Apr 15, 2025
•
11
6
RealHarm:真实世界语言模型应用失败案例集
RealHarm: A Collection of Real-World Language Model Application Failures
Pierre Le Jeune, Jiaen Liu, Luca Rossi, Matteo Dora
•
Apr 14, 2025
•
11
3
通过分组感知SSM剪枝实现高效混合语言模型压缩
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
Ali Taghibakhshi, Sharath Turuvekere Sreenivas, Saurav Muralidharan, Marcin Chochowski, Yashaswi Karnati, Raviraj Joshi, Ameya Sunil Mahabaleshwarkar, Zijia Chen, Yoshi Suhara, Oluwatobi Olabiyi, Daniel Korzekwa, Mostofa Patwary, Mohammad Shoeybi, Jan Kautz, Bryan Catanzaro, Ashwath Aithal, Nima Tajbakhsh, Pavlo Molchanov
•
Apr 15, 2025
•
10
2
视觉谜题:将多模态推理评估与领域知识解耦
VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge
Yueqi Song, Tianyue Ou, Yibo Kong, Zecheng Li, Graham Neubig, Xiang Yue
•
Apr 14, 2025
•
10
2
AI-大学:一个基于大语言模型的平台,旨在实现科学课堂的教学一致性
AI-University: An LLM-based platform for instructional alignment to scientific classrooms
Mostafa Faghih Shojaei, Rahul Gulati, Benjamin A. Jasperson, Shangshang Wang, Simone Cimolato, Dangli Cao, Willie Neiswanger, Krishna Garikipati
•
Apr 11, 2025
•
8
2
PVUW 2025挑战赛报告:复杂野外场景视频像素级理解的新进展
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Henghui Ding, Chang Liu, Nikhila Ravi, Shuting He, Yunchao Wei, Song Bai, Philip Torr, Kehuan Song, Xinglin Xie, Kexin Zhang, Licheng Jiao, Lingling Li, Shuyuan Yang, Xuqiang Cao, Linnan Zhao, Jiaxuan Zhao, Fang Liu, Mengjiao Wang, Junpei Zhang, Xu Liu, Yuting Yang, Mengru Ma, Hao Fang, Runmin Cong, Xiankai Lu, Zhiyang Che, Wei Zhan, Tianming Liang, Haichao Jiang, Wei-Shi Zheng, Jian-Fang Hu, Haobo Yuan, Xiangtai Li, Tao Zhang, Lu Qi, Ming-Hsuan Yang
•
Apr 15, 2025
•
6
2
扩散蒸馏与直接偏好优化相结合的高效3D LiDAR场景补全
Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion
An Zhaol, Shengyuan Zhang, Ling Yang, Zejian Li, Jiale Wu, Haoran Xu, AnYang Wei, Perry Pengyun GU Lingyun Sun
•
Apr 15, 2025
•
5
2
基于时序动态上下文的多模态长视频建模
Multimodal Long Video Modeling Based on Temporal Dynamic Context
Haoran Hao, Jiaming Han, Yiyuan Zhang, Xiangyu Yue
•
Apr 14, 2025
•
4
2
LazyReview:揭示NLP同行评审中惰性思维的数据集
LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews
Sukannya Purkayastha, Zhuang Li, Anne Lauscher, Lizhen Qu, Iryna Gurevych
•
Apr 15, 2025
•
3
2
多模态演示摘要的视觉-语言模型研究: 模态与结构效应分析
Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure
Théo Gigant, Camille Guinaudeau, Frédéric Dufaux
•
Apr 14, 2025
•
3
2
自适应计算剪枝的遗忘Transformer
Adaptive Computation Pruning for the Forgetting Transformer
Zhixuan Lin, Johan Obando-Ceron, Xu Owen He, Aaron Courville
•
Apr 9, 2025
•
3
2
将生成式去噪与判别式目标对齐,释放扩散模型在视觉感知中的潜力
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang, Xin Xu, Yu-Xiong Wang
•
Apr 15, 2025
•
2
2
遥感变化检测中的状态空间模型变换
Change State Space Models for Remote Sensing Change Detection
Elman Ghazaei, Erchan Aptoula
•
Apr 15, 2025
•
1
2