ChatPaper.ai
打開菜單
首頁
每日論文
arXiv
HuggingFace
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
April 16th, 2025
xVerify:推理模型評估的高效答案驗證器
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Ding Chen, Qingchen Yu, Pengyuan Wang, Wentao Zhang, Bo Tang, Feiyu Xiong, Xinchi Li, Minchuan Yang, Zhiyu Li
•
Apr 14, 2025
•
77
2
Genius:一個通用且純無監督的自訓練框架 用於高級推理
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning
Fangzhi Xu, Hang Yan, Chang Ma, Haiteng Zhao, Qiushi Sun, Kanzhi Cheng, Junxian He, Jun Liu, Zhiyong Wu
•
Apr 11, 2025
•
51
2
指令與推理數據如何塑造後訓練:從層級梯度視角看數據質量
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
Ming Li, Yanhong Li, Ziyue Li, Tianyi Zhou
•
Apr 14, 2025
•
36
2
Seedream 3.0 技術報告
Seedream 3.0 Technical Report
Yu Gao, Lixue Gong, Qiushan Guo, Xiaoxia Hou, Zhichao Lai, Fanshi Li, Liang Li, Xiaochen Lian, Chao Liao, Liyang Liu, Wei Liu, Yichun Shi, Shiqi Sun, Yu Tian, Zhi Tian, Peng Wang, Rui Wang, Xuanda Wang, Xun Wang, Ye Wang, Guofeng Wu, Jie Wu, Xin Xia, Xuefeng Xiao, Zhonghua Zhai, Xinyu Zhang, Qi Zhang, Yuwei Zhang, Shijia Zhao, Jianchao Yang, Weilin Huang
•
Apr 15, 2025
•
35
3
Heimdall:生成式验证中的测试时缩放
Heimdall: test-time scaling on the generative verification
Wenlei Shi, Xing Jin
•
Apr 14, 2025
•
29
2
Pixel-SAIL:基於像素理解的單一Transformer模型
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Tao Zhang, Xiangtai Li, Zilong Huang, Yanwei Li, Weixian Lei, Xueqing Deng, Shihao Chen, Shunping Ji, Jiashi Feng
•
Apr 14, 2025
•
26
3
文本競技場
TextArena
Leon Guertler, Bobby Cheng, Simon Yu, Bo Liu, Leshem Choshen, Cheston Tan
•
Apr 15, 2025
•
24
3
高效推理模型:綜述
Efficient Reasoning Models: A Survey
Sicheng Feng, Gongfan Fang, Xinyin Ma, Xinchao Wang
•
Apr 15, 2025
•
16
4
NormalCrafter:從影片中學習時間一致的法線 擴散先驗
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Yanrui Bin, Wenbo Hu, Haoyuan Wang, Xinya Chen, Bing Wang
•
Apr 15, 2025
•
14
2
DataDecide:如何通過小型實驗預測最佳預訓練數據
DataDecide: How to Predict Best Pretraining Data with Small Experiments
Ian Magnusson, Nguyen Tai, Ben Bogin, David Heineman, Jena D. Hwang, Luca Soldaini, Akshita Bhagia, Jiacheng Liu, Dirk Groeneveld, Oyvind Tafjord, Noah A. Smith, Pang Wei Koh, Jesse Dodge
•
Apr 15, 2025
•
13
2
簡約之可擴展性:基於單一Transformer的視覺-語言學習實證分析
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
Weixian Lei, Jiacong Wang, Haochen Wang, Xiangtai Li, Jun Hao Liew, Jiashi Feng, Zilong Huang
•
Apr 14, 2025
•
13
3
透過主動學習實現高效的過程獎勵模型訓練
Efficient Process Reward Model Training via Active Learning
Keyu Duan, Zichen Liu, Xin Mao, Tianyu Pang, Changyu Chen, Qiguang Chen, Michael Qizhe Shieh, Longxu Dou
•
Apr 14, 2025
•
11
2
DeepMath-103K:一個大規模、具挑戰性、去污染且可驗證的數學數據集,用於推進推理能力
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
Zhiwei He, Tian Liang, Jiahao Xu, Qiuzhi Liu, Xingyu Chen, Yue Wang, Linfeng Song, Dian Yu, Zhenwen Liang, Wenxuan Wang, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu
•
Apr 15, 2025
•
10
6
SimpleAR:透過預訓練、監督微調與強化學習推進自回歸視覺生成的前沿
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL
Junke Wang, Zhi Tian, Xun Wang, Xinyu Zhang, Weilin Huang, Zuxuan Wu, Yu-Gang Jiang
•
Apr 15, 2025
•
10
1
RealHarm:真實世界語言模型應用失敗案例集
RealHarm: A Collection of Real-World Language Model Application Failures
Pierre Le Jeune, Jiaen Liu, Luca Rossi, Matteo Dora
•
Apr 14, 2025
•
10
3
透過嵌入表示預熱實現高效生成模型訓練
Efficient Generative Model Training via Embedded Representation Warmup
Deyuan Liu, Peng Sun, Xufeng Li, Tao Lin
•
Apr 14, 2025
•
10
2
D^2iT:動態擴散變壓器用於精確圖像生成
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation
Weinan Jia, Mengqi Huang, Nan Chen, Lei Zhang, Zhendong Mao
•
Apr 13, 2025
•
10
2
通過群組感知SSM剪枝實現高效混合語言模型壓縮
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
Ali Taghibakhshi, Sharath Turuvekere Sreenivas, Saurav Muralidharan, Marcin Chochowski, Yashaswi Karnati, Raviraj Joshi, Ameya Sunil Mahabaleshwarkar, Zijia Chen, Yoshi Suhara, Oluwatobi Olabiyi, Daniel Korzekwa, Mostofa Patwary, Mohammad Shoeybi, Jan Kautz, Bryan Catanzaro, Ashwath Aithal, Nima Tajbakhsh, Pavlo Molchanov
•
Apr 15, 2025
•
9
2
從拒絕抽樣到強化學習:大型語言模型推理的極簡主義方法
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
Wei Xiong, Jiarui Yao, Yuhui Xu, Bo Pang, Lei Wang, Doyen Sahoo, Junnan Li, Nan Jiang, Tong Zhang, Caiming Xiong, Hanze Dong
•
Apr 15, 2025
•
9
3
ReZero:通過再試一次來提升大型語言模型的搜索能力
ReZero: Enhancing LLM search ability by trying one-more-time
Alan Dao, Thinh Le
•
Apr 15, 2025
•
9
2
視覺謎題:將多模態推理評估與領域知識解耦
VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge
Yueqi Song, Tianyue Ou, Yibo Kong, Zecheng Li, Graham Neubig, Xiang Yue
•
Apr 14, 2025
•
9
2
AI-大學:一個基於大型語言模型的教學對齊平台,專為科學課堂設計
AI-University: An LLM-based platform for instructional alignment to scientific classrooms
Mostafa Faghih Shojaei, Rahul Gulati, Benjamin A. Jasperson, Shangshang Wang, Simone Cimolato, Dangli Cao, Willie Neiswanger, Krishna Garikipati
•
Apr 11, 2025
•
6
2
擴散蒸餾與直接偏好優化於高效3D LiDAR場景補全
Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion
An Zhaol, Shengyuan Zhang, Ling Yang, Zejian Li, Jiale Wu, Haoran Xu, AnYang Wei, Perry Pengyun GU Lingyun Sun
•
Apr 15, 2025
•
4
2
PVUW 2025 挑戰賽報告:複雜野外場景中像素級視頻理解的新進展
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Henghui Ding, Chang Liu, Nikhila Ravi, Shuting He, Yunchao Wei, Song Bai, Philip Torr, Kehuan Song, Xinglin Xie, Kexin Zhang, Licheng Jiao, Lingling Li, Shuyuan Yang, Xuqiang Cao, Linnan Zhao, Jiaxuan Zhao, Fang Liu, Mengjiao Wang, Junpei Zhang, Xu Liu, Yuting Yang, Mengru Ma, Hao Fang, Runmin Cong, Xiankai Lu, Zhiyang Che, Wei Zhan, Tianming Liang, Haichao Jiang, Wei-Shi Zheng, Jian-Fang Hu, Haobo Yuan, Xiangtai Li, Tao Zhang, Lu Qi, Ming-Hsuan Yang
•
Apr 15, 2025
•
4
2
基於時序動態上下文的多模態長視頻建模
Multimodal Long Video Modeling Based on Temporal Dynamic Context
Haoran Hao, Jiaming Han, Yiyuan Zhang, Xiangyu Yue
•
Apr 14, 2025
•
3
2
自適應計算剪枝於遺忘變換器
Adaptive Computation Pruning for the Forgetting Transformer
Zhixuan Lin, Johan Obando-Ceron, Xu Owen He, Aaron Courville
•
Apr 9, 2025
•
3
2
LazyReview:一個揭露NLP同行評審中懶惰思維的數據集
LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews
Sukannya Purkayastha, Zhuang Li, Anne Lauscher, Lizhen Qu, Iryna Gurevych
•
Apr 15, 2025
•
2
2
多模态演示摘要与视觉-语言模型: 模态与结构影响的研究
Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure
Théo Gigant, Camille Guinaudeau, Frédéric Dufaux
•
Apr 14, 2025
•
2
2
將生成式去噪與判別式目標對齊,釋放擴散模型在視覺感知中的潛力
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang, Xin Xu, Yu-Xiong Wang
•
Apr 15, 2025
•
1
2
用於遙感變化檢測的狀態空間模型變更
Change State Space Models for Remote Sensing Change Detection
Elman Ghazaei, Erchan Aptoula
•
Apr 15, 2025
•
0
2