ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
January 7th, 2025
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
Rui Xie, Yinhong Liu, Penghao Zhou, Chen Zhao, Jun Zhou, Kai Zhang, Zhenyu Zhang, Jian Yang, Zhenheng Yang, Ying Tai
•
Jan 6, 2025
•
51
3
Test-time Computing: from System-1 Thinking to System-2 Thinking
Yixin Ji, Juntao Li, Hai Ye, Kaixin Wu, Jia Xu, Linjian Mo, Min Zhang
•
Jan 5, 2025
•
40
2
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning
Beichen Zhang, Yuhong Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Haodong Duan, Yuhang Cao, Dahua Lin, Jiaqi Wang
•
Jan 6, 2025
•
35
2
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
Rui Qian, Shuangrui Ding, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Dahua Lin, Jiaqi Wang
•
Jan 6, 2025
•
33
3
Personalized Graph-Based Retrieval for Large Language Models
Steven Au, Cameron J. Dimacali, Ojasmitha Pedirappagari, Namyong Park, Franck Dernoncourt, Yu Wang, Nikos Kanakaris, Hanieh Deilamsalehy, Ryan A. Rossi, Nesreen K. Ahmed
•
Jan 4, 2025
•
28
2
Scaling Laws for Floating Point Quantization Training
Xingwu Sun, Shuaipeng Li, Ruobing Xie, Weidong Han, Kan Wu, Zhen Yang, Yixing Li, An Wang, Shuai Li, Jinbao Xue, Yu Cheng, Yangyu Tao, Zhanhui Kang, Chengzhong Xu, Di Wang, Jie Jiang
•
Jan 5, 2025
•
25
2
TransPixar: Advancing Text-to-Video Generation with Transparency
Luozhou Wang, Yijun Li, Zhifei Chen, Jui-Hsien Wang, Zhifei Zhang, He Zhang, Zhe Lin, Yingcong Chen
•
Jan 6, 2025
•
22
4
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring
Ollie Liu, Sami Jaghouar, Johannes Hagemann, Shangshang Wang, Jason Wiemels, Jeff Kaufman, Willie Neiswanger
•
Jan 3, 2025
•
21
2
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation
Guy Yariv, Yuval Kirstain, Amit Zohar, Shelly Sheynin, Yaniv Taigman, Yossi Adi, Sagie Benaim, Adam Polyak
•
Jan 6, 2025
•
19
2
GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
Weikang Bian, Zhaoyang Huang, Xiaoyu Shi, Yijin Li, Fu-Yun Wang, Hongsheng Li
•
Jan 5, 2025
•
17
2
Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Yanjiang Liu, Shuhen Zhou, Yaojie Lu, Huijia Zhu, Weiqiang Wang, Hongyu Lin, Ben He, Xianpei Han, Le Sun
•
Jan 3, 2025
•
17
2
DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
Ziyang Song, Zerong Wang, Bo Li, Hao Zhang, Ruijie Zhu, Li Liu, Peng-Tao Jiang, Tianzhu Zhang
•
Jan 5, 2025
•
15
4
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models
Mingyang Song, Zhaochen Su, Xiaoye Qu, Jiawei Zhou, Yu Cheng
•
Jan 6, 2025
•
14
2
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Junjie Ye, Zhengyin Du, Xuesong Yao, Weijian Lin, Yufei Xu, Zehui Chen, Zaiyuan Wang, Sining Zhu, Zhiheng Xi, Siyu Yuan, Tao Gui, Qi Zhang, Xuanjing Huang, Jiechao Chen
•
Jan 5, 2025
•
10
3
AutoPresent: Designing Structured Visuals from Scratch
Jiaxin Ge, Zora Zhiruo Wang, Xuhui Zhou, Yi-Hao Peng, Sanjay Subramanian, Qinyue Tan, Maarten Sap, Alane Suhr, Daniel Fried, Graham Neubig, Trevor Darrell
•
Jan 1, 2025
•
8
2
Samba-ASR: State-of-the-Art Speech Recognition Leveraging Structured State-Space Models
Syed Abdul Gaffar Shakhadri, Kruthika KR, Kartik Basavaraj Angadi
•
Jan 6, 2025
•
8
3
Ingredients: Blending Custom Photos with Video Diffusion Transformers
Zhengcong Fei, Debang Li, Di Qiu, Changqian Yu, Mingyuan Fan
•
Jan 3, 2025
•
8
2
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
Yuhui Zhang, Yuchang Su, Yiming Liu, Xiaohan Wang, James Burgess, Elaine Sui, Chenyu Wang, Josiah Aklilu, Alejandro Lozano, Anjiang Wei, Ludwig Schmidt, Serena Yeung-Levy
•
Jan 6, 2025
•
7
2
ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking
Tingyang Zhang, Chen Wang, Zhiyang Dou, Qingzhe Gao, Jiahui Lei, Baoquan Chen, Lingjie Liu
•
Jan 6, 2025
•
4
2