ChatPaper.ai
Daily AI Research Paper Picks
A daily selection of AI research papers, with translations
February 13th, 2025
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models
Xu Huang, Wenhao Zhu, Hanxu Hu, Conghui He, Lei Li, Shujian Huang, Fei Yuan
•
Feb 11, 2025
•
51
2
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance
Lingfei Qian, Weipeng Zhou, Yan Wang, Xueqing Peng, Jimin Huang, Qianqian Xie
•
Feb 12, 2025
•
50
5
TransMLA: Multi-head Latent Attention Is All You Need
Fanxu Meng, Zengwei Yao, Muhan Zhang
•
Feb 11, 2025
•
47
9
Distillation Scaling Laws
Dan Busbridge, Amitis Shidani, Floris Weers, Jason Ramapuram, Etai Littwin, Russ Webb
•
Feb 12, 2025
•
46
4
TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation
Alex Jinpeng Wang, Dongxing Mao, Jiawei Zhang, Weiming Han, Zhuobai Dong, Linjie Li, Yiqi Lin, Zhengyuan Yang, Libo Qin, Fuwei Zhang, Lijuan Wang, Min Li
•
Feb 11, 2025
•
43
2
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
Yujie Zhou, Jiazi Bu, Pengyang Ling, Pan Zhang, Tong Wu, Qidong Huang, Jinsong Li, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Anyi Rao, Jiaqi Wang, Li Niu
•
Feb 12, 2025
•
41
2
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation
Qinghe Wang, Yawen Luo, Xiaoyu Shi, Xu Jia, Huchuan Lu, Tianfan Xue, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai
•
Feb 12, 2025
•
37
2
LLM Pretraining with Continuous Concepts
Jihoon Tack, Jack Lanchantin, Jane Yu, Andrew Cohen, Ilia Kulikov, Janice Lan, Shibo Hao, Yuandong Tian, Jason Weston, Xian Li
•
Feb 12, 2025
•
28
4
WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation
Henry Hengyuan Zhao, Difei Gao, Mike Zheng Shou
•
Feb 12, 2025
•
26
4
LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid
Weigao Sun, Disen Lan, Yiran Zhong, Xiaoye Qu, Yu Cheng
•
Feb 11, 2025
•
24
2
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
Jean Vassoyan, Nathanaël Beau, Roman Plaud
•
Feb 10, 2025
•
18
2
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Li Hu, Guangyuan Wang, Zhen Shen, Xin Gao, Dechao Meng, Lian Zhuo, Peng Zhang, Bang Zhang, Liefeng Bo
•
Feb 10, 2025
•
16
4
PDE-Controller: LLMs for Autoformalization and Reasoning of PDEs
Mauricio Soroco, Jialin Song, Mengzhou Xia, Kye Emond, Weiran Sun, Wuyang Chen
•
Feb 3, 2025
•
16
2
DPO-Shift: Shifting the Distribution of Direct Preference Optimization
Xiliang Yang, Feng Jiang, Qianen Zhang, Lei Zhao, Xiao Li
•
Feb 11, 2025
•
15
2
NoLiMa: Long-Context Evaluation Beyond Literal Matching
Ali Modarressi, Hanieh Deilamsalehy, Franck Dernoncourt, Trung Bui, Ryan A. Rossi, Seunghyun Yoon, Hinrich Schütze
•
Feb 7, 2025
•
15
2
SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation
Zhiming Ma, Xiayang Xiao, Sihao Dong, Peidong Wang, HaiPeng Wang, Qingyun Pan
•
Feb 12, 2025
•
12
4
Next Block Prediction: Video Generation via Semi-Autoregressive Modeling
Shuhuai Ren, Shuming Ma, Xu Sun, Furu Wei
•
Feb 11, 2025
•
9
2
Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey
Bo Ni, Zheyuan Liu, Leyao Wang, Yongjia Lei, Yuying Zhao, Xueqi Cheng, Qingkai Zeng, Luna Dong, Yinglong Xia, Krishnaram Kenthapadi, Ryan Rossi, Franck Dernoncourt, Md Mehrab Tanjim, Nesreen Ahmed, Xiaorui Liu, Wenqi Fan, Erik Blasch, Yu Wang, Meng Jiang, Tyler Derr
•
Feb 8, 2025
•
8
2
LLM Modules: Knowledge Transfer from a Large to a Small Model using Enhanced Cross-Attention
Konstantin Kolomeitsev
•
Feb 12, 2025
•
4
2
Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing
Kunfeng Lai, Zhenheng Tang, Xinglin Pan, Peijie Dong, Xiang Liu, Haolan Chen, Li Shen, Bo Li, Xiaowen Chu
•
Feb 6, 2025
•
4
2
MetaSC: Test-Time Safety Specification Optimization for Language Models
Víctor Gallego
•
Feb 11, 2025
•
3
2
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning
Yuting He, Boyu Wang, Rongjun Ge, Yang Chen, Guanyu Yang, Shuo Li
•
Feb 7, 2025
•
0
2