ChatPaper.ai
Daily AI Research Paper Picks
Daily curated AI research papers with translations
February 26th, 2025
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Yuxiang Wei, Olivier Duchenne, Jade Copet, Quentin Carbonneaux, Lingming Zhang, Daniel Fried, Gabriel Synnaeve, Rishabh Singh, Sida I. Wang
•
Feb 25, 2025
•
69
5
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
Xiangyu Zhao, Shengyuan Ding, Zicheng Zhang, Haian Huang, Maosong Cao, Weiyun Wang, Jiaqi Wang, Xinyu Fang, Wenhai Wang, Guangtao Zhai, Haodong Duan, Hua Yang, Kai Chen
•
Feb 25, 2025
•
69
2
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference
Jintao Zhang, Chendong Xiang, Haofeng Huang, Jia Wei, Haocheng Xi, Jun Zhu, Jianfei Chen
•
Feb 25, 2025
•
53
2
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
Yifan Pu, Yiming Zhao, Zhicong Tang, Ruihong Yin, Haoxing Ye, Yuhui Yuan, Dong Chen, Jianmin Bao, Sirui Zhang, Yanbin Wang, Lin Liang, Lijuan Wang, Ji Li, Xiu Li, Zhouhui Lian, Gao Huang, Baining Guo
•
Feb 25, 2025
•
34
4
KV-Edit: Training-Free Image Editing for Precise Background Preservation
Tianrui Zhu, Shiyi Zhang, Jiawei Shao, Yansong Tang
•
Feb 24, 2025
•
33
3
Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective
Chengyin Xu, Kaiyuan Chen, Xiao Li, Ke Shen, Chenggang Li
•
Feb 24, 2025
•
19
2
Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents
Patrick Tser Jern Kon, Jiachen Liu, Qiuyi Ding, Yiming Qiu, Zhenning Yang, Yibo Huang, Jayanth Srinivasa, Myungjin Lee, Mosharaf Chowdhury, Ang Chen
•
Feb 22, 2025
•
18
5
K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs
Ziheng Ouyang, Zhen Li, Qibin Hou
•
Feb 25, 2025
•
15
2
Introducing Visual Perception Token into Multimodal Large Language Model
Runpeng Yu, Xinyin Ma, Xinchao Wang
•
Feb 24, 2025
•
14
2
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models
Ya Wang, Zhijian Zhuo, Yutao Zeng, Xun Zhou, Jian Yang, Xiaoqing Li
•
Feb 21, 2025
•
13
2
WebGames: Challenging General-Purpose Web-Browsing AI Agents
George Thomas, Alex J. Chan, Jikun Kang, Wenqi Wu, Filippos Christianos, Fraser Greenlee, Andy Toulis, Marvin Purtorab
•
Feb 25, 2025
•
11
2
The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?
Zhenheng Tang, Xiang Liu, Qian Wang, Peijie Dong, Bingsheng He, Xiaowen Chu, Bo Li
•
Feb 24, 2025
•
8
2
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs
Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski
•
Feb 24, 2025
•
7
2
Prompt-to-Leaderboard
Evan Frick, Connor Chen, Joseph Tennyson, Tianle Li, Wei-Lin Chiang, Anastasios N. Angelopoulos, Ion Stoica
•
Feb 20, 2025
•
7
3
Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization
Yao Xiao, Hai Ye, Linyao Chen, Hwee Tou Ng, Lidong Bing, Xiaoli Li, Roy Ka-wei Lee
•
Feb 24, 2025
•
6
2
AAD-LLM: Neural Attention-Driven Auditory Scene Understanding
Xilin Jiang, Sukru Samet Dindar, Vishal Choudhari, Stephan Bickel, Ashesh Mehta, Guy M McKhann, Adeen Flinker, Daniel Friedman, Nima Mesgarani
•
Feb 24, 2025
•
5
3
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation
Pengzhi Li, Pengfei Yu, Zide Liu, Wei He, Xuhao Pan, Xudong Rao, Tao Wei, Wei Chen
•
Feb 25, 2025
•
4
2
An Overview of Large Language Models for Statisticians
Wenlong Ji, Weizhe Yuan, Emily Getzen, Kyunghyun Cho, Michael I. Jordan, Song Mei, Jason E Weston, Weijie J. Su, Jing Xu, Linjun Zhang
•
Feb 25, 2025
•
4
2
LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models
Hugo Pitorro, Marcos Treviso
•
Feb 21, 2025
•
4
2
Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI
Syed Abdul Gaffar Shakhadri, Kruthika KR, Kartik Basavaraj Angadi
•
Feb 24, 2025
•
3
2
WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging
Ahmed Elhady, Eneko Agirre, Mikel Artetxe
•
Feb 25, 2025
•
2
2
Scaling LLM Pre-training with Vocabulary Curriculum
Fangyuan Yu
•
Feb 25, 2025
•
1
2