ChatPaper.ai
打开菜单
首页
每日论文
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
January 3rd, 2025
2.5年的课堂时间:一本面向视觉-语言预训练的多模态教材
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Wenqi Zhang, Hang Zhang, Xin Li, Jiashuo Sun, Yongliang Shen, Weiming Lu, Deli Zhao, Yueting Zhuang, Lidong Bing
•
Jan 1, 2025
•
95
7
VideoAnydoor:具有精确运动控制的高保真视频对象插入
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control
Yuanpeng Tu, Hao Luo, Xi Chen, Sihui Ji, Xiang Bai, Hengshuang Zhao
•
Jan 2, 2025
•
49
3
CodeElo:使用人类可比赛的Elo评分对LLM的竞赛级代码生成进行基准测试
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
Shanghaoran Quan, Jiaxi Yang, Bowen Yu, Bo Zheng, Dayiheng Liu, An Yang, Xuancheng Ren, Bofei Gao, Yibo Miao, Yunlong Feng, Zekun Wang, Jian Yang, Zeyu Cui, Yang Fan, Yichang Zhang, Binyuan Hui, Junyang Lin
•
Jan 2, 2025
•
47
6
LTX-Video:实时视频潜在扩散
LTX-Video: Realtime Video Latent Diffusion
Yoav HaCohen, Nisan Chiprut, Benny Brazowski, Daniel Shalem, Dudu Moshe, Eitan Richardson, Eran Levin, Guy Shiran, Nir Zabari, Ori Gordon, Poriya Panet, Sapir Weissbuch, Victor Kulikov, Yaki Bitterman, Zeev Melumian, Ofir Bibi
•
Dec 30, 2024
•
41
3
VideoRefer套件:利用视频LLM推进时空对象理解
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
Yuqian Yuan, Hang Zhang, Wentong Li, Zesen Cheng, Boqiang Zhang, Long Li, Xin Li, Deli Zhao, Wenqiao Zhang, Yueting Zhuang, Jianke Zhu, Lidong Bing
•
Dec 31, 2024
•
41
2
重建与生成:在潜在扩散模型中驯服优化困境
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Jingfeng Yao, Xinggang Wang
•
Jan 2, 2025
•
36
2
ProgCo:程序帮助大型语言模型自我校正
ProgCo: Program Helps Self-Correction of Large Language Models
Xiaoshuai Song, Yanan Wu, Weixun Wang, Jiaheng Liu, Wenbo Su, Bo Zheng
•
Jan 2, 2025
•
25
2
使用MLLM作为图像安全性评估的裁判,无需人工标注。
MLLM-as-a-Judge for Image Safety without Human Labeling
Zhenting Wang, Shuming Hu, Shiyu Zhao, Xiaowen Lin, Felix Juefei-Xu, Zhuowei Li, Ligong Han, Harihar Subramanyam, Li Chen, Jianfa Chen, Nan Jiang, Lingjuan Lyu, Shiqing Ma, Dimitris N. Metaxas, Ankit Jain
•
Dec 31, 2024
•
24
2
MapEval:基于地图的基础模型中地理空间推理的评估
MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models
Mahir Labib Dihan, Md Tanvir Hassan, Md Tanvir Parvez, Md Hasebul Hasan, Md Almash Alam, Muhammad Aamir Cheema, Mohammed Eunus Ali, Md Rizwan Parvez
•
Dec 31, 2024
•
22
2
A3:移动GUI代理的安卓代理竞技场
A3: Android Agent Arena for Mobile GUI Agents
Yuxiang Chai, Hanhao Li, Jiayu Zhang, Liang Liu, Guozhi Wang, Shuai Ren, Siyuan Huang, Hongsheng Li
•
Jan 2, 2025
•
22
3
将专门的视觉编码器统一为视频语言模型
Unifying Specialized Visual Encoders for Video Language Models
Jihoon Chung, Tyler Zhu, Max Gonzalez Saez-Diez, Juan Carlos Niebles, Honglu Zhou, Olga Russakovsky
•
Jan 2, 2025
•
21
2
代码奖励建模单元测试的动态缩放
Dynamic Scaling of Unit Tests for Code Reward Modeling
Zeyao Ma, Xiaokang Zhang, Jing Zhang, Jifan Yu, Sijia Luo, Jie Tang
•
Jan 2, 2025
•
17
2
嵌套注意力:语义感知注意力值用于概念个性化
Nested Attention: Semantic-aware Attention Values for Concept Personalization
Or Patashnik, Rinon Gal, Daniil Ostashev, Sergey Tulyakov, Kfir Aberman, Daniel Cohen-Or
•
Jan 2, 2025
•
11
2
SeedVR:在扩散变压器中播种无限,实现通用视频修复
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
Jianyi Wang, Zhijie Lin, Meng Wei, Yang Zhao, Ceyuan Yang, Chen Change Loy, Lu Jiang
•
Jan 2, 2025
•
11
2
MapQaTor:一个用于高效标注地图查询数据集的系统
MapQaTor: A System for Efficient Annotation of Map Query Datasets
Mahir Labib Dihan, Mohammed Eunus Ali, Md Rizwan Parvez
•
Dec 30, 2024
•
9
2
通过最近性和过度平滑的视角理解和缓解状态空间模型的瓶颈问题
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
Peihao Wang, Ruisi Cai, Yuehao Wang, Jiajun Zhu, Pragya Srivastava, Zhangyang Wang, Pan Li
•
Dec 31, 2024
•
7
2
基于人口意识的扩散用于时间序列生成
Population Aware Diffusion for Time Series Generation
Yang Li, Han Meng, Zhenyu Bi, Ingolv T. Urnes, Haipeng Chen
•
Jan 1, 2025
•
6
2
通过上下文等变位置编码重新思考语言模型中的寻址
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
Jiajun Zhu, Peihao Wang, Ruisi Cai, Jason D. Lee, Pan Li, Zhangyang Wang
•
Jan 1, 2025
•
6
4
SeFAR:具有时间扰动和学习稳定化的半监督细粒度动作识别
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
Yongle Huang, Haodong Chen, Zhenbang Xu, Zihan Jia, Haozhou Sun, Dian Shao
•
Jan 2, 2025
•
5
2