ChatPaper.ai
Daily AI Research Paper Picks
Daily curated AI research papers with translations
December 2nd, 2024
GRAPE: Generalizing Robot Policy via Preference Alignment
Zijian Zhang, Kaiyuan Zheng, Zhaorun Chen, Joel Jang, Yi Li, Chaoqi Wang, Mingyu Ding, Dieter Fox, Huaxiu Yao
Nov 28, 2024 • 44 • 2
Video Depth without Video Models
Bingxin Ke, Dominik Narnhofer, Shengyu Huang, Lei Ke, Torben Peters, Katerina Fragkiadaki, Anton Obukhov, Konrad Schindler
Nov 28, 2024 • 35 • 7
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Jinyang Wu, Mingkuan Feng, Shuai Zhang, Feihu Che, Zengqi Wen, Jianhua Tao
Nov 27, 2024 • 34 • 14
Yi-Lightning Technical Report
01. AI, Alan Wake, Albert Wang, Bei Chen, C. X. Lv, Chao Li, Chengen Huang, Chenglin Cai, Chujie Zheng, Daniel Cooper, Ethan Dai, Fan Zhou, Feng Hu, Heng Ji, Howard Qiu, Jiangcheng Zhu, Jun Tian, Katherine Su, Lihuan Zhang, Liying Li, Ming Song, Mou Li, Peng Liu, Qichen Hu, Shawn Wang, Shijun Zhou, Shiyong Li, Tianhang Zhu, Wen Xie, Xiang He, Xiaobo Chen, Xiaohui Hu, Xiaoyi Ren, Xinyao Niu, Yanpeng Li, Yongke Zhao, Yongzhen Luo, Yuchi Xu, Yuxuan Sha, Zhaodong Yan, Zhiyuan Liu, Zirui Zhang
Dec 2, 2024 • 26 • 2
On Domain-Specific Post-Training for Multimodal Large Language Models
Daixuan Cheng, Shaohan Huang, Ziyu Zhu, Xintong Zhang, Wayne Xin Zhao, Zhongzhi Luan, Bo Dai, Zhenliang Zhang
Nov 29, 2024 • 25 • 3
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling
Junha Hyung, Kinam Kim, Susung Hong, Min-Jung Kim, Jaegul Choo
Nov 27, 2024 • 24 • 3
Reverse Thinking Makes LLMs Stronger Reasoners
Justin Chih-Yao Chen, Zifeng Wang, Hamid Palangi, Rujun Han, Sayna Ebrahimi, Long Le, Vincent Perot, Swaroop Mishra, Mohit Bansal, Chen-Yu Lee, Tomas Pfister
Nov 29, 2024 • 20 • 2
FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
Haosen Yang, Adrian Bulat, Isma Hadji, Hai X. Pham, Xiatian Zhu, Georgios Tzimiropoulos, Brais Martinez
Nov 27, 2024 • 17 • 2
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Feng Liu, Shiwei Zhang, Xiaofeng Wang, Yujie Wei, Haonan Qiu, Yuzhong Zhao, Yingya Zhang, Qixiang Ye, Fang Wan
Nov 28, 2024 • 17 • 2
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Akhiad Bercovich, Tomer Ronen, Talor Abramovich, Nir Ailon, Nave Assaf, Mohammad Dabbah, Ido Galil, Amnon Geifman, Yonatan Geifman, Izhak Golan, Netanel Haber, Ehud Karpas, Itay Levy, Shahar Mor, Zach Moshe, Najeeb Nabwani, Omri Puny, Ran Rubin, Itamar Schen, Ido Shahaf, Oren Tropp, Omer Ullman Argov, Ran Zilberstein, Ran El-Yaniv
Nov 28, 2024 • 16 • 2
Trajectory Attention for Fine-grained Video Motion Control
Zeqi Xiao, Wenqi Ouyang, Yifan Zhou, Shuai Yang, Lei Yang, Jianlou Si, Xingang Pan
Nov 28, 2024 • 12 • 2
Scaling Transformers for Low-Bitrate High-Quality Speech Coding
Julian D Parker, Anton Smirnov, Jordi Pons, CJ Carr, Zack Zukowski, Zach Evans, Xubo Liu
Nov 29, 2024 • 11 • 3
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
Jungbin Cho, Junwan Kim, Jisoo Kim, Minseo Kim, Mingu Kang, Sungeun Hong, Tae-Hyun Oh, Youngjae Yu
Nov 29, 2024 • 10 • 2
Look Every Frame All at Once: Video-Ma^2mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing
Hosu Lee, Junho Kim, Hyunjun Kim, Yong Man Ro
Nov 29, 2024 • 10 • 2
MATATA: a weak-supervised MAthematical Tool-Assisted reasoning for Tabular Applications
Vishnou Vinayagame, Gregory Senay, Luis Martí
Nov 28, 2024 • 8 • 2
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
Sherwin Bahmani, Ivan Skorokhodov, Guocheng Qian, Aliaksandr Siarohin, Willi Menapace, Andrea Tagliasacchi, David B. Lindell, Sergey Tulyakov
Nov 27, 2024 • 8 • 2
DeMo: Decoupled Momentum Optimization
Bowen Peng, Jeffrey Quesnelle, Diederik P. Kingma
Nov 29, 2024 • 6 • 2
LLM Teacher-Student Framework for Text Classification With No Manually Annotated Data: A Case Study in IPTC News Topic Classification
Taja Kuzman, Nikola Ljubešić
Nov 29, 2024 • 6 • 2
AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos
Yuze He, Wang Zhao, Shaohui Liu, Yubin Hu, Yushi Bai, Yu-Hui Wen, Yong-Jin Liu
Nov 29, 2024 • 6 • 2
SpotLight: Shadow-Guided Object Relighting via Diffusion
Frédéric Fortier-Chouinard, Zitian Zhang, Louis-Etienne Messier, Mathieu Garon, Anand Bhattad, Jean-François Lalonde
Nov 27, 2024 • 3 • 1
Training Noise Token Pruning
Mingxing Rao, Bohan Jiang, Daniel Moyer
Nov 27, 2024 • 1 • 2