ChatPaper.ai
打開菜單
首頁
每日論文
定價
賬戶
工作台
🇭🇰
繁體中文
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究論文每日精選
每日精選AI研究論文及翻譯
November 18th, 2024
LLaVA-o1:讓視覺語言模型逐步推理
LLaVA-o1: Let Vision Language Models Reason Step-by-Step
Guowei Xu, Peng Jin, Li Hao, Yibing Song, Lichao Sun, Li Yuan
•
Nov 15, 2024
•
105
7
GaussianAnything:用於3D生成的交互式點雲潛在擴散
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation
Yushi Lan, Shangchen Zhou, Zhaoyang Lyu, Fangzhou Hong, Shuai Yang, Bo Dai, Xingang Pan, Chen Change Loy
•
Nov 12, 2024
•
21
6
透過硬綁定和軟微調的方式實現區域感知的文本到圖像生成
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen, Yajie Li, Haofan Wang, Zhibo Chen, Zhengkai Jiang, Jun Li, Qian Wang, Jian Yang, Ying Tai
•
Nov 10, 2024
•
34
6
GUI代理程式的曙光:與Claude 3.5電腦的初步案例研究
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use
Siyuan Hu, Mingyu Ouyang, Difei Gao, Mike Zheng Shou
•
Nov 15, 2024
•
31
3
Xmodel-1.5:一個十億規模的多語言LLM
Xmodel-1.5: An 1B-scale Multilingual LLM
Wang Qun, Liu Yang, Lin Qingquan, Jiang Ling
•
Nov 15, 2024
•
14
2
將其編號:像翻轉漫畫那樣對視頻進行時間定位
Number it: Temporal Grounding Videos like Flipping Manga
Yongliang Wu, Xinting Hu, Yuyang Sun, Yizhou Zhou, Wenbo Zhu, Fengyun Rao, Bernt Schiele, Xu Yang
•
Nov 15, 2024
•
14
2
MARS:釋放變異減少在訓練大型模型中的威力
MARS: Unleashing the Power of Variance Reduction for Training Large Models
Huizhuo Yuan, Yifeng Liu, Shuang Wu, Xun Zhou, Quanquan Gu
•
Nov 15, 2024
•
13
2