ChatPaper.ai
打开菜单
首页
每日论文
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
February 14th, 2025
LLM肩上的随机鹦鹉:对物理概念理解的总结评估
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Mo Yu, Lemao Liu, Junjie Wu, Tsz Ting Chung, Shunchi Zhang, Jiangnan Li, Dit-Yan Yeung, Jie Zhou
•
Feb 13, 2025
•
184
3
InfiniteHiP:在单个GPU上将语言模型上下文扩展至300万个标记
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU
Heejun Lee, Geon Park, Jaduk Suh, Sung Ju Hwang
•
Feb 13, 2025
•
143
6
Skrr:用于高效存储的跳过和重用文本编码器层,用于文本到图像生成
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation
Hoigi Seo, Wongi Jeong, Jae-sun Seo, Se Young Chun
•
Feb 12, 2025
•
41
2
TripoSG:使用大规模矫正流模型进行高保真度3D形状合成
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Yangguang Li, Zi-Xin Zou, Zexiang Liu, Dehu Wang, Yuan Liang, Zhipeng Yu, Xingchao Liu, Yuan-Chen Guo, Ding Liang, Wanli Ouyang, Yan-Pei Cao
•
Feb 10, 2025
•
34
3
SelfCite:自监督对齐:大型语言模型中的上下文归因
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models
Yung-Sung Chuang, Benjamin Cohen-Wang, Shannon Zejiang Shen, Zhaofeng Wu, Hu Xu, Xi Victoria Lin, James Glass, Shang-Wen Li, Wen-tau Yih
•
Feb 13, 2025
•
33
2
EmbodiedBench:用于视觉驱动的具身代理的多模态大型语言模型的全面基准测试
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Rui Yang, Hanyang Chen, Junyu Zhang, Mark Zhao, Cheng Qian, Kangrui Wang, Qineng Wang, Teja Venkat Koripella, Marziyeh Movahedi, Manling Li, Heng Ji, Huan Zhang, Tong Zhang
•
Feb 13, 2025
•
33
2
这个模型能否也识别狗?从权重中进行零样本模型搜索
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights
Jonathan Kahana, Or Nathan, Eliahu Horwitz, Yedid Hoshen
•
Feb 13, 2025
•
31
2
一个开放的配方:通过模型合并在一天内将特定语言的LLMs调整为推理模型
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging
Kunat Pipatanakul, Pittawat Taveekitworachai, Potsawee Manakul, Kasima Tharnpipitchai
•
Feb 13, 2025
•
30
4
CoSER:协调基于LLM的已建立角色的人设模拟
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles
Xintao Wang, Heng Wang, Yifei Zhang, Xinfeng Yuan, Rui Xu, Jen-tse Huang, Siyu Yuan, Haoran Guo, Jiangjie Chen, Wei Wang, Yanghua Xiao, Shuchang Zhou
•
Feb 13, 2025
•
28
2
MME-CoT:在大型多模态模型中对思维链进行基准测试,以评估推理质量、鲁棒性和效率。
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
Dongzhi Jiang, Renrui Zhang, Ziyu Guo, Yanwei Li, Yu Qi, Xinyan Chen, Liuhui Wang, Jianhan Jin, Claire Guo, Shen Yan, Bo Zhang, Chaoyou Fu, Peng Gao, Hongsheng Li
•
Feb 13, 2025
•
27
2
探索在3D LMMs中无编码器架构的潜力
Exploring the Potential of Encoder-free Architectures in 3D LMMs
Yiwen Tang, Zoey Guo, Zhuhao Wang, Ray Zhang, Qizhi Chen, Junli Liu, Delin Qu, Zhigang Wang, Dong Wang, Xuelong Li, Bin Zhao
•
Feb 13, 2025
•
25
2
大型语言模型中的逻辑推理:一项调查
Logical Reasoning in Large Language Models: A Survey
Hanmeng Liu, Zhizhang Fu, Mengru Ding, Ruoxi Ning, Chaoli Zhang, Xiaozhang Liu, Yue Zhang
•
Feb 13, 2025
•
22
5
SQuARE:用于增强大型语言模型中思维链的顺序问答推理引擎
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models
Daniel Fleischer, Moshe Berchansky, Gad Markovits, Moshe Wasserblat
•
Feb 13, 2025
•
16
2
台风 T1:一个开放的泰国推理模型
Typhoon T1: An Open Thai Reasoning Model
Pittawat Taveekitworachai, Potsawee Manakul, Kasima Tharnpipitchai, Kunat Pipatanakul
•
Feb 13, 2025
•
16
2
CoT-Valve:长度可压缩的思维链调节
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
Xinyin Ma, Guangnian Wan, Runpeng Yu, Gongfan Fang, Xinchao Wang
•
Feb 13, 2025
•
14
2
通过高质量合成数据改进多模态多语言嵌入 mmE5
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data
Haonan Chen, Liang Wang, Nan Yang, Yutao Zhu, Ziliang Zhao, Furu Wei, Zhicheng Dou
•
Feb 12, 2025
•
13
2
DexTrack:实现从人类参考中实现灵巧操作的通用神经跟踪控制
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References
Xueyi Liu, Jianibieke Adalibieke, Qianwei Han, Yuzhe Qin, Li Yi
•
Feb 13, 2025
•
12
2
大型语言模型中的数学推理:评估在广泛数字范围内的逻辑和算术错误
Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges
Safal Shrestha, Minwu Kim, Keith Ross
•
Feb 12, 2025
•
11
2
VFX创作者:具有可控扩散变换器的动画视觉效果生成
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer
Xinyu Liu, Ailing Zeng, Wei Xue, Harry Yang, Wenhan Luo, Qifeng Liu, Yike Guo
•
Feb 9, 2025
•
8
2
具有三维感知的二维表示的潜在辐射场
Latent Radiance Fields with 3D-aware 2D Representations
Chaoyi Zhou, Xi Liu, Feng Luo, Siyu Huang
•
Feb 13, 2025
•
6
2
3CAD:一个用于无监督异常检测的大规模真实世界3C产品数据集
3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly
Enquan Yang, Peng Xing, Hanyang Sun, Wenbo Guo, Yuanwei Ma, Zechao Li, Dan Zeng
•
Feb 9, 2025
•
6
2