ChatPaper.ai
打开菜单
首页
每日论文
定价
账户
工作台
🇨🇳
中文简体
Loading...
•
•
•
•
•
•
•
•
•
•
AI研究论文每日精选
每日精选AI研究论文及翻译
February 24th, 2025
LLM显微镜:揭示标点符号在Transformer上下文记忆中的隐秘作用
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers
Anton Razzhigaev, Matvey Mikhalchuk, Temurbek Rahmatullaev, Elizaveta Goncharova, Polina Druzhinina, Ivan Oseledets, Andrey Kuznetsov
•
Feb 20, 2025
•
163
3
SurveyX:基于大语言模型的学术调查自动化系统
SurveyX: Academic Survey Automation via Large Language Models
Xun Liang, Jiawei Yang, Yezhaohui Wang, Chen Tang, Zifan Zheng, Simin Niu, Shichao Song, Hanyu Wang, Bo Tang, Feiyu Xiong, Keming Mao, Zhiyu li
•
Feb 20, 2025
•
94
5
Mol-LLaMA:迈向大分子语言模型中对分子的通用理解
Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model
Dongki Kim, Wonbin Lee, Sung Ju Hwang
•
Feb 19, 2025
•
43
2
PhotoDoodle:从少量成对数据中学习艺术图像编辑
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data
Shijie Huang, Yiren Song, Yuxuan Zhang, Hailong Guo, Xueyin Wang, Mike Zheng Shou, Jiaming Liu
•
Feb 20, 2025
•
38
6
MaskGWM:一种基于视频掩码重建的通用驾驶世界模型
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Jingcheng Ni, Yuxin Guo, Yichen Liu, Rui Chen, Lewei Lu, Zehuan Wu
•
Feb 17, 2025
•
37
2
SIFT:通过贴纸将大语言模型推理锚定于上下文
SIFT: Grounding LLM Reasoning in Contexts via Stickers
Zihao Zeng, Xuyao Huang, Boxiu Li, Zhijie Deng
•
Feb 19, 2025
•
30
3
VLM^2-Bench:深入探究视觉语言模型如何隐式关联显式匹配的视觉线索
VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
Jianshu Zhang, Dongyu Yao, Renjie Pi, Paul Pu Liang, Yi R., Fung
•
Feb 17, 2025
•
29
2
LightThinker:逐步思考的压缩算法
LightThinker: Thinking Step-by-Step Compression
Jintian Zhang, Yuqi Zhu, Mengshu Sun, Yujie Luo, Shuofei Qiao, Lun Du, Da Zheng, Huajun Chen, Ningyu Zhang
•
Feb 21, 2025
•
26
5
安全标准是否人人相同?大型语言模型的用户特定安全性评估
Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models
Yeonjun In, Wonjoong Kim, Kanghoon Yoon, Sungchul Kim, Mehrab Tanjim, Kibum Kim, Chanyoung Park
•
Feb 20, 2025
•
15
2
MoBA:面向长上下文大语言模型的混合块注意力机制
MoBA: Mixture of Block Attention for Long-Context LLMs
Enzhe Lu, Zhejun Jiang, Jingyuan Liu, Yulun Du, Tao Jiang, Chao Hong, Shaowei Liu, Weiran He, Enming Yuan, Yuzhi Wang, Zhiqi Huang, Huan Yuan, Suting Xu, Xinran Xu, Guokun Lai, Yanru Chen, Huabin Zheng, Junjie Yan, Jianlin Su, Yuxin Wu, Neo Y. Zhang, Zhilin Yang, Xinyu Zhou, Mingxing Zhang, Jiezhong Qiu
•
Feb 18, 2025
•
14
2
StructFlowBench:面向多轮指令跟随的结构化流程基准测试
StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following
Jinnan Li, Jinzhe Li, Yue Wang, Yi Chang, Yuan Wu
•
Feb 20, 2025
•
13
2
迈向全自动材料发现:基于大规模合成数据集与专家级大语言模型评判
Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge
Heegyu Kim, Taeyang Jeon, Seungtaek Choi, Jihoon Hong, Dongwon Jeon, Sungbum Cho, Ga-Yeon Baek, Kyung-Won Kwak, Dong-Hee Lee, Sun-Jin Choi, Jisu Bae, Chihoon Lee, Yunseo Kim, Jinsung Park, Hyunsouk Cho
•
Feb 23, 2025
•
11
2
依据韩国教育标准评估多模态生成式人工智能
Evaluating Multimodal Generative AI with Korean Educational Standards
Sanghee Park, Geewook Kim
•
Feb 21, 2025
•
9
3
MedHallu:大型语言模型医学幻觉检测综合基准
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models
Shrey Pandit, Jiawei Xu, Junyuan Hong, Zhangyang Wang, Tianlong Chen, Kaidi Xu, Ying Ding
•
Feb 20, 2025
•
9
2
深入JSON思维:强化策略确保LLM严格遵循模式规范
Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence
Bhavik Agarwal, Ishan Joshi, Viktoria Rojkova
•
Feb 18, 2025
•
9
2
大型语言模型中推理与性能的关系——o3(迷你版)更注重深度思考,而非延长思考时间
The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer
Marthe Ballon, Andres Algaba, Vincent Ginis
•
Feb 21, 2025
•
8
2
ReQFlow:用于高效高质量蛋白质骨架生成的正则化四元数流
ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation
Angxiao Yue, Zichong Wang, Hongteng Xu
•
Feb 20, 2025
•
8
3
FantasyID:基于面部知识增强的身份保持视频生成
FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation
Yunpeng Zhang, Qiang Wang, Fan Jiang, Yaqi Fan, Mu Xu, Yonggang Qi
•
Feb 19, 2025
•
8
2
一步扩散模型与f-散度分布匹配
One-step Diffusion Models with f-Divergence Distribution Matching
Yilun Xu, Weili Nie, Arash Vahdat
•
Feb 21, 2025
•
7
2
InterFeedback:通过人类反馈揭示大型多模态模型的交互智能
InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback
Henry Hengyuan Zhao, Wenqi Pei, Yifei Tao, Haiyang Mei, Mike Zheng Shou
•
Feb 20, 2025
•
7
2
KITAB-Bench:面向阿拉伯语OCR与文档理解的多领域综合基准测试平台
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
Ahmed Heakl, Abdullah Sohail, Mukul Ranjan, Rania Hossam, Ghazi Ahmed, Mohamed El-Geish, Omar Maher, Zhiqiang Shen, Fahad Khan, Salman Khan
•
Feb 20, 2025
•
7
2
EgoSpeak:在真实场景中为自我中心对话代理学习何时发言
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
Junhyeok Kim, Min Soo Kim, Jiwan Chung, Jungbin Cho, Jisoo Kim, Sungwoong Kim, Gyeongbo Sim, Youngjae Yu
•
Feb 17, 2025
•
6
2
超级智能体带来灾难性风险:科学家AI能否开辟更安全的道路?
Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?
Yoshua Bengio, Michael Cohen, Damiano Fornasiere, Joumana Ghosn, Pietro Greiner, Matt MacDermott, Sören Mindermann, Adam Oberman, Jesse Richardson, Oliver Richardson, Marc-Antoine Rondeau, Pierre-Luc St-Charles, David Williams-King
•
Feb 21, 2025
•
5
2
辩论之树:多角色辩论框架激发批判性思维,助力科学对比分析
Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis
Priyanka Kargupta, Ishika Agarwal, Tal August, Jiawei Han
•
Feb 20, 2025
•
5
2
mStyleDistance:多语言风格嵌入及其评估
mStyleDistance: Multilingual Style Embeddings and their Evaluation
Justin Qiu, Jiacheng Zhu, Ajay Patel, Marianna Apidianaki, Chris Callison-Burch
•
Feb 21, 2025
•
3
2
CrossOver:三维场景跨模态对齐
CrossOver: 3D Scene Cross-Modal Alignment
Sayan Deb Sarkar, Ondrej Miksik, Marc Pollefeys, Daniel Barath, Iro Armeni
•
Feb 20, 2025
•
3
3
PLDR-LLMs学会了一种可泛化的张量运算器,能够在推理阶段替代其自身的深度神经网络。
PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference
Burc Gokden
•
Feb 19, 2025
•
3
2
WHAC:基于现实世界的人类与相机系统
WHAC: World-grounded Humans and Cameras
Wanqi Yin, Zhongang Cai, Ruisi Wang, Fanzhou Wang, Chen Wei, Haiyi Mei, Weiye Xiao, Zhitao Yang, Qingping Sun, Atsushi Yamashita, Ziwei Liu, Lei Yang
•
Mar 19, 2024
•
3
2
大规模语言模型在罕见病鉴别诊断中的应用: 从腹部放线菌病到威尔逊氏病
Rare Disease Differential Diagnosis with Large Language Models at Scale: From Abdominal Actinomycosis to Wilson's Disease
Elliot Schumacher, Dhruv Naik, Anitha Kannan
•
Feb 20, 2025
•
2
2
政治科学视角下的LLM基准测试:以联合国为视角
Benchmarking LLMs for Political Science: A United Nations Perspective
Yueqing Liang, Liangwei Yang, Chen Wang, Congying Xia, Rui Meng, Xiongxiao Xu, Haoran Wang, Ali Payani, Kai Shu
•
Feb 19, 2025
•
2
2
学习发现基因表达预测的调控元件
Learning to Discover Regulatory Elements for Gene Expression Prediction
Xingyu Su, Haiyang Yu, Degui Zhi, Shuiwang Ji
•
Feb 19, 2025
•
2
2
UPCORE:面向均衡遗忘的效用保持型核心集选择
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning
Vaidehi Patil, Elias Stengel-Eskin, Mohit Bansal
•
Feb 20, 2025
•
1
2
JL1-CD:遥感变化检测新基准与鲁棒多教师知识蒸馏框架
JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework
Ziyuan Liu, Ruifei Zhu, Long Gao, Yuanxiu Zhou, Jingyu Ma, Yuantao Gu
•
Feb 19, 2025
•
1
2
超越“拒绝”:量化AI过度拒绝与情感依恋的边界
Beyond No: Quantifying AI Over-Refusal and Emotional Attachment Boundaries
David Noever, Grant Rosario
•
Feb 20, 2025
•
0
3