ChatPaper.ai
AI Research Papers Daily
Daily curated AI research papers with translations
January 14th, 2025
The Lessons of Developing Process Reward Models in Mathematical Reasoning
Zhenru Zhang, Chujie Zheng, Yangzhen Wu, Beichen Zhang, Runji Lin, Bowen Yu, Dayiheng Liu, Jingren Zhou, Junyang Lin
Jan 13, 2025 • 91 • 8
Tensor Product Attention Is All You Need
Yifan Zhang, Yifeng Liu, Huizhuo Yuan, Zhen Qin, Yang Yuan, Quanquan Gu, Andrew Chi-Chih Yao
Jan 11, 2025 • 84 • 4
Transformer^2: Self-adaptive LLMs
Qi Sun, Edoardo Cetin, Yujin Tang
Jan 9, 2025 • 53 • 7
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
Alejandro Lozano, Min Woo Sun, James Burgess, Liangyu Chen, Jeffrey J Nirschl, Jeffrey Gu, Ivan Lopez, Josiah Aklilu, Austin Wolfgang Katzer, Collin Chiu, Anita Rau, Xiaohan Wang, Yuhui Zhang, Alfred Seunghoon Song, Robert Tibshirani, Serena Yeung-Levy
Jan 13, 2025 • 50 • 2
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Qian Chen, Yafeng Chen, Yanni Chen, Mengzhe Chen, Yingda Chen, Chong Deng, Zhihao Du, Ruize Gao, Changfeng Gao, Zhifu Gao, Yabin Li, Xiang Lv, Jiaqing Liu, Haoneng Luo, Bin Ma, Chongjia Ni, Xian Shi, Jialong Tang, Hui Wang, Hao Wang, Wen Wang, Yuxuan Wang, Yunlan Xu, Fan Yu, Zhijie Yan, Yexin Yang, Baosong Yang, Xian Yang, Guanrou Yang, Tianyu Zhao, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Pei Zhang, Chong Zhang, Jinren Zhou
Jan 10, 2025 • 45 • 6
VideoAuteur: Towards Long Narrative Video Generation
Junfei Xiao, Feng Cheng, Lu Qi, Liangke Gui, Jiepeng Cen, Zhibei Ma, Alan Yuille, Lu Jiang
Jan 10, 2025 • 31 • 3
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning
Zhongzhen Huang, Gui Geng, Shengyi Hua, Zhen Huang, Haoyang Zou, Shaoting Zhang, Pengfei Liu, Xiaofan Zhang
Jan 11, 2025 • 29 • 2
WebWalker: Benchmarking LLMs in Web Traversal
Jialong Wu, Wenbiao Yin, Yong Jiang, Zhenglin Wang, Zekun Xi, Runnan Fang, Deyu Zhou, Pengjun Xie, Fei Huang
Jan 13, 2025 • 19 • 3
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Tianjin Huang, Ziquan Zhu, Gaojie Jin, Lu Liu, Zhangyang Wang, Shiwei Liu
Jan 12, 2025 • 15 • 2
UnCommon Objects in 3D
Xingchen Liu, Piyush Tayal, Jianyuan Wang, Jesus Zarzar, Tom Monnier, Konstantinos Tertikas, Jiali Duan, Antoine Toisoul, Jason Y. Zhang, Natalia Neverova, Andrea Vedaldi, Roman Shapovalov, David Novotny
Jan 13, 2025 • 13 • 2
ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning
Xiangru Tang, Tianyu Hu, Muyang Ye, Yanjun Shao, Xunjian Yin, Siru Ouyang, Wangchunshu Zhou, Pan Lu, Zhuosheng Zhang, Yilun Zhao, Arman Cohan, Mark Gerstein
Jan 11, 2025 • 9 • 2
Evaluating Sample Utility for Data Selection by Mimicking Model Weights
Tzu-Heng Huang, Manjot Bilkhu, Frederic Sala, Javier Movellan
Jan 12, 2025 • 5 • 2