Daily Papers
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps
Nanye Ma, Shangyuan Tong, Haolin Jia, Hexiang Hu, Yu-Chuan Su, Mingda Zhang, Xuan Yang, Yandong Li, Tommi Jaakkola, Xuhui Jia, Saining Xie•Jan 16, 2025•352
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Zekun Xi, Wenbiao Yin, Jizhan Fang, Jialong Wu, Runnan Fang, Ningyu Zhang, Jiang Yong, Pengjun Xie, Fei Huang, Huajun Chen•Jan 16, 2025•292
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Philippe Hansen-Estruch, David Yan, Ching-Yao Chung, Orr Zohar, Jialiang Wang, Tingbo Hou, Tao Xu, Sriram Vishwanath, Peter Vajda, Xinlei Chen•Jan 16, 2025•203
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators
Zhaocheng Liu, Quan Tu, Wen Ye, Yu Xiao, Zhishou Zhang, Hengfu Cui, Yalun Zhu, Qiang Ju, Shizheng Li, Jian Xie•Jan 16, 2025•164
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Fengli Xu, Qianyue Hao, Zefang Zong, Jingwei Wang, Yunke Zhang, Jingyi Wang, Xiaochong Lan, Jiahui Gong, Tianjian Ouyang, Fanjin Meng, Chenyang Shao, Yuwei Yan, Qinglong Yang, Yiwen Song, Sijian Ren, Xinyuan Hu, Yu Li, Jie Feng, Chen Gao, Yong Li•Jan 16, 2025•142
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces
Sumit Chaturvedi, Mengwei Ren, Yannick Hold-Geoffroy, Jingyuan Liu, Julie Dorsey, Zhixin Shu•Jan 16, 2025•132
FAST: Efficient Action Tokenization for Vision-Language-Action Models
Karl Pertsch, Kyle Stachowicz, Brian Ichter, Danny Driess, Suraj Nair, Quan Vuong, Oier Mees, Chelsea Finn, Sergey Levine•Jan 16, 2025•122
Do generative video models learn physical principles from watching videos?
Saman Motamed, Laura Culp, Kevin Swersky, Priyank Jaini, Robert Geirhos•Jan 14, 2025•102
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation
Hwan Heo, Jangyeong Kim, Seongyeong Lee, Jeong A Wi, Junyoung Choi, Sangjun Ahn•Jan 16, 2025•103
The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models
Jonathan Katzy, Razvan Mihai Popescu, Arie van Deursen, Maliheh Izadi•Jan 16, 2025•82
RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation
Kaiqu Liang, Haimin Hu, Ryan Liu, Thomas L. Griffiths, Jaime Fernández Fisac•Jan 15, 2025•72
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation
Junjie He, Yuxiang Tuo, Binghui Chen, Chongyang Zhong, Yifeng Geng, Liefeng Bo•Jan 16, 2025•62