ChatPaper.aiChatPaper

LookingGlass:基于拉普拉斯金字塔形变生成的错视艺术

LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping

April 11, 2025
作者: Pascal Chang, Sergio Sancho, Jingwei Tang, Markus Gross, Vinicius C. Azevedo
cs.AI

摘要

变形图像(Anamorphosis)是指一类经过刻意扭曲处理的图像,直接观看时难以辨认其真实形态。只有当通过特定视角,如借助反射镜或透镜等折反射装置观察时,其原本面貌才得以显现。尽管这类数学装置的构建可追溯至17世纪,但它们仅在特定观察点下可被解读,常规视角下则往往失去意义。本文以生成式视角重新审视这些著名的视觉错觉现象。借助潜在校正流模型,我们提出了一种方法,能够创建即便直接观看仍保持有效解读的变形图像。为此,我们引入了拉普拉斯金字塔变形技术,这是一种频率感知的图像变形方法,对生成高质量视觉效果至关重要。我们的工作将视觉字谜(Visual Anagrams,arXiv:2311.17919)扩展至潜在空间模型及更广泛的空间变换,从而开创了新型生成式感知错觉的创作可能。
English
Anamorphosis refers to a category of images that are intentionally distorted, making them unrecognizable when viewed directly. Their true form only reveals itself when seen from a specific viewpoint, which can be through some catadioptric device like a mirror or a lens. While the construction of these mathematical devices can be traced back to as early as the 17th century, they are only interpretable when viewed from a specific vantage point and tend to lose meaning when seen normally. In this paper, we revisit these famous optical illusions with a generative twist. With the help of latent rectified flow models, we propose a method to create anamorphic images that still retain a valid interpretation when viewed directly. To this end, we introduce Laplacian Pyramid Warping, a frequency-aware image warping technique key to generating high-quality visuals. Our work extends Visual Anagrams (arXiv:2311.17919) to latent space models and to a wider range of spatial transforms, enabling the creation of novel generative perceptual illusions.

Summary

AI-Generated Summary

PDF86April 22, 2025