LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping
April 11, 2025
Authors: Pascal Chang, Sergio Sancho, Jingwei Tang, Markus Gross, Vinicius C. Azevedo
cs.AI
Abstract
Anamorphosis refers to a category of images that are intentionally distorted,
making them unrecognizable when viewed directly. Their true form only reveals
itself when seen from a specific viewpoint, which can be through some
catadioptric device like a mirror or a lens. While the construction of these
mathematical devices can be traced back to as early as the 17th century, they
are only interpretable when viewed from a specific vantage point and tend to
lose meaning when seen normally. In this paper, we revisit these famous optical
illusions with a generative twist. With the help of latent rectified flow
models, we propose a method to create anamorphic images that still retain a
valid interpretation when viewed directly. To this end, we introduce Laplacian
Pyramid Warping, a frequency-aware image warping technique key to generating
high-quality visuals. Our work extends Visual Anagrams (arXiv:2311.17919) to
latent space models and to a wider range of spatial transforms, enabling the
creation of novel generative perceptual illusions.
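
As background for the term, a Laplacian pyramid splits an image into band-pass detail layers plus a low-frequency residual, so that each frequency band can be processed at its own resolution. The sketch below is a generic NumPy illustration of that decomposition only, not the paper's warping method, which the abstract does not detail. The function names, the 2x2 box filter, and the nearest-neighbour upsampling are assumptions chosen for brevity.

```python
import numpy as np

def downsample(img):
    # Average each 2x2 block, halving the resolution.
    # Assumes a 2D array with even height and width.
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample(img):
    # Nearest-neighbour 2x upsampling (each pixel becomes a 2x2 block).
    return np.repeat(np.repeat(img, 2, axis=0), 2, axis=1)

def build_laplacian_pyramid(img, levels):
    # Returns [detail_0, ..., detail_{levels-1}, residual], finest first.
    pyramid = []
    current = img.astype(np.float64)
    for _ in range(levels):
        low = downsample(current)
        pyramid.append(current - upsample(low))  # band-pass detail layer
        current = low
    pyramid.append(current)  # low-frequency residual
    return pyramid

def reconstruct(pyramid):
    # Invert the decomposition by adding the detail layers back, coarse to fine.
    current = pyramid[-1]
    for band in reversed(pyramid[:-1]):
        current = upsample(current) + band
    return current

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.random((64, 64))
    pyr = build_laplacian_pyramid(image, levels=3)
    print([layer.shape for layer in pyr])
    print(np.allclose(reconstruct(pyr), image))
```

Because each detail layer stores exactly what the downsample/upsample round trip loses, summing the layers back recovers the input up to floating-point rounding. A frequency-aware warp could in principle operate on each layer separately before this reconstruction step, though how the paper does so is not stated here.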