LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping

April 11, 2025
Authors: Pascal Chang, Sergio Sancho, Jingwei Tang, Markus Gross, Vinicius C. Azevedo
cs.AI

Abstract

Anamorphosis refers to a category of images that are intentionally distorted so that they are unrecognizable when viewed directly. Their true form reveals itself only when seen from a specific viewpoint, for instance through a catadioptric device such as a mirror or a lens. Although the construction of these mathematical devices can be traced back to as early as the 17th century, the resulting images are interpretable only from a specific vantage point and tend to lose meaning when seen normally. In this paper, we revisit these famous optical illusions with a generative twist. Using latent rectified flow models, we propose a method to create anamorphic images that still retain a valid interpretation when viewed directly. To this end, we introduce Laplacian Pyramid Warping, a frequency-aware image warping technique that is key to generating high-quality visuals. Our work extends Visual Anagrams (arXiv:2311.17919) to latent space models and to a wider range of spatial transforms, enabling the creation of novel generative perceptual illusions.
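The abstract names Laplacian Pyramid Warping but does not describe the algorithm. The sketch below is therefore not the authors' method. It only illustrates the generic building block the name suggests: splitting an image into frequency bands with a Laplacian pyramid, applying a spatial transform to each band, and recombining. All function names are hypothetical, the box-filter downsampling and nearest-neighbour upsampling are simplifying assumptions, and a 90-degree rotation stands in for the warp.

```python
import numpy as np

def downsample(img):
    """Halve the resolution by averaging each 2x2 block (sides must be even)."""
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample(img):
    """Double the resolution by nearest-neighbour repetition."""
    return img.repeat(2, axis=0).repeat(2, axis=1)

def build_laplacian_pyramid(img, levels):
    """Return [band_0, ..., band_{levels-1}, residual], finest band first."""
    pyramid = []
    current = img.astype(np.float64)
    for _ in range(levels):
        coarse = downsample(current)
        # Each band stores the detail that downsampling would discard.
        pyramid.append(current - upsample(coarse))
        current = coarse
    pyramid.append(current)  # low-frequency residual
    return pyramid

def reconstruct(pyramid):
    """Invert build_laplacian_pyramid by adding bands back, coarse to fine."""
    current = pyramid[-1]
    for band in reversed(pyramid[:-1]):
        current = upsample(current) + band
    return current

def warp_pyramid(pyramid, warp):
    """Apply the same spatial transform to every level of the pyramid."""
    return [warp(level) for level in pyramid]

# Toy data: a random 16x16 grayscale image and a 3-level pyramid.
rng = np.random.default_rng(0)
image = rng.random((16, 16))
pyr = build_laplacian_pyramid(image, levels=3)

roundtrip = reconstruct(pyr)                         # no warp applied
warped = reconstruct(warp_pyramid(pyr, np.rot90))    # warp each band, then merge
```

For a rotation by 90 degrees, warping the bands and warping the image directly give the same result, which makes the sketch easy to check. A frequency-aware scheme becomes relevant for warps that stretch or compress the image locally, where each band would need different treatment; how the paper handles that case is not stated in the abstract.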
