Generative AI model for Global Illumination effects

Recent advances in generative techniques [1] exhibit the ability to generate images with visually appealing content and illumination. Strong priors in generative models, learned from a large-scale datasets, have enabled the breakthrough, ushering a new era in neural rendering. While there has been some research focusing on realistic and controllable lighting effects with diffusion-based models, they still lack the capability to produce specific lighting effects. Particularly, generating multi-bounce, high-frequency lighting effects like caustics remain untackled in diffusion-based image generation.

Diffusion-based models [2] have demonstrated the capability of generating photorealistic images in various domains. Nevertheless, some research addresses the limitation of the current diffusion model-based image generation for shadows and reflections, while introducing conditioned diffusion models to model the lighting with single bounce shading and mirror reflection as a depth conditioned image inpainting task.

Caustics Generation by a Conditional Diffusion Model

We leverage diffusion-based techniques to generate an indirect illumination of a particular lighting effect. Specifically, our technique enables a diffusion model to generate cardioid-shaped reflective caustics as a conditional image generation task.

We use a latent-space diffusion model as our baseline architecture and set multi-image conditioning and light embeddings. We use geometric and material information like albedo, normal, roughness, and metallic as conditioning images, augmenting them with illumination information like direct illumination and radiance cues. These conditioning images are encoded into latent space using a pre-trained Variational Autoencoder (VAE) encoder. Light position encoded by Positional Encoding and light direction encoded by Spherical Harmonics form an additional input to the diffusion UNet. Figure 1 presents our framework with a conditional diffusion model for generating caustics effect.

Figure 1. Framework with a conditional diffusion model.

Results

We fine-tune a latent-space diffusion model using our caustics dataset and demonstrate our approach generates visually plausible cardioid-shaped caustics. The conditioning information that includes geometric, material and illumination data, as well as light property, is easily obtained from existing rendering pipeline.

Figure 2 shows our results for validation data (Top) and test data (Bottom). The indirect illumination (Figure 2 (b)) is generated from our fine-tuned diffusion model and composited to the direct illumination (Figure 2 (a)), which is one of our conditioning images, to present a final result (Figure 2 (c)).

Figure 2. Our results. (a) Direct illumination, (b) Indirect illumination in our result, (c) (a)+(b) our result, (d) Reference global illumination.

Figure 3. (Left) Our result. (Right) Reference global illumination.

Our work paves a way to interesting research for generative diffusion-based models to be capable of specific indirect illumination effect generation. Further details can be found in our paper [3] presented at Eurographics 2025 – Short paper.

References

Brooks, Peebles, et al. Video generation models as world simulators. (2024).
Hanqun Cao, Cheng Tan, Zhangyang Gao, Yilun Xu, Guangyong Chen, Pheng-Ann Heng, Stan Z. Li. A Survey on Generative Diffusion Models. IEEE Transactions on Knowledge and Data Engineering (2024).
Wojciech Uss, Wojciech Kaliński, Alexandr Kuznetsov, Harish Anand, Sungye Kim. Cardioid Caustics Generation with Conditional Diffusion Models. Eurographics 2025 – Short Papers (2025).

Source link

Subscription Plans

Beginner’s Bundle

Infinity Plan

Elevate Subscription

Generative AI model for Global Illumination effects

Caustics Generation by a Conditional Diffusion Model

Results

References

Salesforce CRM Consulting Solutions for B2B Growth

Mistral’s Le Chat adds deep research agent and voice mode to challenge OpenAI’s enterprise dominance

MG – Darknet Diaries

The end of Intel hybrid processors in 2028?

Montréal Attracts Top Meetings With Worldly Charm and Local Warmth

Related articles

Why Spanish Job Fairs Are Going Digital And How Your Event Can Too

Salesforce CRM Consulting Solutions for B2B Growth

Mistral’s Le Chat adds deep research agent and voice mode to challenge OpenAI’s enterprise dominance

MG – Darknet Diaries

Follow us

Company

Contact Us

Popular news

Why Spanish Job Fairs Are Going Digital And How Your Event Can Too

Salesforce CRM Consulting Solutions for B2B Growth

Mistral’s Le Chat adds deep research agent and voice mode to challenge OpenAI’s enterprise dominance