Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Prompt Inversion using Diffusion Large Langauge Models
Naresh Kumar Devulapally
Target: CVPR 2026
Fall 2025
What is Prompt Inversion?
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
Given an input image (\( I_n \)) Prompt Inversion is the process of finding the optimal textual prompt (\( P \)) that, when fed into a pre-trained image generation model (\( G \)), would most faithfully reconstruct a given input image (\( I_n \))
(\( I_n \))
"A vibrant and colorful teapot with blue, red, orange, and green stripes, featuring an orange spout, handle, and feet, on a clean white background."
(\( P \))
Why Prompt Inversion?
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
A dog-faced backpack
Simple captions do not let you reproduce the object/style in images
Image Editing v/s Prompt Inversion
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
(\( I_n \))
Image Editing: "Transform the realistic photo of the colorful teapot into a whimsical 2D children's book illustration that feels lively and enchanting."
Prompt Inversion
A whimsical and vibrant children's book illustration of a colorful teapot with blue, red, orange, and green stripes, featuring a playful orange spout, handle, and feet. The style is charming and expressive, with soft textures and a clean, inviting background, reminiscent of a classic storybook
Baseline Method
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
Baseline Method: VGD
Baseline Experiments - VGD
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
'Pot, a vibrant and playful ceramic teapot set, with steam emitting from a small, handle-shaped pipe, sitting on a bright, striped, and bouncy surface next to 3 bright, multi-hued, water-based, plant-pot-shaped, plastic, toy, water-injected, water-based water balloon-shaped water-injected plastic.'
Week 1: Task - Soft Prompting in SD 3.5
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
Question: Does Soft Prompting tokens allow for additional conditioning for personalization?
A photo of \( s_* \) dog
Week 1: Task - Soft Prompting in SD 3.5
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
Question: Does Soft Prompting tokens allow for additional conditioning for personalization?
A photo of \( s_* \) dog beside a cat.
A photo of \( s_* \) dog beside a white cat.
Hard
Prompt Inversion
with 1 image
Week 1: Task - Replace Llama with LLada (Inf. Speed up)
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
Input
VGD - LLama
VGD - LLada - Block Size 32
VGD - LLada - Block Size 16
VGD - LLada - Block Size 1
32 \( \times \) Speed up*
32 \( \times \) Speed up*
No Speed up*
* Approximate calculations
(Full LLada generation)
Baseline Experiments - VGD
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
LLaDa - Block size 32
Baseline Experiments - VGD
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
LLaDa - Block size 32
October 9 - Weekly
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
October 9 - Weekly
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
How can we validate our generated prompts?
Milestone 1 - To-Dos
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025
October 17 - Weekly
Naresh Kumar Devulapally
Prompt Inversion - DLLMs
Fall 2025