SNR-Edit: Structure-Aware Noise Rectification for Inversion-Free Flow-Based Editing
Lifan Jiang
Boxi Wu
Yuhang Pei
Tianrun Wu
Yongyuan Chen
Yan Zhao
Shiyu Yu
Deng Cai
State Key lab of CAD&CG, Zhejiang University
Submit to ICML 2026
Paper | GitHub | Dataset
SNR-Edit is a training-free, model-agnostic framework for inversion-free text-guided image editing with flow-based generative models. SNR-Edit identifies a Structural--Stochastic Mismatch, where a fixed Gaussian proxy drives the source trajectory off the source latent manifold and causes structural drift. To address this, SNR-Edit constructs a structure-aware prior by decomposing the source image into semantic regions (SAM2), encoding geometry with RoPE, and mapping region signatures through a frozen randomized projection to form a structural map. The map is resized to latent resolution, min--max normalized to [-1, 1], and broadcast to obtain a latent prior that modulates the noise. During integration, SNR-Edit mixes this prior with Gaussian noise to form rectified noise and computes a corrected source state that anchors the dynamics. It then evaluates the target velocity at a re-anchored position, yielding a rectified flow that preserves layout while executing edits. SNR-Edit improves structural fidelity and text alignment on SD3 and FLUX with minimal overhead.
Abstract
Inversion-free image editing using flow-based generative models challenges the prevailing inversion-based pipelines. However, existing approaches rely on fixed Gaussian noise to construct the source trajectory, leading to biased trajectory dynamics and causing structural degradation or quality loss. To address this, we introduce SNR-Edit, a training-free framework achieving faithful Latent Trajectory Correction via adaptive noise control. Mechanistically, SNR-Edit uses structure-aware noise rectification to inject segmentation constraints into the initial noise, anchoring the stochastic component of the source trajectory to the real image’s implicit inversion position and reducing trajectory drift during source–target transport. This lightweight modification yields smoother latent trajectories and ensures high-fidelity structural preservation without requiring model tuning or inversion. Across SD3 and FLUX, evaluations on PIE-Bench and SNR-Bench show that SNR-Edit delivers performance on pixel-level metrics and VLM-based scoring, while adding only ~1s overhead per image.