Misc | Mayank Shrivastava

Training Free Guided Flow Matching with Optimal Control (ICLR 2025)

the celebA dataset contains 40 binary attribute categories each labelled as present or absent.

text_prompts = ['A photo of a smiling face.']

ema.copy_to(score_model.parameters())
N = 1
batch_size = 1

calculate clip_semantic_loss between original image and inverted one.

traj_oc is storing the entire trajectory.

len(traj) = 101

traj[0].shape torch.Size([1, 3, 256, 256])

embed_to_latent:

def embed_to_latent(model_fn, img):
  device = img.device
  def ode_func(t, x):
    x = from_flattened_numpy(x, img.shape).to(device).type(torch.float32)
    vec_t = torch.ones(img.shape[0], device=x.device) * t
    drift = model_fn(x, vec_t*999)
    return to_flattened_numpy(drift)