Introduction to IP Adapter Face ID

IP Adapter Face ID：Generate various style images conditioned on a face with only text prompts.

We use face ID embedding from a face recognition model instead of CLIP image embedding, additionally, we use LoRA to improve ID consistency. IP Adapter Face ID can generate various style images conditioned on a face with only text prompts.

A Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models.Various image synthesis with our proposed IP-Adapter applied on the pretrained text-to-image diffusion model and additional structure controller.

Approach of IP Adapter Face ID

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. The proposed IP-Adapter consists of two parts: a image encoder to extract image features from image prompt, and adapted modules with decoupled cross-attention to embed image features into the pretrained text-to-image diffusion model.