Img-Diffusion
updated
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive
Generation
Paper
• 2312.12491
• Published • 75
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and
Generating with Multimodal LLMs
Paper
• 2401.11708
• Published • 30
Training-Free Consistent Text-to-Image Generation
Paper
• 2402.03286
• Published • 67
PALP: Prompt Aligned Personalization of Text-to-Image Models
Paper
• 2401.06105
• Published • 49
ImagenHub: Standardizing the evaluation of conditional image generation
models
Paper
• 2310.01596
• Published • 19
Instruct-Imagen: Image Generation with Multi-modal Instruction
Paper
• 2401.01952
• Published • 31
Scalable Diffusion Models with Transformers
Paper
• 2212.09748
• Published • 17
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass
Diffusion Transformers
Paper
• 2401.11605
• Published • 23
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Paper
• 2402.10210
• Published • 35
Paper
• 2402.13144
• Published • 100
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept
Composition
Paper
• 2402.15504
• Published • 21
DiffusionGPT: LLM-Driven Text-to-Image Generation System
Paper
• 2401.10061
• Published • 31
LightIt: Illumination Modeling and Control for Diffusion Models
Paper
• 2403.10615
• Published • 18
StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based
Semantic Control
Paper
• 2403.09055
• Published • 26
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image
Generation
Paper
• 2403.16990
• Published • 25
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept
Matching
Paper
• 2404.03653
• Published • 35
Bigger is not Always Better: Scaling Properties of Latent Diffusion
Models
Paper
• 2404.01367
• Published • 22
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video
Generation
Paper
• 2405.01434
• Published • 56
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Paper
• 2404.01197
• Published • 31
Dynamic Typography: Bringing Words to Life
Paper
• 2404.11614
• Published • 46
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image
Generation
Paper
• 2404.02733
• Published • 22
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper
• 2404.19427
• Published • 74
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip
Connection Editing
Paper
• 2312.11392
• Published • 20
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale
Prediction
Paper
• 2404.02905
• Published • 74
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Paper
• 2403.03206
• Published • 71
Scalable Pre-training of Large Autoregressive Image Models
Paper
• 2401.08541
• Published • 38
Stable Flow: Vital Layers for Training-Free Image Editing
Paper
• 2411.14430
• Published • 22