Platypus: A Generalized Specialist Model for Reading Text in Various Forms • Paper • 2408.14805 • Published 24 days ago • 11
CommonCatalog • Collection • Common Catalog, a dataset with Creative Commons licensed images and machine-generated caption pairs • 8 items • Updated May 16 • 14
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts • Paper • 2406.09162 • Published Jun 13 • 13
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment • Paper • 2403.05135 • Published Mar 8 • 42
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation • Paper • 2306.17115 • Published Jun 29, 2023 • 11