InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper • 2404.19427 • Published Apr 30, 2024 • 71
Gecko: Versatile Text Embeddings Distilled from Large Language Models Paper • 2403.20327 • Published Mar 29, 2024 • 47
RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS Paper • 2403.13806 • Published Mar 20, 2024 • 18
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing Paper • 2402.15151 • Published Feb 23, 2024 • 7
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 6 days ago • 35
Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All Paper • 2401.13795 • Published Jan 24, 2024 • 64
Lumiere: A Space-Time Diffusion Model for Video Generation Paper • 2401.12945 • Published Jan 23, 2024 • 86
ProPainter: Improving Propagation and Transformer for Video Inpainting Paper • 2309.03897 • Published Sep 7, 2023 • 26
DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs Paper • 2309.03907 • Published May 18, 2023 • 8
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models Paper • 2309.05793 • Published Sep 11, 2023 • 50
MagiCapture: High-Resolution Multi-Concept Portrait Customization Paper • 2309.06895 • Published Sep 13, 2023 • 27
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Paper • 2308.07926 • Published Aug 15, 2023 • 27
DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory Paper • 2308.08089 • Published Aug 16, 2023 • 21
TeCH: Text-guided Reconstruction of Lifelike Clothed Humans Paper • 2308.08545 • Published Aug 16, 2023 • 33
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Paper • 2307.04725 • Published Jul 10, 2023 • 64
Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation Paper • 2307.03869 • Published Jul 8, 2023 • 22
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 80
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 80
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Paper • 2306.17843 • Published Jun 30, 2023 • 43
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization Paper • 2306.16928 • Published Jun 29, 2023 • 38
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language Paper • 2306.16410 • Published Jun 28, 2023 • 27
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold Paper • 2305.10973 • Published May 18, 2023 • 31
GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields Paper • 2306.06044 • Published Jun 9, 2023 • 4
DreamEditor: Text-Driven 3D Scene Editing with Neural Fields Paper • 2306.13455 • Published Jun 23, 2023 • 8
Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields Paper • 2306.12760 • Published Jun 22, 2023 • 8
Progressively Optimized Local Radiance Fields for Robust View Synthesis Paper • 2303.13791 • Published Mar 24, 2023 • 7
Large-scale Language Model Rescoring on Long-form Data Paper • 2306.08133 • Published Jun 13, 2023 • 4
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing Paper • 2306.10012 • Published Jun 16, 2023 • 35
NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations Paper • 2306.09109 • Published Jun 15, 2023 • 4
Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models Paper • 2306.08997 • Published Jun 15, 2023 • 10