Abecid (Adam Lee)

upvoted a paper about 19 hours ago

Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models

Paper • 2410.02416 • Published 3 days ago • 15

upvoted an article 5 days ago

Article

Converting Vertex-Colored Meshes to Textured Meshes

6 days ago

• 7

upvoted a paper 11 days ago

MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling

Paper • 2409.16160 • Published 11 days ago • 30

upvoted a paper 12 days ago

V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians

Paper • 2409.13648 • Published 15 days ago • 9

upvoted a paper 13 days ago

Imagine yourself: Tuning-Free Personalized Image Generation

Paper • 2409.13346 • Published 16 days ago • 66

upvoted an article 17 days ago

Article

Outpainting II - Differential Diffusion

By

•

Apr 23

• 44

upvoted 4 papers 18 days ago

DrawingSpinUp: 3D Animation from Single Character Drawings

Paper • 2409.08615 • Published 23 days ago • 14

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published 18 days ago • 23

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Paper • 2409.11355 • Published 18 days ago • 26

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published 19 days ago • 17

upvoted 2 papers 19 days ago

Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos

Paper • 2409.08353 • Published 23 days ago • 10

A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis

Paper • 2409.08947 • Published 22 days ago • 11

upvoted an article 21 days ago

Article

Accelerate 1.0.0

23 days ago

• 34

upvoted a paper 27 days ago

Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task

Paper • 2409.04005 • Published 30 days ago • 16

upvoted 8 papers about 1 month ago

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model

Paper • 2409.01199 • Published Sep 2 • 12

SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners

Paper • 2408.16768 • Published Aug 29 • 26

ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model

Paper • 2408.16767 • Published Aug 29 • 29

Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation

Paper • 2408.14819 • Published Aug 27 • 19

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Paper • 2408.15239 • Published Aug 27 • 27

upvoted an article about 1 month ago

Article

SDXL in 4 steps with Latent Consistency LoRAs

Nov 9, 2023

• 10

upvoted 5 papers about 1 month ago

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

Paper • 2408.13252 • Published Aug 23 • 23

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

Paper • 2308.06721 • Published Aug 13, 2023 • 29

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22 • 86

Subsurface Scattering for 3D Gaussian Splatting

Paper • 2408.12282 • Published Aug 22 • 5

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Paper • 2408.12590 • Published Aug 22 • 33

upvoted 5 papers about 2 months ago

Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches

Paper • 2408.04567 • Published Aug 8 • 23

RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis

Paper • 2408.03356 • Published Aug 6 • 8

Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields

Paper • 2408.03822 • Published Aug 7 • 9

An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion

Paper • 2408.03178 • Published Aug 6 • 36

LatentSwap: An Efficient Latent Code Mapping Framework for Face Swapping

Paper • 2402.18351 • Published Feb 28 • 1

upvoted 5 papers 2 months ago

Fine-gained Zero-shot Video Sampling

Paper • 2407.21475 • Published Jul 31 • 5

Visual Text Generation in the Wild

Paper • 2407.14138 • Published Jul 19 • 8

Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Paper • 2407.21705 • Published Jul 31 • 25

WalkTheDog: Cross-Morphology Motion Alignment via Phase Manifolds

Paper • 2407.18946 • Published Jul 11 • 12

FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention

Paper • 2407.19918 • Published Jul 29 • 47

upvoted an article 2 months ago

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30

• 52

upvoted 6 papers 2 months ago

MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation

Paper • 2407.15060 • Published Jul 21 • 9

VSSD: Vision Mamba with Non-Casual State Space Duality

Paper • 2407.18559 • Published Jul 26 • 16

Floating No More: Object-Ground Reconstruction from a Single Image

Paper • 2407.18914 • Published Jul 26 • 18

SHIC: Shape-Image Correspondences with no Keypoint Supervision

Paper • 2407.18907 • Published Jul 26 • 39

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

Paper • 2407.17952 • Published Jul 25 • 27

ViPer: Visual Personalization of Generative Models via Individual Preference Learning

Paper • 2407.17365 • Published Jul 24 • 11

upvoted an article 2 months ago

Article

It's raining diffusion personalization techniques☔️🎭🖼️

By

•

Apr 11

• 18

upvoted a paper 2 months ago

HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions

Paper • 2407.15187 • Published Jul 21 • 10

upvoted 4 papers 3 months ago

Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders

Paper • 2407.14435 • Published Jul 19 • 6

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18 • 16

Shape of Motion: 4D Reconstruction from a Single Video

Paper • 2407.13764 • Published Jul 18 • 19

Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion

Paper • 2407.13759 • Published Jul 18 • 17

upvoted an article 3 months ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Jul 18

• 44

upvoted 8 papers 3 months ago

E5-V: Universal Embeddings with Multimodal Large Language Models

Paper • 2407.12580 • Published Jul 17 • 38

Scaling Diffusion Transformers to 16 Billion Parameters

Paper • 2407.11633 • Published Jul 16 • 25

GRUtopia: Dream General Robots in a City at Scale

Paper • 2407.10943 • Published Jul 15 • 23

Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models

Paper • 2407.10285 • Published Jul 14 • 4

StyleSplat: 3D Object Style Transfer with Gaussian Splatting

Paper • 2407.09473 • Published Jul 12 • 10

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Paper • 2407.08348 • Published Jul 11 • 50

Video Diffusion Alignment via Reward Gradients

Paper • 2407.08737 • Published Jul 11 • 47

Controlling Space and Time with Diffusion Models

Paper • 2407.07860 • Published Jul 10 • 16

Adam Lee

AI & ML interests

Organizations

Abecid's activity

Converting Vertex-Colored Meshes to Textured Meshes

Outpainting II - Differential Diffusion

Accelerate 1.0.0

SDXL in 4 steps with Latent Consistency LoRAs

Memory-efficient Diffusion Transformers with Quanto and Diffusers

It's raining diffusion personalization techniques☔️🎭🖼️

TGI Multi-LoRA: Deploy Once, Serve 30 Models