Edward Alexis Vásquez Becerra

edalvb

edalvb's activity

upvoted a paper 5 months ago

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published Apr 30 • 71

upvoted a paper 6 months ago

Gecko: Versatile Text Embeddings Distilled from Large Language Models

Paper • 2403.20327 • Published Mar 29 • 47

upvoted 2 papers 7 months ago

RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS

Paper • 2403.13806 • Published Mar 20 • 18

Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing

Paper • 2402.15151 • Published Feb 23 • 7

upvoted a collection 8 months ago

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 6 days ago • 35

upvoted a paper 8 months ago

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

Paper • 2401.13795 • Published Jan 24 • 64

upvoted a paper 9 months ago

Lumiere: A Space-Time Diffusion Model for Video Generation

Paper • 2401.12945 • Published Jan 23 • 86

upvoted 14 papers about 1 year ago

Text-to-3D using Gaussian Splatting

Paper • 2309.16585 • Published Sep 28, 2023 • 31

ProPainter: Improving Propagation and Transformer for Video Inpainting

Paper • 2309.03897 • Published Sep 7, 2023 • 26

Tracking Anything with Decoupled Video Segmentation

Paper • 2309.03903 • Published Sep 7, 2023 • 27

DrugChat: Towards Enabling ChatGPT-Like Capabilities on Drug Molecule Graphs

Paper • 2309.03907 • Published May 18, 2023 • 8

Large-Scale Automatic Audiobook Creation

Paper • 2309.03926 • Published Sep 7, 2023 • 53

NExT-GPT: Any-to-Any Multimodal LLM

Paper • 2309.05519 • Published Sep 11, 2023 • 78

PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models

Paper • 2309.05793 • Published Sep 11, 2023 • 50

MagiCapture: High-Resolution Multi-Concept Portrait Customization

Paper • 2309.06895 • Published Sep 13, 2023 • 27

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Paper • 2308.07926 • Published Aug 15, 2023 • 27

DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory

Paper • 2308.08089 • Published Aug 16, 2023 • 21

TeCH: Text-guided Reconstruction of Lifelike Clothed Humans

Paper • 2308.08545 • Published Aug 16, 2023 • 33

AutoDecoding Latent 3D Diffusion Models

Paper • 2307.05445 • Published Jul 7, 2023 • 13

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Paper • 2307.04725 • Published Jul 10, 2023 • 64

Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation

Paper • 2307.03869 • Published Jul 8, 2023 • 22

upvoted 17 papers over 1 year ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 80

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 80

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Paper • 2306.17843 • Published Jun 30, 2023 • 43

One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization

Paper • 2306.16928 • Published Jun 29, 2023 • 38

Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language

Paper • 2306.16410 • Published Jun 28, 2023 • 27

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

Paper • 2305.10973 • Published May 18, 2023 • 31

Segment Anything in High Quality

Paper • 2306.01567 • Published Jun 2, 2023 • 7

GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields

Paper • 2306.06044 • Published Jun 9, 2023 • 4

DreamEditor: Text-Driven 3D Scene Editing with Neural Fields

Paper • 2306.13455 • Published Jun 23, 2023 • 8

Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields

Paper • 2306.12760 • Published Jun 22, 2023 • 8

Fast Segment Anything

Paper • 2306.12156 • Published Jun 21, 2023 • 34

Progressively Optimized Local Radiance Fields for Robust View Synthesis

Paper • 2303.13791 • Published Mar 24, 2023 • 7

Large-scale Language Model Rescoring on Long-form Data

Paper • 2306.08133 • Published Jun 13, 2023 • 4

MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing

Paper • 2306.10012 • Published Jun 16, 2023 • 35

NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Paper • 2306.09109 • Published Jun 15, 2023 • 4

Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models

Paper • 2306.08997 • Published Jun 15, 2023 • 10

TryOnDiffusion: A Tale of Two UNets

Paper • 2306.08276 • Published Jun 14, 2023 • 72