Sylvain Filoni
fffiloni
AI & ML interests
ML for Animation • Alumni Arts Déco Paris
Articles
Organizations
fffiloni's activity
Disco4D: Disentangled 4D Human Generation and Animation from a Single Image
Paper
•
2409.17280
•
Published
•
8
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper
•
2409.18124
•
Published
•
23
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation
Paper
•
2409.18964
•
Published
•
20
upvoted
an
article
9 days ago
Article
🌟 Easy Fine-Tuning with Hugging Face SQL Console, Notebook Creator, and SFT
By
•
•
12upvoted
a
collection
9 days ago
upvoted
a
paper
9 days ago
upvoted
a
paper
10 days ago
upvoted
an
article
13 days ago
Article
Exploring the Daily Papers Page on Hugging Face
•
25
upvoted
an
article
15 days ago
Article
Introducing Community Tools on HuggingChat
•
26
Apollo: Band-sequence Modeling for High-Quality Audio Restoration
Paper
•
2409.08514
•
Published
•
8
DrawingSpinUp: 3D Animation from Single Character Drawings
Paper
•
2409.08615
•
Published
•
14
InstantDrag: Improving Interactivity in Drag-based Image Editing
Paper
•
2409.08857
•
Published
•
30
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
Paper
•
2409.09213
•
Published
•
10
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Paper
•
2409.09214
•
Published
•
45
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer
Paper
•
2409.08425
•
Published
•
9
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
Paper
•
2409.12576
•
Published
•
14
FlexiTex: Enhancing Texture Generation with Visual Guidance
Paper
•
2409.12431
•
Published
•
9
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
Paper
•
2409.12957
•
Published
•
17
LVCD: Reference-based Lineart Video Colorization with Diffusion Models
Paper
•
2409.12960
•
Published
•
20
upvoted
an
article
22 days ago
Article
"Diffusers Image Fill" guide
By
•
•
31upvoted
a
paper
23 days ago
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper
•
2409.02097
•
Published
•
31
FLUX that Plays Music
Paper
•
2409.00587
•
Published
•
31
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges
Paper
•
2409.01071
•
Published
•
26
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Paper
•
2409.02245
•
Published
•
9
Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation
Paper
•
2409.03718
•
Published
•
25
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing
Paper
•
2409.01322
•
Published
•
95
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency
Paper
•
2409.02634
•
Published
•
85
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper
•
2409.02095
•
Published
•
33
Kalman-Inspired Feature Propagation for Video Face Super-Resolution
Paper
•
2408.05205
•
Published
•
8
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation
Paper
•
2408.15239
•
Published
•
27
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
Paper
•
2408.14211
•
Published
•
8
Diffusion Models Are Real-Time Game Engines
Paper
•
2408.14837
•
Published
•
121
Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion
Paper
•
2408.00458
•
Published
•
10
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models
Paper
•
2408.00735
•
Published
•
15
SAM 2: Segment Anything in Images and Videos
Paper
•
2408.00714
•
Published
•
105
MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
Paper
•
2408.01337
•
Published
•
10
TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling
Paper
•
2408.01291
•
Published
•
11
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Paper
•
2408.03284
•
Published
•
9
Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source Separation
Paper
•
2408.03588
•
Published
•
6
Fast Sprite Decomposition from Animated Graphics
Paper
•
2408.03923
•
Published
•
7
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches
Paper
•
2408.04567
•
Published
•
23
upvoted
an
article
about 2 months ago
Article
A Complete Guide to Audio Datasets
•
17
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Paper
•
2403.14610
•
Published
•
3
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Paper
•
2407.11398
•
Published
•
8
Kinetic Typography Diffusion Model
Paper
•
2407.10476
•
Published
•
1
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
Paper
•
2407.19548
•
Published
•
22
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models
Paper
•
2407.19474
•
Published
•
22
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture
Paper
•
2407.19593
•
Published
•
12
Artist: Aesthetically Controllable Text-Driven Stylization without Training
Paper
•
2407.15842
•
Published
•
13
AccDiffusion: An Accurate Method for Higher-Resolution Image Generation
Paper
•
2407.10738
•
Published
•
3
DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors
Paper
•
2407.16260
•
Published
•
1
SHIC: Shape-Image Correspondences with no Keypoint Supervision
Paper
•
2407.18907
•
Published
•
39
Text2Place: Affordance-aware Text Guided Human Placement
Paper
•
2407.15446
•
Published
•
2
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation
Paper
•
2407.17952
•
Published
•
27
Floating No More: Object-Ground Reconstruction from a Single Image
Paper
•
2407.18914
•
Published
•
18
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper
•
2407.14177
•
Published
•
42
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Paper
•
2407.01494
•
Published
•
13