kai mo's picture

12 7

kai mo

k3vlm

AI & ML interests

None yet

Organizations

None yet

k3vlm's activity

upvoted 2 papers about 1 month ago

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline

Paper • 2408.15079 • Published Aug 27 • 51

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27 • 138

upvoted 3 papers about 2 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21 • 53

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20 • 56

Imagen 3

Paper • 2408.07009 • Published Aug 13 • 60

upvoted 3 papers 2 months ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1 • 105

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31 • 103

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Paper • 2407.19985 • Published Jul 29 • 34

upvoted a paper 4 months ago

Jailbreaking as a Reward Misspecification Problem

Paper • 2406.14393 • Published Jun 20 • 12

upvoted a paper 6 months ago

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 104

upvoted 2 papers 7 months ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8 • 59

InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding

Paper • 2403.01487 • Published Mar 3 • 15