Baichuan Zhou

bczhou

https://baichuanzhou.github.io/

baichuanzhou

AI & ML interests

Computer Vision

bczhou's activity

upvoted a paper about 19 hours ago

Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining

Paper • 2410.08102 • Published 4 days ago • 8

upvoted 2 papers 4 days ago

Pixtral 12B

Paper • 2410.07073 • Published 5 days ago • 54

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published 5 days ago • 65

upvoted a paper 13 days ago

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Paper • 2409.20566 • Published 14 days ago • 49

upvoted a paper 14 days ago

MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published 17 days ago • 24

upvoted 6 papers about 1 month ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4 • 72

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2 • 95

CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation

Paper • 2409.03643 • Published Sep 5 • 18

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3 • 77

CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis

Paper • 2408.14765 • Published Aug 27 • 12

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios

Paper • 2408.17267 • Published Aug 30 • 22

upvoted a collection 2 months ago

InternLM-XComposer2.5

3 items • Updated Jul 19 • 5

upvoted a paper 3 months ago

TinyLLaVA: A Framework of Small-scale Large Multimodal Models

Paper • 2402.14289 • Published Feb 22 • 19

upvoted a collection 4 months ago

TinyLLaVA

TinyLLaVA: A Framework of Small-scale Large Multimodal Models • 7 items • Updated Mar 19 • 5