zhangysk (Ge Zhang)

upvoted a paper 6 days ago

MIO: A Foundation Model on Multimodal Tokens

Paper • 2409.17692 • Published 10 days ago • 45

upvoted 2 papers 11 days ago

OmniBench: Towards The Future of Universal Omni-Language Models

Paper • 2409.15272 • Published 12 days ago • 24

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published 11 days ago • 40

upvoted a paper 18 days ago

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Paper • 2409.09214 • Published 22 days ago • 45

upvoted a paper 25 days ago

SongCreator: Lyrics-based Universal Song Generation

Paper • 2409.06029 • Published 26 days ago • 19

upvoted a paper 26 days ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4 • 72

upvoted a paper 30 days ago

FuzzCoder: Byte-level Fuzzing Test via Large Language Model

Paper • 2409.01944 • Published Sep 3 • 44

upvoted a paper about 1 month ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4 • 27

upvoted a collection about 1 month ago

OLMoE

Collection

Artifacts for open mixture-of-experts language models. • 13 items • Updated 11 days ago • 21

upvoted a paper about 1 month ago

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26 • 38

upvoted 3 papers about 2 months ago

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17 • 51

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15 • 51

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

Paper • 2408.08072 • Published Aug 15 • 31

upvoted 3 papers 3 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 85

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published Jun 25 • 20

MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Paper • 2406.15252 • Published Jun 21 • 14

upvoted 4 papers 4 months ago

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Paper • 2406.13923 • Published Jun 20 • 21

McEval: Massively Multilingual Code Evaluation

Paper • 2406.07436 • Published Jun 11 • 39

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models

Paper • 2406.05862 • Published Jun 9 • 4

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29 • 43

upvoted a paper 6 months ago

Sailor: Open Language Models for South-East Asia

Paper • 2404.03608 • Published Apr 4 • 20

upvoted a collection 6 months ago

MusiLingo

Collection

These are the checkpoints and datasets for MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response • 5 items • Updated Apr 4 • 2

upvoted a paper 7 months ago

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 61

upvoted a collection 7 months ago

StructLM

Collection

The structure knowledge grounded language model • 6 items • Updated Apr 6 • 6

upvoted a paper 7 months ago

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

Paper • 2402.16671 • Published Feb 26 • 26

upvoted a collection 8 months ago

OpenCodeInterpreter

Collection

18 items • Updated Mar 3 • 82

upvoted 2 papers 8 months ago

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Paper • 2402.12226 • Published Feb 19 • 40

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

Paper • 2402.04324 • Published Feb 6 • 23

upvoted 4 papers 9 months ago

upvoted a collection 10 months ago

TIGERScore

Collection

List of model variants of TIGERScore checkpoints and the associated dataset • 8 items • Updated 9 days ago • 4

upvoted 3 papers 10 months ago

LLM360: Towards Fully Transparent Open-Source LLMs

Paper • 2312.06550 • Published Dec 11, 2023 • 56

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers

Paper • 2311.17136 • Published Nov 28, 2023 • 7

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 35

zhangysk's activity

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

LLaMA Beyond English: An Empirical Study on Language Capability Transfer

A Comprehensive Study of Knowledge Editing for Large Language Models
