diwank's Collections
Preview
•
Updated
•
7
•
74
argilla/intel-orca-dpo-pairs-helm-instruct
Viewer
•
Updated
•
5
•
2
•
1
argilla/OpenHermes2.5-dpo-binarized-alpha
Viewer
•
Updated
•
9.79k
•
12
•
64
argilla/ultrafeedback-critique
Viewer
•
Updated
•
253k
•
4
•
4
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
•
Updated
•
60.9k
•
5.33k
•
116
ai2lumos/lumos_maths_plan_onetime
Viewer
•
Updated
•
19.8k
•
5
•
2
ai2lumos/lumos_unified_plan_iterative
Viewer
•
Updated
•
55.4k
•
5
•
2
ai2lumos/lumos_complex_qa_plan_onetime
Viewer
•
Updated
•
19.4k
•
5
•
3
Viewer
•
Updated
•
10k
•
163
•
27
lmsys/mt_bench_human_judgments
Viewer
•
Updated
•
5.76k
•
288
•
108
lmsys/chatbot_arena_conversations
Viewer
•
Updated
•
33k
•
794
•
330
Nexusflow/Starling-LM-7B-beta
Text Generation
•
Updated
•
4.77k
•
340
Qwen/Qwen1.5-32B
Text Generation
•
Updated
•
12.2k
•
81
vicgalle/configurable-system-prompt-multitask
Viewer
•
Updated
•
1.95k
•
38
•
19
paraloq/json_data_extraction
Viewer
•
Updated
•
484
•
23
•
16
Viewer
•
Updated
•
479
•
13
•
4
iamtarun/python_code_instructions_18k_alpaca
Viewer
•
Updated
•
18.6k
•
5.93k
•
205
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Paper
•
2403.15042
•
Published
•
24
Viewer
•
Updated
•
2.35k
•
2
•
1
Paper
•
2402.12219
•
Published
•
15
Viewer
•
Updated
•
20.2k
•
76
•
29
M4-ai/prm_dpo_pairs_cleaned
Viewer
•
Updated
•
7.99k
•
48
•
10
SanjiWatsuki/Kunoichi-DPO-v2-7B
Text Generation
•
Updated
•
722
•
80
Viewer
•
Updated
•
17.3k
•
5
•
19
mlabonne/orpo-dpo-mix-40k
Viewer
•
Updated
•
44.2k
•
1.21k
•
227
Viewer
•
Updated
•
530k
•
1.16k
•
115
meta-llama/Meta-Llama-3-8B
Text Generation
•
Updated
•
1.8M
•
5.71k
Viewer
•
Updated
•
149k
•
11
•
6
FreedomIntelligence/evol-instruct-hindi
Viewer
•
Updated
•
59k
•
23
•
1
totally-not-an-llm/EverythingLM-data-V3
Viewer
•
Updated
•
1.07k
•
9
•
31
RUCAIBox/Story-Generation
Updated
•
1
•
11
imone/Llama-3-8B-fixed-special-embedding
Text Generation
•
Updated
•
378
•
15
Viewer
•
Updated
•
49.6k
•
199
•
103
Norquinal/claude_multiround_chat_30k
Viewer
•
Updated
•
32.2k
•
114
•
50
Norquinal/claude_multi_instruct_30k
Viewer
•
Updated
•
32.2k
•
30
•
11
Viewer
•
Updated
•
1.72M
•
2
•
8
Locutusque/OpenCerebrum-2.0-SFT
Viewer
•
Updated
•
6.4k
•
2
•
5
Locutusque/OpenCerebrum-2.0-DPO
Viewer
•
Updated
•
720
•
2
•
4
Preview
•
Updated
•
2
•
12
Preview
•
Updated
•
26
•
25
gradientai/Llama-3-70B-Instruct-Gradient-262k
Text Generation
•
Updated
•
209
•
55
princeton-nlp/QuRating-GPT3.5-Judgments
Viewer
•
Updated
•
250k
•
58
•
5
Viewer
•
Updated
•
1.46M
•
54
•
15
mustafaaljadery/gemma-2B-10M
jondurbin/airoboros-70b-3.3
Text Generation
•
Updated
•
2.26k
•
14
princeton-nlp/Llama-3-Instruct-8B-SimPO
Text Generation
•
Updated
•
1.48k
•
55
Viewer
•
Updated
•
21.4k
•
8.91k
•
221
nvidia/Nemotron-4-340B-Reward
Updated
•
25
•
105
Magpie-Align/Magpie-Pro-MT-300K-v0.1
Viewer
•
Updated
•
300k
•
1k
•
27
Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.1
Text Generation
•
Updated
•
348
•
4
nvidia/Aegis-AI-Content-Safety-Dataset-1.0
Viewer
•
Updated
•
12k
•
1.32k
•
38
Salesforce/xlam-function-calling-60k
Viewer
•
Updated
•
60k
•
3.17k
•
364
Viewer
•
Updated
•
20.4M
•
5.34k
•
502
diwank/llmlingua-compressed-text
Viewer
•
Updated
•
222k
•
2
•
2
diwank/python-code-execution-output
Viewer
•
Updated
•
3.61k
•
3
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices
Paper
•
2406.08451
•
Published
•
23
Viewer
•
Updated
•
99.5k
•
1.23k
•
17
cognitivecomputations/samantha-1.5
Viewer
•
Updated
•
327
•
19
•
11
Viewer
•
Updated
•
728
•
3
•
8
HannahRoseKirk/prism-alignment
Viewer
•
Updated
•
77.9k
•
710
•
58
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
Updated
•
36.8k
•
134
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
Updated
•
15.2k
•
46
PKU-Alignment/PKU-SafeRLHF-30K
Viewer
•
Updated
•
29.9k
•
235
•
7
instruction-pretrain/ft-instruction-synthesizer-collection
Viewer
•
Updated
•
249k
•
300
•
58
Viewer
•
Updated
•
11.1M
•
1.79k
•
52
Viewer
•
Updated
•
68.8k
•
2
•
20
Viewer
•
Updated
•
12.7k
•
2
•
5
imbue/human_question_quality_judgments
Viewer
•
Updated
•
167k
•
2
•
8
Viewer
•
Updated
•
54k
•
352
•
19
imbue/high_quality_public_evaluations
Viewer
•
Updated
•
12.8k
•
684
•
5
imbue/high_quality_private_evaluations
Viewer
•
Updated
•
10.6k
•
3
•
7
google/gemma-2-27b
Text Generation
•
Updated
•
23.9k
•
164
Viewer
•
Updated
•
1.46M
•
2
•
4
UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
Text Generation
•
Updated
•
4.73k
•
75
Viewer
•
Updated
•
375k
•
1.14k
•
428
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
•
2406.20094
•
Published
•
94
Viewer
•
Updated
•
1.24M
•
7
•
6
Viewer
•
Updated
•
1.25M
•
11
•
4
Viewer
•
Updated
•
2.05M
•
5
•
3
Viewer
•
Updated
•
326k
•
5
•
8
hubertsiuzdak/snac_24khz
Updated
•
131k
•
13
hubertsiuzdak/snac_32khz
Updated
•
42.9k
•
5
hubertsiuzdak/snac_44khz
Updated
•
40.9k
•
5
facebook/chameleon-30b
Image-Text-to-Text
•
Updated
•
261
•
81
facebook/chameleon-7b
Image-Text-to-Text
•
Updated
•
24.9k
•
160
gokaygokay/random_instruct_docci
Viewer
•
Updated
•
14.6k
•
7
•
5
internlm/internlm2_5-7b
Text Generation
•
Updated
•
3.52k
•
15
Gryphe/Opus-WritingPrompts
Viewer
•
Updated
•
14.9k
•
175
•
27
Viewer
•
Updated
•
3k
•
2
•
9
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets
Paper
•
2405.18952
•
Published
•
10
OpenGVLab/InternVL2-4B
Image-Text-to-Text
•
Updated
•
19.3k
•
34
OpenGVLab/InternVL2-Llama3-76B
Image-Text-to-Text
•
Updated
•
162k
•
190
QuasarResearch/apollo-preview-v0.2
Viewer
•
Updated
•
51.4k
•
90
•
42
fireworks-ai/nexus_parallel_messages
Viewer
•
Updated
•
70
•
2
•
6
fireworks-ai/nexus_parallel_functions
Viewer
•
Updated
•
29
•
2
•
4
Viewer
•
Updated
•
539
•
9
•
21
Viewer
•
Updated
•
18.6k
•
3
•
7
Viewer
•
Updated
•
259
•
2
•
2
Viewer
•
Updated
•
486k
•
23
•
35
Viewer
•
Updated
•
1.75M
•
341
•
70
Viewer
•
Updated
•
860k
•
2.64k
•
184
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
Viewer
•
Updated
•
181k
•
891
•
71
chargoddard/WebInstructSub-prometheus
Viewer
•
Updated
•
2.39M
•
30
•
16
Viewer
•
Updated
•
1.96k
•
8
•
29
Viewer
•
Updated
•
294k
•
17
•
24
chargoddard/chai-feedback-pairs
Viewer
•
Updated
•
30.1k
•
4
•
5
nayohan/multi_session_chat
Viewer
•
Updated
•
23.4k
•
5
•
1
nvidia/Mistral-NeMo-12B-Instruct
Updated
•
116
•
133
nvidia/Mistral-NeMo-12B-Base
Updated
•
185
•
27
meta-llama/Llama-3.1-8B
Text Generation
•
Updated
•
630k
•
909
meta-llama/Prompt-Guard-86M
Text Classification
•
Updated
•
35.6k
•
179
Viewer
•
Updated
•
6.41k
•
86
•
27
mistralai/Mistral-Large-Instruct-2407
Text Generation
•
Updated
•
20.3k
•
747
Symbol-LLM/Symbolic_Collection
Viewer
•
Updated
•
975k
•
6
•
6
Viewer
•
Updated
•
100k
•
4.81k
•
86
roborovski/dolly-entity-extraction
Viewer
•
Updated
•
5.95k
•
10
•
2
kalomaze/Opus_Instruct_25k
Viewer
•
Updated
•
25.1k
•
68
•
28
Vezora/Code-Preference-Pairs
Viewer
•
Updated
•
54k
•
229
•
11
Nexusflow/Athene-70B
Text Generation
•
Updated
•
5.7k
•
175
arcee-ai/Arcee-Spark
Text Generation
•
Updated
•
3.48k
•
85
Viewer
•
Updated
•
270k
•
9
•
7
OpenBuddy/openbuddy-llama3.1-8b-v22.2-131k
Text Generation
•
Updated
•
203
•
2
google/gemma-2-2b
Text Generation
•
Updated
•
4.55M
•
356
google/gemma-scope
google/shieldgemma-2b
Text Generation
•
Updated
•
5.45k
•
43
Viewer
•
Updated
•
11.2k
•
2
•
6
argilla/magpie-ultra-v0.1
Viewer
•
Updated
•
50k
•
914
•
203
mlabonne/Llama-3.1-70B-Instruct-lorablated-GGUF
Updated
•
1.48k
•
35
Viewer
•
Updated
•
55.1k
•
49
•
87
internlm/internlm2_5-20b
Text Generation
•
Updated
•
276
•
16
Viewer
•
Updated
•
1.02k
•
4
•
13
Viewer
•
Updated
•
2.39M
•
2
•
8
Viewer
•
Updated
•
6k
•
765
•
159
Viewer
•
Updated
•
282
•
2
•
1
Gryphe/Sonnet3.5-Charcard-Roleplay
Updated
•
518
•
30
NousResearch/hermes-function-calling-v1
Viewer
•
Updated
•
11.6k
•
5.15k
•
202
AlgorithmicResearchGroup/ArXivDLInstruct
Viewer
•
Updated
•
778k
•
78
•
12
upstage/solar-pro-preview-instruct
Text Generation
•
Updated
•
4.45k
•
401
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
•
377
arcee-ai/Llama-3.1-SuperNova-Lite
Text Generation
•
Updated
•
10.4k
•
162
Skywork/Skywork-Reward-Gemma-2-27B
Text Classification
•
Updated
•
3.89k
•
32
Viewer
•
Updated
•
59.4k
•
382
•
56
Updated
•
78
•
47
argilla/FinePersonas-v0.1
Viewer
•
Updated
•
21.1M
•
361
•
299
Training Language Models to Self-Correct via Reinforcement Learning
Paper
•
2409.12917
•
Published
•
128
bespokelabs/Bespoke-MiniCheck-7B
Text Classification
•
Updated
•
3.32k
•
31