Jon Durbin
jondurbin
jondurbin's activity
Update README.md with license information
#5 opened 3 months ago
by
Chen-01AI
airoboros-110b-3.3 disappeared after running?
3
#746 opened 5 months ago
by
jondurbin
Question
2
#1 opened 5 months ago
by
dillfrescott
Model name
1
#1 opened 5 months ago
by
Ezk-Trahu-77
Thank you! Got more details on the fine tuning?
2
#1 opened 6 months ago
by
KnutJaegersberg
Holy shit this model is amazing!
1
#1 opened 6 months ago
by
PartTimePhilosopher
amazing model...can you finetune on a smaller one?
2
#4 opened 6 months ago
by
aaha
Okay, here's a review. Sorta.
1
#3 opened 6 months ago
by
MateoTeo
Hey, got an interesting problem here.
2
#6 opened 7 months ago
by
MateoTeo
Yi-34b-200k v2, in the cards?
2
#2 opened 7 months ago
by
SabinStargem
Retrain with the latest Yi-34B-200K?
1
#1 opened 7 months ago
by
Hoioi
Weight updates?
8
#13 opened 7 months ago
by
brucethemoose
I can’t help but feel like it is worse.
4
#4 opened 7 months ago
by
Nycoorias
When can we anticipate the release of the DPO version?
2
#3 opened 8 months ago
by
HR1777
Difference between v0.2 and v0.4?
1
#2 opened 8 months ago
by
Light4Bear
Could you please quantize the model?
2
#1 opened 8 months ago
by
Serpen
wtf is this?
1
#1 opened 8 months ago
by
biship
Chatml format for Bagel
1
#4 opened 8 months ago
by
adam3245
Dataset with normal text output?
1
#2 opened 8 months ago
by
HankN
Applied reversely for alignment?
4
#2 opened 10 months ago
by
Yhyu13
Weird output with instruction following
1
#4 opened 9 months ago
by
ndurkee
[bot] Conversion to Parquet
#1 opened 10 months ago
by
parquet-converter
this is really great dataset
1
#2 opened 9 months ago
by
cloudyu
[fine-tuning] attention_dropout not defined
#2 opened 9 months ago
by
jondurbin
Benchmarks?
1
#2 opened 9 months ago
by
rombodawg
Remove mathinstruct
1
#3 opened 9 months ago
by
distantquant
Thank you for your model!
11
#1 opened 9 months ago
by
rombodawg
How many GPUs and how much GPU time were used for this training?
2
#3 opened 9 months ago
by
aisensiy
Add some additional metadata
#1 opened 9 months ago
by
davanstrien
Empty rows
4
#2 opened 9 months ago
by
HoangHa
add code language metadata
#1 opened 9 months ago
by
davanstrien
Update Massed Compute rental. New Coupon Code
#3 opened 9 months ago
by
nic-mc
Great Model and Name ;-)
1
#2 opened 9 months ago
by
DaryoushV
Include Massed Compute VM with Steps
#1 opened 9 months ago
by
nic-mc
DPO ruined Bagel's versatility
1
#2 opened 9 months ago
by
Henk717
Nice model!
4
#1 opened 9 months ago
by
acrastt
Context Length?
4
#1 opened 9 months ago
by
brucethemoose
Could you please finetune Bagel on Solar 10.7B too?
2
#1 opened 9 months ago
by
HR1777
ChatML format
1
#1 opened 10 months ago
by
andysalerno
Space after [/INST]
7
#2 opened 12 months ago
by
Satya93
[bot] Conversion to Parquet
#1 opened 10 months ago
by
parquet-converter
Any positive results so far?
1
#1 opened 11 months ago
by
Thireus
Mistral Model?
1
#1 opened 11 months ago
by
jjboi8708
Max Context Token Length
2
#1 opened 11 months ago
by
lazyDataScientist
License?
5
#1 opened 12 months ago
by
acrastt
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Ability to generalise
6
#1 opened 12 months ago
by
vmajor
ChatML prompt format confusion - please reconsider
36
#3 opened about 1 year ago
by
kalomaze
Update tokenizer_config.json
#1 opened 12 months ago
by
jondurbin
Update tokenizer_config.json
2
#1 opened 12 months ago
by
jondurbin
Remove non-safe model files.
#1 opened about 1 year ago
by
jondurbin