Hugging Face
Yaowei Zheng
hiyouga
620 followers · 14 following
https://github.com/hiyouga
llamafactory_ai
hiyouga
AI & ML interests
LLM Knowledge Management
Articles
GaLore: Advancing Large Model Training on Consumer-grade Hardware
Mar 20
•
24
Organizations
hiyouga's activity
New activity in
hiyouga/Qwen2-VL-7B-Pokemon
about 1 month ago
How to finetune model?
1
#1 opened about 1 month ago by
baohuynhbk14
New activity in
llamafactory/pokemon-gpt4o-captions
about 1 month ago
[bot] Conversion to Parquet
#1 opened about 1 month ago by
parquet-converter
New activity in
Qwen/Qwen2-VL-7B-Instruct
about 1 month ago
LoRA Finetuning Tool for Qwen2-VL-7B in Web UI (DPO updated)
10
#2 opened about 1 month ago by
hiyouga
New activity in
hiyouga/LLaMA-Board
2 months ago
Upload 2 files
#12 opened 2 months ago by
predictanythingsoftware
New activity in
llamafactory/demo_data
4 months ago
[bot] Conversion to Parquet
#1 opened 4 months ago by
parquet-converter
New activity in
THUDM/glm-4-9b
4 months ago
Fix tensor shape error
#7 opened 4 months ago by
hiyouga
New activity in
llamafactory/tiny-supervised-dataset
4 months ago
[bot] Conversion to Parquet
#1 opened 4 months ago by
parquet-converter
New activity in
llamafactory/ultrafeedback_binarized
4 months ago
[bot] Conversion to Parquet
#1 opened 4 months ago by
parquet-converter
New activity in
hiyouga/LLaMA-Board
4 months ago
Technical_reports
1
#10 opened 5 months ago by
Parssky
New activity in
hiyouga/PaliGemma-3B-Chat-v0.1
4 months ago
finetune bug
2
#1 opened 4 months ago by
Raku-Yihan
New activity in
BUAADreamer/PaliGemma-3B-Chat-v0.2
4 months ago
Update README.md
#3 opened 4 months ago by
hiyouga
Update tokenizer_config.json
#2 opened 4 months ago by
hiyouga
Update config.json
#1 opened 4 months ago by
hiyouga
New activity in
THUDM/glm-4-9b-chat
4 months ago
fix tensor shape error when torch version less than 2 #4
#9 opened 4 months ago by
hiyouga
New activity in
THUDM/glm-4-9b-chat-1m
4 months ago
fix tensor shape error when torch version less than 2
#4 opened 4 months ago by
hiyouga
fixes https://github.com/THUDM/GLM-4/issues/22
#1 opened 4 months ago by
hiyouga
New activity in
THUDM/glm-4-9b-chat
4 months ago
fixes https://github.com/THUDM/GLM-4/issues/22
#5 opened 4 months ago by
hiyouga
New activity in
llamafactory/PaliGemma-3B-Chat-v0.2
4 months ago
Great work!
3
#1 opened 4 months ago by
merve
New activity in
BUAADreamer/Yi-VL-6B-hf
5 months ago
Update config.json
#3 opened 5 months ago by
hiyouga
Update config.json
#2 opened 5 months ago by
hiyouga
Update config.json
#1 opened 5 months ago by
hiyouga
New activity in
shenzhi-wang/Llama3-70B-Chinese-Chat
5 months ago
Update tokenizer_config.json
#2 opened 5 months ago by
hiyouga
Update config.json
#1 opened 5 months ago by
hiyouga
New activity in
shenzhi-wang/Llama3-8B-Chinese-Chat
5 months ago
Update README.md
#20 opened 5 months ago by
hiyouga
Update README.md
#19 opened 5 months ago by
hiyouga
Delete trainer_log.jsonl
#18 opened 5 months ago by
hiyouga
Delete all_results.json
#17 opened 5 months ago by
hiyouga
BFloat16 is not supported on MPS
5
#13 opened 5 months ago by
RDY97
New activity in
hiyouga/LLaMA-Board
5 months ago
llama3 available on the local demo but is unavailable on the Spaces
1
#9 opened 5 months ago by
ysharma
New activity in
hiyouga/DPO-En-Zh-20k
6 months ago
[bot] Conversion to Parquet
#1 opened 6 months ago by
parquet-converter
New activity in
shenzhi-wang/Llama3-8B-Chinese-Chat
6 months ago
🚀Fix metadata dict bug
#10 opened 6 months ago by
hiyouga
Delete training_args.bin
#9 opened 6 months ago by
hiyouga
Update generation_config.json
#6 opened 6 months ago by
hiyouga
Update generation_config.json
#7 opened 6 months ago by
hiyouga
Update config.json
#5 opened 6 months ago by
hiyouga
Update model.safetensors.index.json
#4 opened 6 months ago by
hiyouga
🚀 Fix the bug of checkpoint files
#3 opened 6 months ago by
hiyouga
add Usage
#2 opened 6 months ago by
hiyouga
Update README.md
#1 opened 6 months ago by
hiyouga
New activity in
hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling
6 months ago
just for curiosity
9
#1 opened 7 months ago by
prudant
New activity in
llamafactory/adgen_tiny
6 months ago
[bot] Conversion to Parquet
#1 opened 6 months ago by
parquet-converter
New activity in
hiyouga/LLaMA-Board
7 months ago
Add link to paper so it's automatically linked from Arxiv and paper page
#8 opened 7 months ago by
osanseviero
Update data/dataset_info.json
1
#3 opened 7 months ago by
tonymds
Upload dev.csv
#4 opened 7 months ago by
zongyang
Upload jd.json
#5 opened 7 months ago by
zongyang
Create a
#6 opened 7 months ago by
zongyang
Create a.json
#7 opened 7 months ago by
zongyang
New activity in
hiyouga/Qwen-14B-Chat-LLaMAfied
7 months ago
Adding Evaluation Results
#2 opened 7 months ago by
leaderboard-pr-bot
New activity in
google/gemma-7b-it
7 months ago
how to extract model response from the output of tokenizer
3
#54 opened 7 months ago by
mans-0987
New activity in
baichuan-inc/Baichuan2-13B-Chat
8 months ago
Missing module: torch.utils.checkpoint
#13 opened about 1 year ago by
hiyouga
New activity in
google/gemma-2b-it
8 months ago
Update readme to match chat template
1
#22 opened 8 months ago by
hiyouga
Update chat template
2
#21 opened 8 months ago by
pcuenq
New activity in
google/gemma-7b-it
8 months ago
Fix chat template does not compatible with ConversationalPipeline
5
#42 opened 8 months ago by
hiyouga
New activity in
mistralai/Mistral-7B-v0.1
8 months ago
How to finetune this model mistralai/Mistral-7B-v0.1 and also merge the weights
5
#126 opened 8 months ago by
yeniceriSGK
New activity in
mistralai/Mixtral-8x7B-v0.1
9 months ago
How to fine tune mixtral 8x7B?
3
#30 opened 9 months ago by
tzivi
Fine-tuning toolkit for Mixtral 8x7B MoE model
18
#10 opened 10 months ago by
hiyouga
New activity in
THUDM/chatglm3-6b
9 months ago
fix can't set attribute 'eos_token' when loading the saved tokenizer
#27 opened 9 months ago by
hiyouga
New activity in
hiyouga/Qwen-14B-Chat-LLaMAfied
9 months ago
eval error with LLaMA-Factory
4
#1 opened 9 months ago by
charry2000
New activity in
hiyouga/Baichuan2-7B-Chat-LLaMAfied
11 months ago
Adding Evaluation Results
#1 opened 11 months ago by
leaderboard-pr-bot
New activity in
hiyouga/Baichuan2-7B-Base-LLaMAfied
11 months ago
Adding Evaluation Results
#1 opened 11 months ago by
leaderboard-pr-bot