Yaowei Zheng

#1 opened about 1 month ago by

baohuynhbk14

New activity in llamafactory/pokemon-gpt4o-captions about 1 month ago

[bot] Conversion to Parquet

#1 opened about 1 month ago by

New activity in Qwen/Qwen2-VL-7B-Instruct about 1 month ago

LoRA Finetuning Tool for Qwen2-VL-7B in Web UI (DPO updated)

10

#2 opened about 1 month ago by

New activity in hiyouga/LLaMA-Board 2 months ago

Upload 2 files

#12 opened 2 months ago by

predictanythingsoftware

New activity in llamafactory/demo_data 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by

New activity in THUDM/glm-4-9b 4 months ago

Fix tensor shape error

#7 opened 4 months ago by

New activity in llamafactory/tiny-supervised-dataset 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by

New activity in llamafactory/ultrafeedback_binarized 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by

New activity in hiyouga/LLaMA-Board 4 months ago

Technical_reports

#10 opened 5 months ago by

Parssky

New activity in hiyouga/PaliGemma-3B-Chat-v0.1 4 months ago

finetune bug

2

#1 opened 4 months ago by

Raku-Yihan

New activity in BUAADreamer/PaliGemma-3B-Chat-v0.2 4 months ago

Update README.md

#3 opened 4 months ago by

Update tokenizer_config.json

#2 opened 4 months ago by

Update config.json

#1 opened 4 months ago by

New activity in THUDM/glm-4-9b-chat 4 months ago

fix tensor shape error when torch version less than 2 #4

#9 opened 4 months ago by

New activity in THUDM/glm-4-9b-chat-1m 4 months ago

fix tensor shape error when torch version less than 2

#4 opened 4 months ago by

fixes https://github.com/THUDM/GLM-4/issues/22

#1 opened 4 months ago by

New activity in THUDM/glm-4-9b-chat 4 months ago

fixes https://github.com/THUDM/GLM-4/issues/22

#5 opened 4 months ago by

New activity in llamafactory/PaliGemma-3B-Chat-v0.2 4 months ago

Great work!

3

#1 opened 4 months ago by

merve

New activity in BUAADreamer/Yi-VL-6B-hf 5 months ago

Update config.json

#3 opened 5 months ago by

Update config.json

#2 opened 5 months ago by

Update config.json

#1 opened 5 months ago by

New activity in shenzhi-wang/Llama3-70B-Chinese-Chat 5 months ago

Update tokenizer_config.json

#2 opened 5 months ago by

Update config.json

#1 opened 5 months ago by

New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 5 months ago

Update README.md

#20 opened 5 months ago by

Update README.md

#19 opened 5 months ago by

Delete trainer_log.jsonl

#18 opened 5 months ago by

Delete all_results.json

#17 opened 5 months ago by

BFloat16 is not supported on MPS

5

#13 opened 5 months ago by

RDY97

New activity in hiyouga/LLaMA-Board 5 months ago

llama3 available on the local demo but is unavailable on the Spaces

#9 opened 5 months ago by

ysharma

New activity in hiyouga/DPO-En-Zh-20k 6 months ago

[bot] Conversion to Parquet

#1 opened 6 months ago by

New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 6 months ago

🚀Fix metadata dict bug

#10 opened 6 months ago by

Delete training_args.bin

#9 opened 6 months ago by

Update generation_config.json

#6 opened 6 months ago by

Update generation_config.json

#7 opened 6 months ago by

Update config.json

#5 opened 6 months ago by

Update model.safetensors.index.json

#4 opened 6 months ago by

🚀 Fix the bug of checkpoint files

#3 opened 6 months ago by

add Usage

#2 opened 6 months ago by

Update README.md

#1 opened 6 months ago by

New activity in hiyouga/Llama-2-70b-AQLM-2Bit-QLoRA-function-calling 6 months ago

just for curiosity

9

#1 opened 7 months ago by

prudant

New activity in llamafactory/adgen_tiny 6 months ago

[bot] Conversion to Parquet

#1 opened 6 months ago by

New activity in hiyouga/LLaMA-Board 7 months ago

Add link to paper so it's automatically linked from Arxiv and paper page

#8 opened 7 months ago by

osanseviero

Update data/dataset_info.json

#3 opened 7 months ago by

tonymds

Upload dev.csv

#4 opened 7 months ago by

Upload jd.json

#5 opened 7 months ago by

Create a

#6 opened 7 months ago by

Create a.json

#7 opened 7 months ago by

New activity in hiyouga/Qwen-14B-Chat-LLaMAfied 7 months ago

Adding Evaluation Results

#2 opened 7 months ago by

leaderboard-pr-bot

New activity in google/gemma-7b-it 7 months ago

how to extract model response from the output of tokenizer

3

#54 opened 7 months ago by

mans-0987

New activity in baichuan-inc/Baichuan2-13B-Chat 8 months ago

Missing module: torch.utils.checkpoint

#13 opened about 1 year ago by

New activity in google/gemma-2b-it 8 months ago

Update readme to match chat template

#22 opened 8 months ago by

Update chat template

2

#21 opened 8 months ago by

pcuenq

New activity in google/gemma-7b-it 8 months ago

Fix chat template does not compatible with ConversationalPipeline

5

#42 opened 8 months ago by

New activity in mistralai/Mistral-7B-v0.1 8 months ago

How to finetune this model mistralai/Mistral-7B-v0.1 and also merge the weights

5

#126 opened 8 months ago by

yeniceriSGK

New activity in mistralai/Mixtral-8x7B-v0.1 9 months ago

How to fine tune mixtral 8x7B?

3

#30 opened 9 months ago by

tzivi

Fine-tuning toolkit for Mixtral 8x7B MoE model

18

#10 opened 10 months ago by

New activity in THUDM/chatglm3-6b 9 months ago

fix can't set attribute 'eos_token' when loading the saved tokenizer

#27 opened 9 months ago by