Yaowei Zheng

hiyouga

AI & ML interests

LLM Knowledge Management

hiyouga's activity

New activity in hiyouga/Qwen2-VL-7B-Pokemon 17 days ago

How to finetune model?

1
#1 opened 17 days ago by baohuynhbk14
New activity in hiyouga/LLaMA-Board about 2 months ago

Upload 2 files

#12 opened about 2 months ago by predictanythingsoftware
New activity in llamafactory/demo_data 3 months ago
New activity in THUDM/glm-4-9b 3 months ago

Fix tensor shape error

#7 opened 3 months ago by hiyouga
New activity in llamafactory/tiny-supervised-dataset 3 months ago
New activity in llamafactory/ultrafeedback_binarized 3 months ago
New activity in hiyouga/LLaMA-Board 3 months ago

Technical_reports

1
#10 opened 5 months ago by Parssky
New activity in hiyouga/PaliGemma-3B-Chat-v0.1 4 months ago

finetune bug

2
#1 opened 4 months ago by Raku-Yihan
New activity in BUAADreamer/PaliGemma-3B-Chat-v0.2 4 months ago

Update README.md

#3 opened 4 months ago by hiyouga

Update tokenizer_config.json

#2 opened 4 months ago by hiyouga

Update config.json

#1 opened 4 months ago by hiyouga
New activity in THUDM/glm-4-9b-chat 4 months ago
New activity in llamafactory/PaliGemma-3B-Chat-v0.2 4 months ago

Great work!

3
#1 opened 4 months ago by merve
New activity in BUAADreamer/Yi-VL-6B-hf 4 months ago

Update config.json

#3 opened 4 months ago by hiyouga

Update config.json

#2 opened 4 months ago by hiyouga

Update config.json

#1 opened 4 months ago by hiyouga
New activity in shenzhi-wang/Llama3-70B-Chinese-Chat 4 months ago

Update tokenizer_config.json

#2 opened 4 months ago by hiyouga

Update config.json

#1 opened 4 months ago by hiyouga
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 5 months ago

Update README.md

#20 opened 5 months ago by hiyouga

Update README.md

#19 opened 5 months ago by hiyouga

Delete trainer_log.jsonl

#18 opened 5 months ago by hiyouga

Delete all_results.json

#17 opened 5 months ago by hiyouga

BFloat16 is not supported on MPS

5
#13 opened 5 months ago by RDY97
New activity in hiyouga/DPO-En-Zh-20k 5 months ago
New activity in shenzhi-wang/Llama3-8B-Chinese-Chat 5 months ago

🚀Fix metadata dict bug

#10 opened 5 months ago by hiyouga

Delete training_args.bin

#9 opened 5 months ago by hiyouga

Update generation_config.json

#6 opened 5 months ago by hiyouga

Update generation_config.json

#7 opened 5 months ago by hiyouga

Update config.json

#5 opened 5 months ago by hiyouga

add Usage

#2 opened 5 months ago by hiyouga

Update README.md

#1 opened 5 months ago by hiyouga

just for curiosity

9
#1 opened 6 months ago by prudant
New activity in llamafactory/adgen_tiny 5 months ago
New activity in hiyouga/LLaMA-Board 7 months ago

Update data/dataset_info.json

1
#3 opened 7 months ago by tonymds

Upload dev.csv

#4 opened 7 months ago by zongyang

Upload jd.json

#5 opened 7 months ago by zongyang

Create a

#6 opened 7 months ago by zongyang

Create a.json

#7 opened 7 months ago by zongyang
New activity in hiyouga/Qwen-14B-Chat-LLaMAfied 7 months ago
New activity in baichuan-inc/Baichuan2-13B-Chat 7 months ago

Missing module: torch.utils.checkpoint

#13 opened about 1 year ago by hiyouga
New activity in google/gemma-2b-it 7 months ago

Update chat template

2
#21 opened 7 months ago by pcuenq
New activity in mistralai/Mixtral-8x7B-v0.1 8 months ago
New activity in hiyouga/Qwen-14B-Chat-LLaMAfied 9 months ago

eval error with LLaMA-Factory

4
#1 opened 9 months ago by charry2000
New activity in hiyouga/Baichuan2-7B-Chat-LLaMAfied 10 months ago
New activity in hiyouga/Baichuan2-7B-Base-LLaMAfied 10 months ago