Trl

Trl taxonomy generated by the site skill importer.

1 skills

huggingface-llm-trainer

bởi huggingface

huggingface-llm-trainer giúp bạn huấn luyện hoặc fine-tune các mô hình ngôn ngữ và thị giác trên Hugging Face Jobs bằng TRL hoặc Unsloth. Dùng skill huggingface-llm-trainer cho SFT, DPO, GRPO, reward modeling, kiểm tra dataset, chọn GPU, lưu lên Hub, theo dõi bằng Trackio và xuất GGUF cho các quy trình phát triển backend.

Backend Development

Yêu thích 0GitHub 10.4k

Trl