HomeExploresft

Sft Collection

Repositories tagged with "sft"

RARE

TCG-style cards with ATK/DEF/SPD stats

RARE

⭐14.6kHP

◆

🔮Psychic

★★★

ms-swift

modelscope

Pythondeepseek-r1embedding

“Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).”

★

14.6k

1.5k

14.6k

1.5k forks

ATK

DEF

SPD

GitPedia #012

3/5

View wiki →𝕏

GitPedia

Repository Card

RARE

★

14.6k

1.5k

14.6k

RARE

⭐11.5kHP

◆

💎Aqua

★★★

bisheng

dataelement

TypeScriptagentai

“BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.”

★

11.5k

1.9k

11.5k

1.9k forks

ATK

DEF

SPD

GitPedia #719

3/5

View wiki →𝕏

GitPedia

Repository Card

RARE

★

11.5k

1.9k

11.5k

RARE

⭐9.3kHP

◆

🔮Psychic

★★★

oumi

oumi-ai

Pythondpoevaluation

“Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!”

★

9.3k

779

9.3k

779 forks

ATK

DEF

SPD

GitPedia #135

3/5

View wiki →𝕏

GitPedia

Repository Card

RARE

★

9.3k

779

9.3k

RARE

⭐6.1kHP

◆

🔥Flame

★★★

AgentGuide

adongwanai

HTMLagenticragai-agent

★

6.1k

608

6.1k

608 forks

ATK

DEF

SPD

GitPedia #903

3/5

View wiki →𝕏

GitPedia

Repository Card

RARE

★

6.1k

608

6.1k

UNCOMMON

⭐2.3kHP

◆

🔮Psychic

★★

maxtext

AI-Hypercomputer

Pythondeepseekfine-tuning

“A simple, performant and scalable Jax LLM!”

★

2.3k

540

2.3k

540 forks

ATK

DEF

SPD

GitPedia #335

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

2.3k

540

2.3k

UNCOMMON

⭐1.5kHP

◆

🔮Psychic

★★

chatglm_finetuning

ssbuild

Pythonadalorachatglm

“chatglm 6b finetuning and alpaca finetuning”

★

1.5k

172

1.5k

172 forks

ATK

DEF

SPD

GitPedia #096

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

1.5k

172

1.5k

UNCOMMON

⭐1.0kHP

◆

🔮Psychic

★★

GraphGen

InternScience

Pythonai4sciencedata-generation

“GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation”

★

1.0k

81 forks

ATK

DEF

SPD

GitPedia #233

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

1.0k

UNCOMMON

⭐966HP

◆

📦Normal

★★

diy-llm

datawhalechina

Jupyter Notebookgpu-programmingllm

“🎓 系统性大语言模型构建课程｜🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程 (CUDA/Triton)、分布式训练、Scaling Laws、推理优化及对齐 (SFT/RLHF/GRPO)｜🚀 6 个渐进式作业 + 代码驱动，建立 LLM 全栈认知体系”

★

966

105

966

105 forks

ATK

DEF

SPD

GitPedia #595

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

966

105

966

UNCOMMON

⭐809HP

◆

🔮Psychic

★★

DeepSeek-671B-SFT-Guide

ScienceOne-AI

Pythondeepseek-r1llm

“An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions. (DeepSeek-V3/R1 满血版 671B 全参数微调的开源解决方案，包含从训练到推理的完整代码和脚本，以及实践中积累一些经验和结论。)”

★

809

98 forks

ATK

DEF

SPD

GitPedia #067

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

809

UNCOMMON

⭐801HP

◆

🔥Fire

★★

surogate

invergent-ai

C++cudadeep-learning

“Training/Fine-tuning at the speed of light”

★

801

5 forks

ATK

DEF

SPD

GitPedia #642

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

801

UNCOMMON

⭐657HP

◆

🔮Psychic

★★

Cornucopia-LLaMA-Fin-Chinese

jerry1993-tech

Pythonchinesefinance

“聚宝盆(Cornucopia): 中文金融系列开源可商用大模型，并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)”

★

657

67 forks

ATK

DEF

SPD

GitPedia #417

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

657

UNCOMMON

⭐635HP

◆

🔮Psychic

★★

trainable-agents

choosewhatulike

Pythonagentcharacter

“Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"”

★

635

48 forks

ATK

DEF

SPD

GitPedia #730

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

635

UNCOMMON

⭐578HP

◆

📦Normal

★★

tensorflow-nlp-tutorial

ukairia777

Jupyter Notebookbertbert-ner

“tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.”

★

578

286

578

286 forks

ATK

DEF

SPD

GitPedia #927

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

578

286

578

COMMON

⭐459HP

◆

📦Normal

★

awesome-rag

agentai

“Awesome-RAG: Collect typical RAG papers and systems.”

★

459

39 forks

ATK

DEF

SPD

GitPedia #297

1/5

View wiki →𝕏

GitPedia

Repository Card

COMMON

★

459

COMMON

⭐325HP

◆

🔮Psychic

★

Qwen3-Medical-SFT

Zeyi-Lin

Pythonfine-tuningqwen3

“Qwen3 Fine-tuning: Medical R1 Style Chat”

★

325

55 forks

ATK

DEF

SPD

GitPedia #826

1/5

View wiki →𝕏

GitPedia

Repository Card

COMMON

★

325

COMMON

⭐318HP

◆

💎Aqua

★

erc-1155

0xsequence

TypeScripterc1155ethereum

“Ethereum Semi Fungible Standard (ERC-1155)”

★

318

112

318

112 forks

ATK

DEF

SPD

GitPedia #617

1/5

View wiki →𝕏

GitPedia

Repository Card

COMMON

★

318

112

318

COMMON

⭐288HP

◆

🔮Psychic

★

LoongForge

baidu-baige

Pythonaidiffusion

“A modular, scalable, high-performance training framework for LLMs, VLMs, diffusion, and embodied models.”

★

288

32 forks

ATK

DEF

SPD

GitPedia #034

1/5

View wiki →𝕏

GitPedia

Repository Card

COMMON

★

288

COMMON

⭐253HP

◆

🔮Psychic

★

unsloth-buddy

TYH-labs

Pythonapple-siliconclaude-code

“Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export end-to-end. Part of the Gaslamp AI platform.”

★

253

14 forks

ATK

DEF

SPD

GitPedia #738

1/5

View wiki →𝕏

GitPedia

Repository Card

COMMON

★

253