User Guide
This section collects the guides for using TuFT: two training walkthroughs (chat SFT and Countdown RL), followed by guides to persistence, observability, and the console.
Chat SFT
Supervised fine-tuning on chat-formatted data with assistant-only loss masking.
Countdown RL
Reinforcement learning with GRPO-style training on verifiable tasks.
Persistence
Enable server state persistence with Redis for crash recovery.
Observability
OpenTelemetry integration for tracing, metrics, and logs.
Console
Dashboard for monitoring training runs and checkpoints, with a sampling playground.
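
As a rough orientation for the two training guides listed above, the sketch below illustrates the two ideas their descriptions name: assistant-only loss masking and GRPO-style group-relative advantages. It is a generic, standard-library illustration and not TuFT's API. The function names, the per-token `roles` list, and the `-100` ignore value (the default `ignore_index` of PyTorch's cross-entropy loss) are assumptions made for this example, and GRPO implementations differ in details such as which standard deviation they use.

```python
from statistics import mean, pstdev

# Assumed sentinel: label positions with this value are skipped by the loss.
IGNORE_INDEX = -100


def mask_non_assistant(token_ids, roles):
    """Build labels so that only assistant tokens contribute to the loss.

    token_ids: list of int token ids for the whole conversation.
    roles: list of role names, one per token (hypothetical input format).
    """
    if len(token_ids) != len(roles):
        raise ValueError("token_ids and roles must have the same length")
    return [
        tok if role == "assistant" else IGNORE_INDEX
        for tok, role in zip(token_ids, roles)
    ]


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize the rewards of one prompt's sampled completions.

    Each completion's advantage is its reward minus the group mean,
    divided by the group standard deviation (population std here,
    plus eps to avoid division by zero when all rewards are equal).
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


if __name__ == "__main__":
    labels = mask_non_assistant(
        [11, 12, 13, 14], ["user", "user", "assistant", "assistant"]
    )
    print(labels)  # [-100, -100, 13, 14]

    # Verifiable task: reward 1.0 for a correct answer, 0.0 otherwise.
    print(group_relative_advantages([1.0, 0.0, 0.0, 1.0]))
```

In this sketch, completions that score above their group's mean receive a positive advantage and those below receive a negative one, so no separate value model is needed to form a baseline.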