User Guide

The guides in this section cover TuFT's fine-tuning and reinforcement learning workflows, along with server persistence, observability, and the console.

Chat SFT

Supervised fine-tuning on chat-formatted data with assistant-only loss masking.

Countdown RL

Reinforcement learning with GRPO-style training on verifiable tasks.

Persistence

Enable server state persistence with Redis for crash recovery.

Observability

OpenTelemetry integration for tracing, metrics, and logs.

Console

Dashboard for monitoring training runs and checkpoints, with a sampling playground.