User Guide

This section provides comprehensive guides for using TuFT in various scenarios.

Chat SFT

Supervised fine-tuning on chat-formatted data with assistant-only loss masking.

Chat Supervised Fine-Tuning (SFT)
Countdown RL

Reinforcement learning with GRPO-style training on verifiable tasks.

Countdown Reinforcement Learning (RL)
Persistence

Enable server state persistence with Redis for crash recovery.

Persistence
Observability

OpenTelemetry integration for tracing, metrics, and logs.

Observability (OpenTelemetry)
Console

Dashboard for monitoring training runs, checkpoints, and sampling playground.

User console