Project workspaces
Create isolated project folders for tokenizers, configs, checkpoints, logs, caches, and datasets.
Haiku Studio
BY ROOTCOMPUTER
GitHub
Haiku Studio wraps our h2 Python training engine in a local Electron interface for project management, tokenizer building, pretraining, SFT, DPO, checkpoint testing, and kernel-level runtime visibility.
0001 LOG [project] Loaded tokenizer into data/tokenizer.json
0002 LOG [train] eval_loss=2.184 checkpoint saved
0003 WARN [tokenizer] RAM guard enabled
The desktop app stays close to the Python training scripts while making the project lifecycle visible and manageable.
Create isolated project folders for tokenizers, configs, checkpoints, logs, caches, and datasets.
Build a BPE tokenizer from one file or an entire corpus folder with RAM-aware input sampling.
Launch pretraining runs, monitor metrics, save checkpoints, and resume from project-scoped artifacts.
Fine-tune on user/bot dialogue data with completion-only masking and project-local outputs.
Train from prompt/chosen/rejected preference pairs using a frozen reference checkpoint.
Stream process output into the app with warning and error highlighting for serious runtime issues.
Tokenizer, project config, checkpoints, logs, cache, and datasets.
Active project tokenizer and config are staged for the Python backend scripts.
Results are written back to the active project rather than scattered globally.
Every tab maps to a real engine workflow: project setup, tokenizer creation, training, alignment, chat testing, and export.
0001LOG[train] saved projects/haiku_studio/checkpoints/model.pt
0002WARN[tokenizer] corpus sampled under RAM guard
0003ERR[runtime] serious errors surface immediately
Test the best checkpoint from the active project without leaving the workspace.
Use the installer for the desktop workspace, or clone the repository and run the Python scripts directly.