Tanka

Research Model

Tanka scales the Haiku architecture to study how increased context length, additional depth, and larger pretraining budgets affect reasoning quality and coherence in mid-scale language models.

Overview

Tanka shares its core architecture with Haiku Mini but operates at higher parameter count and longer context. The goal is straightforward: understand what changes — and what breaks — when the same design runs at a larger scale with more training data.

Research Focus

Reasoning and coherence under longer context windows
Scaling behavior of the Haiku architecture beyond its base configuration
Training dynamics and curriculum effects at mid-scale
Stability and failure modes in extended multi-turn interaction

Intended Use

Tanka is an internal research model. It is not publicly accessible and is not optimized for deployment. Its role is to inform decisions about architecture and training that will eventually apply to production-oriented work.