Blog

Introducing Hermes 4.3 by Hieu

The latest Hermes release improves tool-call reliability, reduces refusal rates on benign prompts, and ships with a 256k context window out of the box.

Open Weights, Open Evals by Shivani

Why we publish not just our models but the full evaluation harness, seeds, and raw outputs behind every benchmark number we report.

NousCoder-14B by emozilla

Introducing our code-specialized open model, trained on a curated corpus of permissively-licensed repositories with a focus on multi-file reasoning and tool use.

Reasoning Traces as First-Class Data by Nous Research

We argue that intermediate reasoning should be stored, versioned, and trained on directly — and show empirical gains from doing so.

Field Notes on Scaling MoE Expert Parallelism by Shivani

Practical lessons from sharding a mixture-of-experts model across a churn-prone distributed cluster: routing collapse, the all-to-all bottleneck, and how we fixed both.

Distributed Training Without a Datacenter by Karan4D

A retrospective on our first cross-continental training run coordinated entirely over commodity internet links using the Psyche protocol.

Unbiased Data Synthesis by teknium

How we generate synthetic training data without baking in the biases of a single teacher model, using an ensemble-and-debate pipeline with provenance tracking.

Efficient Pretraining with Token Superposition by teknium

By packing multiple weakly-correlated training signals into superposed token streams, we squeeze more gradient information per FLOP during the early phase of pretraining.

The Next Phase of Psyche by emozilla

Psyche moves from testnet to a permissionless coordination layer, letting anyone contribute compute to distributed pretraining runs with verifiable gradient attestation.

Lighthouse Attention by Hieu

A sparse attention variant that anchors long-context reasoning to a handful of learned beacon tokens, cutting memory by 40% while preserving recall on needle-in-haystack benchmarks.

Tinker-Atropos: RL Environments at Scale by Shivani

Atropos is our framework for defining, sandboxing, and scaling reinforcement learning environments for language model agents, now integrated with the Tinker training loop.

Model Neuroscience: Dissecting Behavioral Change by Hieu

We trace how fine-tuning reshapes internal representations by probing activation manifolds before and after alignment, revealing that behavioral shifts often localize to a surprisingly small subspace.