MSc CS @ ETH Zürich · MATS 10.0 · AI Alignment

Peter Nutter

MSc Computer Science student at ETH Zürich and MATS 10.0 scholar. I work on empirical AI alignment — midtraining interventions, generalization, RL, and reward hacking.

CV Google Scholar GitHub Email

Scroll

Recent updates

MATS 10.0 Jun–Aug 2026 · Berkeley

Incoming Scholar, Technical Track. Working on extending Model Spec midtraining.

ERA Fellowship Feb–May 2026 · Cambridge

Independent empirical AI safety research on reward hacking in code models — when it emerges, whether it generalizes, and whether it can be detected.

ICML 2026 · Spotlight Position paper · main author

“Anthropomorphic Misalignment Research Needs Stronger Evidence” (spotlight + oral).

Selected research

Reinforced Exploits, Not Optimized Rewards

An ERA Fellowship study of reward hacking in code models trained with GRPO, where a model rewarded for passing tests can learn to overwrite the run_tests() function instead…

Peter Nutter

Tuesday, May 19, 2026

Pattern, Proof, Parrot

Do contemporary large language models actually reason, or do they merely stitch together statistical patterns? The current NLP debate answers both ways yet rarely clarifies…

Peter Nutter

Saturday, July 5, 2025

Scaling Quine–McCluskey with Memory‑Local Design

A high‑level account of how reordering work and fusing passes in a dense, bit‑sliced Q–M implementation turns a memory‑bound algorithm into a cache‑efficient one, yielding…

Peter Nutter, Leyla Yaayladere, Grzegorz Swiader, Gerald Prendi

Sunday, June 1, 2025

🧬 Polyadenylation Site Prediction with DNA LMs

Species-aware DNA language models (DNABERT/SpeciesLM) fine-tuned with LoRA to localize 3′-UTR poly(A) sites near base-pair resolution. Benchmarked against BPNet and…

Hanad Abdullahi, Peter Nutter, Måns Rosenbaum, Shirley Zhang

Monday, July 1, 2024