High-fidelity Claude runs
Run local Claude Code with terminal TUI capture, live stream, and deterministic replay.
Claude Code Evaluation Platform
Build tasksets, run Claude in a captured terminal session, and inspect completion, efficiency, and capability evidence in one report flow.
Completion, duration, token usage, and tool behavior are parsed directly from Claude session JSONL.
Each run persists a report snapshot and a final report for review, download, and comparison.
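As a rough illustration of how token, duration, and tool metrics might be derived from a session JSONL file, here is a minimal Python sketch. The record layout is an assumption, not the platform's documented schema: it assumes each line is a JSON object with an ISO 8601 `timestamp`, an optional `message.usage` object holding integer token counts, and `tool_result` blocks in `message.content` that carry an `is_error` flag. The function name `summarize_session` and the list of usage keys are hypothetical.

```python
import json
from datetime import datetime

# Assumed usage keys; a real session file may use different or additional fields.
TOKEN_KEYS = (
    "input_tokens",
    "output_tokens",
    "cache_creation_input_tokens",
    "cache_read_input_tokens",
)


def summarize_session(jsonl_lines):
    """Aggregate token, duration, and tool metrics from session records."""
    tokens = 0
    tool_results = 0
    tool_errors = 0
    timestamps = []

    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)

        ts = record.get("timestamp")
        if isinstance(ts, str):
            # Normalize a trailing "Z" so older Python versions can parse it.
            timestamps.append(datetime.fromisoformat(ts.replace("Z", "+00:00")))

        message = record.get("message")
        if not isinstance(message, dict):
            continue

        usage = message.get("usage") or {}
        tokens += sum(int(usage.get(key) or 0) for key in TOKEN_KEYS)

        content = message.get("content")
        if isinstance(content, list):
            for block in content:
                if isinstance(block, dict) and block.get("type") == "tool_result":
                    tool_results += 1
                    if block.get("is_error"):
                        tool_errors += 1

    duration = (max(timestamps) - min(timestamps)).total_seconds() if timestamps else 0.0
    # Leave the rate undefined when the session recorded no tool results.
    success = (tool_results - tool_errors) / tool_results if tool_results else None

    return {
        "tokens": tokens,
        "duration_seconds": duration,
        "tool_results": tool_results,
        "tool_success": success,
    }
```

Calling `summarize_session(open(path))` on a session file would return a dictionary with the same kinds of figures the run cards display. How the platform itself decides completion is not specified here, so the sketch leaves that metric out.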
2026-02-15 06:57:36.653Z
Automate changelog generation from commits, PRs, and releases following Keep a Changelog format. Use when setting up release workflows, generating release notes, or standardizing commit conventions.
Completion: 100%
Duration: 13m 19s
Tokens: 1,930,868
Tool success: 93%
2026-02-15 00:18:49.813Z
No description available
Completion: 100%
Duration: 10m 17s
Tokens: 1,919,030
Tool success: 95%