Run With Full Evidence
Capture live TUI, terminal cast replay, and session JSONL so every score has inspectable proof.
Claude Code Evaluation Platform
Build tasksets, run Claude in a captured terminal session, and inspect completion, efficiency, and capability evidence in one report flow.
CLI Workflow
$ npm i -g @skillscore/cli
$ ssc doctor
$ ssc start --setup
$ ssc publish --run-id <id>
Latest report: changelog-automation · 2026-02-15 06:57:36.653Z
Completion, duration, token usage, and tool-call quality are computed from session events.
Use run and task trend views to find regressions, compare versions, and iterate on SKILL.md quality.
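As a rough illustration of how metrics like these could be derived from session JSONL, the sketch below reads one event per line and aggregates duration, tokens, tool success, and completion. The event schema is an assumption made for this example: the field names `ts`, `type`, `ok`, `passed`, and `tokens` are hypothetical and are not taken from the platform's actual session format, and the sample events are made up.

```python
import json
from datetime import datetime


def summarize_session(jsonl_text):
    """Aggregate summary metrics from JSONL session events.

    Assumed (hypothetical) schema: each line is a JSON object with an
    ISO 8601 "ts", a "type", an optional "tokens" count, and, depending
    on the type, an "ok" or "passed" boolean.
    """
    events = [json.loads(line) for line in jsonl_text.splitlines() if line.strip()]
    if not events:
        return {"duration_s": 0.0, "tokens": 0, "tool_success": None, "completion": None}

    # Duration: span between the earliest and latest event timestamps.
    # "Z" is rewritten so fromisoformat also works on Python < 3.11.
    times = [datetime.fromisoformat(e["ts"].replace("Z", "+00:00")) for e in events]
    duration_s = (max(times) - min(times)).total_seconds()

    # Tokens: sum of per-event token counts (missing counts are treated as 0).
    tokens = sum(e.get("tokens", 0) for e in events)

    # Tool success: share of tool results flagged ok.
    tool_results = [e for e in events if e.get("type") == "tool_result"]
    tool_success = (
        sum(1 for e in tool_results if e.get("ok")) / len(tool_results)
        if tool_results else None
    )

    # Completion: share of finished tasks flagged as passed.
    tasks = [e for e in events if e.get("type") == "task_end"]
    completion = (
        sum(1 for e in tasks if e.get("passed")) / len(tasks)
        if tasks else None
    )

    return {
        "duration_s": duration_s,
        "tokens": tokens,
        "tool_success": tool_success,
        "completion": completion,
    }


# Made-up sample events, for illustration only.
sample = "\n".join([
    '{"ts": "2026-01-01T00:00:00.000Z", "type": "tool_result", "ok": true, "tokens": 1200}',
    '{"ts": "2026-01-01T00:00:30.000Z", "type": "tool_result", "ok": false, "tokens": 800}',
    '{"ts": "2026-01-01T00:01:00.000Z", "type": "task_end", "passed": true, "tokens": 500}',
])
summary = summarize_session(sample)
print(summary)
```

Returning `None` when a run has no tool results or no finished tasks keeps "no data" distinct from a genuine 0% score.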
2026-02-15 06:57:36.653Z
Automate changelog generation from commits, PRs, and releases following Keep a Changelog format. Use when setting up release workflows, generating release notes, or standardizing commit conventions.
Completion: 100% · Duration: 13m 19s · Tokens: 1,930,868 · Tool success: 93%
2026-02-15 00:18:49.813Z
No description available
Completion: 100% · Duration: 10m 17s · Tokens: 1,919,030 · Tool success: 95%
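The two report summaries above can be compared by hand in the same spirit as a trend view. The sketch below computes the change from the earlier run (00:18:49) to the later one (06:57:36) using only the figures shown in the reports. It assumes both runs belong to the same taskset, which the page does not state, and the helper names `parse_duration` and `compare` are hypothetical, not part of the `ssc` CLI.

```python
import re


def parse_duration(text):
    """Convert a duration string such as '13m 19s' into seconds."""
    parts = {unit: int(num) for num, unit in re.findall(r"(\d+)\s*([hms])", text)}
    return parts.get("h", 0) * 3600 + parts.get("m", 0) * 60 + parts.get("s", 0)


def compare(earlier, later):
    """Return later-minus-earlier deltas for each reported metric."""
    return {
        "duration_s": parse_duration(later["duration"]) - parse_duration(earlier["duration"]),
        "tokens": later["tokens"] - earlier["tokens"],
        "tool_success_pts": later["tool_success"] - earlier["tool_success"],
    }


# Figures copied from the two report summaries.
earlier = {"duration": "10m 17s", "tokens": 1_919_030, "tool_success": 95}
later = {"duration": "13m 19s", "tokens": 1_930_868, "tool_success": 93}

delta = compare(earlier, later)
print(delta)
```

Under that assumption, the later run took about three minutes longer, used slightly more tokens, and lost two points of tool success while completion stayed at 100%.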