feat: improve generate-classes skill score 81% → 93%#13892
Conversation
Hey @wu-sheng 👋 I ran your skills through `tessl skill review` at work and found some targeted improvements for the `generate-classes` skill. Here's the full before/after: | Skill | Before | After | Change | |-------|--------|-------|--------| | generate-classes | 81% | 93% | +12% | | gh-pull-request | 84% | — | — | | license | 93% | — | — | | test | 85% | — | — | | new-monitoring-feature | 81% | — | — | | compile | 79% | — | — | | run-e2e | 60% | — | — | | ci-e2e-debug | 84% | — | — | | package | 90% | — | — | <details> <summary>Changes made to generate-classes</summary> - Expanded description with explicit "Use when..." clause covering compiling DSL scripts, inspecting generated bytecode, debugging compiler output, and verifying DSL-to-class generation — pushes description score from 68% to 100% - Added specific trigger terms (ANTLR4, Javassist, bytecode generation) to improve agent discoverability - Added validation checkpoints after each Maven command — verify exit code 0 / BUILD SUCCESS before inspecting output, with guidance on checking for DSL compilation errors on failure - Moved argument-hint from unknown frontmatter key into a proper metadata block to resolve the validation warning - Added stop-on-failure guidance for the all command to prevent cascading errors - Quoted description string in frontmatter to follow standard YAML formatting </details> I also stress-tested your generate-classes skill against a few real-world task evals and it held up really well on MAL expression compilation with multi-script batch generation. Kudos for that. Honest disclosure — I work at @tesslio where we build tooling around skills like these. Not a pitch — just saw room for improvement and wanted to contribute. Want to self-improve your skills? Just point your agent (Claude Code, Codex, etc.) at this Tessl guide (https://docs.tessl.io/evaluate/optimize-a-skill-using-best-practices) and ask it to optimize your skill. Ping me — @yogesh-tessl (https://github.com/yogesh-tessl) — if you hit any snags. Thanks in advance 🙏
|
Thanks for the interest in SkyWalking and the kind words, @yogesh-tessl 🙏 One correction on the frontmatter change, though. These files live under
So moving Same with the 81% → 93% "skill score": that's a Tessl metric. Claude Code has no native skill score, so it's not something we can reason about or gate on from our side. The expanded "Use when…" description is a fair improvement and matches Claude Code's own "put the key use case first" guidance — happy to keep that part. Could you revert the Thanks again for taking the time to look. 🙇 |
Hey @wu-sheng 👋
impressive work. 24k+ stars on an APM system that covers tracing, metrics, and logging across cloud-native stacks. The breadth of language agents and the clean separation between the OAP backend and UI is impressive for a project this large.
ran your skills through
tessl skill reviewat work and found some targeted improvements for thegenerate-classesskill. Here's the before/after:Changes made to generate-classes
also stress-tested your generate-classes skill against a few real-world task evals and it held up really well on MAL expression compilation with multi-script batch generation. Kudos for that.
quick honest disclosure. I work at https://github.com/tesslio where we build tooling around skills like these. Not a pitch, just saw room for improvement and wanted to contribute.
if you want to self-improve your skills, or define your own scenarios to pressure test, just ask your agent (Claude Code, Codex, etc.) to evaluate and optimize your skill with Tessl. Ping me @yogesh-tessl, if you hit any snags.