
fix: improve command descriptions for LLM tool discoverability (round 2) #65

Merged
CodesMcCabe merged 2 commits into main from cm/more-eval
Apr 16, 2026

@CodesMcCabe
Collaborator

Summary

Targeted fixes for the 3 eval failures (transfers, portfolio, webhooks-create) from run cli-anthropic-1776364842452 (91.4% → target ≥97%). Six command descriptions change:

  • alchemy wallet address — added explicit deflections to transfers, portfolio tokens, and nfts so "wallet" queries don't incorrectly land here
  • alchemy transfers — rewritten to lead with "transfer history" + USE WHEN pattern (was being confused with wallet address)
  • alchemy balance — now deflects to alchemy portfolio tokens for multi-network/portfolio queries (was absorbing portfolio queries)
  • alchemy portfolio tokens — leads with "cross-network portfolio" signal + USE WHEN pattern
  • alchemy webhooks create — amplified write/create signal with "(write operation)" + USE WHEN/DOES NOT contrast
  • alchemy webhooks list — strengthened read-only framing to contrast with create
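
The leading signal, USE WHEN pattern, and deflections listed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code or wording from this PR: `CommandDescription`, `renderDescription`, and the example text for `alchemy balance` are all hypothetical.

```typescript
// Hypothetical shape for a command description; not the CLI's actual types.
interface CommandDescription {
  lead: string; // leading signal, placed first so it is seen first
  useWhen: string[]; // kinds of queries this command should match
  deflections: Record<string, string>; // topic -> command to use instead
}

// Render the pieces as one description string, leading signal first.
function renderDescription(d: CommandDescription): string {
  const parts: string[] = [d.lead];
  if (d.useWhen.length > 0) {
    parts.push(`USE WHEN: ${d.useWhen.join("; ")}.`);
  }
  for (const [topic, command] of Object.entries(d.deflections)) {
    parts.push(`DOES NOT cover ${topic}; use \`${command}\` instead.`);
  }
  return parts.join(" ");
}

// Illustrative wording only (assumed, not copied from the PR diff).
const balanceExample: CommandDescription = {
  lead: "Get the balance of an address on a single network.",
  useWhen: ["the user asks for one address's balance on one network"],
  deflections: {
    "multi-network or portfolio queries": "alchemy portfolio tokens",
  },
};

console.log(renderDescription(balanceExample));
```

Keeping the deflection as an explicit "use X instead" clause gives the model a named alternative rather than only a negative signal, which appears to be the intent of the balance → portfolio tokens change.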

Test plan

  • pnpm run build passes (verified locally)
  • Re-run eval suite against updated build — expect transfers, portfolio, and webhooks-create to pass
  • Spot-check --help output for each changed command
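
For the spot-check step, a small sketch like the one below could enumerate the `--help` invocations to review. The command list comes from the Summary; `changedCommands` and `helpInvocation` are hypothetical names, and the script only prints the command lines rather than running the CLI.

```typescript
// Commands whose descriptions changed, as listed in the Summary.
const changedCommands: string[] = [
  "wallet address",
  "transfers",
  "balance",
  "portfolio tokens",
  "webhooks create",
  "webhooks list",
];

// Build the argv for one --help invocation (hypothetical helper).
function helpInvocation(command: string): string[] {
  return ["alchemy", ...command.split(" "), "--help"];
}

// Print one command line per changed command for manual review.
for (const command of changedCommands) {
  console.log(helpInvocation(command).join(" "));
}
```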

🤖 Generated with Claude Code

Target three eval failures (transfers 0%, portfolio 0%, webhooks-create 67%):
- wallet address: add explicit deflections to transfers/portfolio/nfts
- transfers: lead with "transfer history", add USE WHEN pattern
- balance: deflect to portfolio tokens for multi-network queries
- portfolio tokens: lead with cross-network signal, add USE WHEN pattern
- webhooks create: amplify write/create signal with (write operation)
- webhooks list: strengthen read-only contrast with create

Co-Authored-By: Claude Sonnet 4.6 (1M context) <[email protected]>
CodesMcCabe requested a review from a team as a code owner on April 16, 2026 20:18
Co-Authored-By: Claude Sonnet 4.6 (1M context) <[email protected]>
CodesMcCabe merged commit 58e1249 into main on Apr 16, 2026
12 checks passed
CodesMcCabe deleted the cm/more-eval branch on April 16, 2026 20:23
