Commit Graph

2 Commits

Author SHA1 Message Date
kazuki
06e7c003e9 feat: migrate research and index-repo to plugin, delete all slash commands
## Plugin Migration
Added to pm-agent plugin:
- /research: Deep web research with adaptive planning
- /index-repo: Repository index (94% token reduction)
- Total: 3 commands (pm, research, index-repo)

## Slash Commands Deleted
Removed all 27 slash commands from ~/.claude/commands/sc/:
- analyze, brainstorm, build, business-panel, cleanup
- design, document, estimate, explain, git, help
- implement, improve, index, load, pm, reflect
- research, save, select-tool, spawn, spec-panel
- task, test, troubleshoot, workflow

## Architecture Change
Strategy: Minimal start with PM Agent orchestration
- PM Agent = orchestrator (統括コマンダー)
- Task tool (general-purpose, Explore) = execution
- Plugin commands = specialized tasks when needed
- Avoid reinventing the wheel (use official tools first)

## Files Changed
- .claude-plugin/plugin.json: Added research + index-repo
- .claude-plugin/commands/research.md: Copied from slash command
- .claude-plugin/commands/index-repo.md: Copied from slash command
- ~/.claude/commands/sc/: DELETED (all 27 commands)

## Benefits
 Minimal footprint (3 commands vs 27)
 Plugin-based distribution
 Version control
 Easy to extend when needed

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-21 14:07:01 +09:00
kazuki
373c313033 feat: PM Agent plugin architecture with confidence check test suite
## Plugin Architecture (Token Efficiency)
- Plugin-based PM Agent (97% token reduction vs slash commands)
- Lazy loading: 50 tokens at install, 1,632 tokens on /pm invocation
- Skills framework: confidence_check skill for hallucination prevention

## Confidence Check Test Suite
- 8 test cases (4 categories × 2 cases each)
- Real data from agiletec commit history
- Precision/Recall evaluation (target: ≥0.9/≥0.85)
- Token overhead measurement (target: <150 tokens)

## Research & Analysis
- PM Agent ROI analysis: Claude 4.5 baseline vs self-improving agents
- Evidence-based decision framework
- Performance benchmarking methodology

## Files Changed
### Plugin Implementation
- .claude-plugin/plugin.json: Plugin manifest
- .claude-plugin/commands/pm.md: PM Agent command
- .claude-plugin/skills/confidence_check.py: Confidence assessment
- .claude-plugin/marketplace.json: Local marketplace config

### Test Suite
- .claude-plugin/tests/confidence_test_cases.json: 8 test cases
- .claude-plugin/tests/run_confidence_tests.py: Evaluation script
- .claude-plugin/tests/EXECUTION_PLAN.md: Next session guide
- .claude-plugin/tests/README.md: Test suite documentation

### Documentation
- TEST_PLUGIN.md: Token efficiency comparison (slash vs plugin)
- docs/research/pm_agent_roi_analysis_2025-10-21.md: ROI analysis

### Code Changes
- src/superclaude/pm_agent/confidence.py: Updated confidence checks
- src/superclaude/pm_agent/token_budget.py: Deleted (replaced by /context)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-21 13:31:28 +09:00