MCPipeChekr — The MCP Testing & Fix Harness

Built with:
opencode • oh-my-openagent • mcp-tef • MCP

🚀 The Problem

MCP server tool calls are unreliable across models and runs. LLMs often pick sub-optimal tools, miss parameters, or invent workarounds (like iterating date ranges) that inflate token costs. These regressions are invisible until they hit production, and there is no structured way to measure efficiency or automate the fix loop.

MCPipeChekr solves this by providing a multi-phase evaluation harness that benchmarks tool-call behavior against a ground-truth baseline and drives an agentic coding loop to fix detected bugs automatically.

🛠 Quick Start (MVP)

Clone the repository:

git clone https://github.com/thomasmaerz/mcpipechekr.git
cd mcpipechekr

Install dependencies: Ensure opencode, oh-my-openagent, and mcp-tef are installed and in your PATH.
Configure your MCP server: Edit config.yaml to point to your emailindex (or other MCP) server.
Define your tasks: Add test prompts to tasks.yaml.
Generate a baseline:
```
./harness.sh --regenerate-baseline
```
Run the harness loop:
```
./harness.sh
```
Review & Approve: Check the Phase 2 findings in your CLI, then type approve to let the agent fix the code.

🏗 Architecture & Docs

For detailed documentation on the phase pipeline, data schemas, and configuration, visit our GitHub Wiki.

Phase 0: Ground Truth Generation
Phase 0.5: Tool Description Linting (mcp-tef)
Phase 1: Blind Execution (Trace Capture)
Phase 2: Trace Evaluation (Efficiency & Correctness)
Phase 3: Agentic Fix Loop (Git Commit)

📊 KPIs

Efficiency Ratio: Target ≤ 1.3 (Observed vs. Optimal calls)
Correctness: Target ≥ 95% match against baseline
Token Cost Gap: Monotonically decreasing across fix loops

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MCPipeChekr — The MCP Testing & Fix Harness

🚀 The Problem

🛠 Quick Start (MVP)

🏗 Architecture & Docs

📊 KPIs

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

MCPipeChekr — The MCP Testing & Fix Harness

🚀 The Problem

🛠 Quick Start (MVP)

🏗 Architecture & Docs

📊 KPIs

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages