Projects
EvalSys has launched MCPMark, with more exciting projects on the way.
MCPMark
Stress-Testing Comprehensive MCP Use
An evaluation suite for agentic models in real MCP tool environments (Notion / GitHub / Filesystem / Postgres / Playwright).
MCPMark provides a reproducible, extensible benchmark for researchers and engineers: one-command tasks, isolated sandboxes, auto-resume for failures, unified metrics, and aggregated reports.
MCPMark will continuously update emerging MCP Servers to stay in step with the vibrant ecosystem!