Projects

EvalSys has launched MCPMark, with more exciting projects on the way.

MCPMark

Stress-Testing Comprehensive MCP Use

An evaluation suite for agentic models in real MCP tool environments (Notion / GitHub / Filesystem / Postgres / Playwright).

MCPMark provides a reproducible, extensible benchmark for researchers and engineers: one-command tasks, isolated sandboxes, auto-resume for failures, unified metrics, and aggregated reports.

MCPMark will continuously update emerging MCP Servers to stay in step with the vibrant ecosystem!