Skip to main content
The Scale Gymnasium Web UI at gym.scale.com provides a visual interface for exploring environments, running agent loops, and verifying task completion.
Prerequisites: Access to the Scale Gymnasium Web UI at gym.scale.com. Contact Scale if you don’t have access.

MCP Environment

The MCP environment provides access to 45+ MCP servers and 300+ tools for tool-based agent interactions.Select an MCP server from the left sidebar under TOOL USE (e.g., Quickbooks, Hubspot CRM, Calendar, Email, Slack).

Tools Panel

The Tools for [Server] panel on the left shows:
  • Search tools: Filter tools by name
  • Refresh: Reload the tool list
  • List of available tools with descriptions (e.g., quickbooks_create_customer, calendar_get_events)
Click a tool to view its parameters and execute it.

Server Data Panel

The Server Data panel on the right shows the database state:
  • Tabs: Switch between data tables (e.g., customers, invoices, bills, items, vendors)
  • Refresh: Reload the data
  • Hide: Collapse the panel
  • Pagination: Navigate through records with Previous/Next
Use this to inspect the current state and verify changes after tool calls.

Tips

Find tools quickly in the MCP interface:
  • Use the search box to filter by tool name
  • Select different servers from the sidebar to see their tools
  • Read tool descriptions for parameter guidance

Working with Verifiers

The Web UI provides integrated verification for Website and Desktop environments.
MCP environments do not have verifier support in the Gymnasium UI.

Running Verification

  1. Complete a task (manually or via agent)
  2. Click Execute next to the verifier
  3. View check results

Check Types

TypeDescriptionResult
State CheckVerifies database changesPass/Fail
Log CheckVerifies interaction logsPass/Fail
Rubric CheckLLM-evaluated criteriaPass/Fail/Pending

Interpreting Results

  • ✅ Passed: All checks succeeded
  • ❌ Failed: One or more checks failed (see details)
  • ⏳ Pending: Rubric checks awaiting LLM evaluation

Next Steps