Upload Harbor agent runs. Visualize benchmark tasks in an interactive embedding space. Understand how your tasks compare.
Run one command to log in via your browser, then upload Harbor job directories directly from your terminal. No API key needed — or use one for CI.
Or install globally: npm install -g trajectories-sh
For CI, use an API key: npx trajectories-sh auth login --api-key YOUR_KEY
Supports --visibility public | unlisted | private and --slug custom-name.
Push agent trajectory jobs via CLI or the web UI. Every upload automatically gets a Harbor viewer — see every step, screenshot, and tool call.
Each trajectory gets an embedded Harbor viewer with step-by-step replay, screenshots, agent logs, verifier output, and terminal recordings.
Explore Terminal Bench 2, proposed Terminal Bench 3 community tasks, and your own tasksets — all plotted in an interactive 2D and 3D embedding space.
See pass rates, run counts, and how your tasks compare to public benchmarks across tasksets.
Keep trajectories private, share via unlisted link, or publish publicly. You control who sees your data.
Trajectories are linked to their benchmark task via Harbor checksums, so you can see all runs for any task.
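As a rough illustration of what checksum-based linking makes possible, the sketch below groups runs by a task checksum and computes a run count and pass rate per task. It is not Harbor's or trajectories.sh's actual implementation: the `task_checksum` and `passed` field names, the record shape, and the use of SHA-256 as the checksum are all assumptions made for the example.

```python
import hashlib
from collections import defaultdict


def checksum_of(task_definition: bytes) -> str:
    """Stand-in checksum (SHA-256 hex digest); the real Harbor scheme may differ."""
    return hashlib.sha256(task_definition).hexdigest()


def group_runs_by_task(runs):
    """Group run records by their (hypothetical) 'task_checksum' field."""
    grouped = defaultdict(list)
    for run in runs:
        grouped[run["task_checksum"]].append(run)
    return dict(grouped)


def summarize(runs):
    """Return {checksum: (run_count, pass_rate)} for every task that has runs."""
    summary = {}
    for checksum, task_runs in group_runs_by_task(runs).items():
        passed = sum(1 for run in task_runs if run["passed"])
        summary[checksum] = (len(task_runs), passed / len(task_runs))
    return summary


# Hypothetical example data: two tasks, three runs in total.
task_a = checksum_of(b"task A definition")
task_b = checksum_of(b"task B definition")
example_runs = [
    {"task_checksum": task_a, "passed": True},
    {"task_checksum": task_a, "passed": False},
    {"task_checksum": task_b, "passed": True},
]
example_summary = summarize(example_runs)
```

Because the key is derived from the task content rather than from a name, two uploads of the same task land in the same group even if their job directories are named differently, which is the property the linking described above relies on.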
Sign in to get an API key, upload trajectory jobs from the web UI, link private GitHub repos for automatic task ingestion, and manage visibility for everything you upload.