Preprint · cad-bench/v0.5 · sweep 2026-04-12open · MIT
CAD·Benchv0.5
← all tasks
CAM-001 · CAM Toolpath Validity · difficulty 3/5

5-pocket plate, Ø3 endmill finish

sha256:9c0fa1ee08b4c172

§1Prompt verbatim

Aluminium plate 100 × 60 × 12 mm with five rectangular pockets (20×20×6 mm deep) on a 2×3 grid (one corner cell empty), inside corners R 1.6 mm. The geometry must produce a collision-free 3-axis G-code program at 0.05 mm finish stepover with a Ø 3 mm endmill.

§2Ground-truth spec

shells1
watertighttrue
manifoldtrue
acceptance ε±0.05 mm
featurespocket_x5, internal_R1.6

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentWatert.Manif.FeatRecCAM ReachabilityP@1p50latencycost
Zoo Text-to-CAD0.9570.7090.7370.0006.7s$0.144
Human Baseline (Mech-E)0.9580.8870.9050.000536.7s$7.008
Adam (CADcrush)0.9410.6400.6130.00010.3s$0.224
GPT-5 → CadQuery0.9330.7280.6390.00040.4s$0.207
Claude Opus 4.7 → CadQuery0.9270.7530.6910.00035.2s$0.289
Claude Opus 4.7 → OpenSCAD0.9250.5620.4870.00039.5s$0.367
Gemini 2.5 Pro → OpenSCAD0.9160.5480.4480.00027.1s$0.086
DeepCAD
kernel error: BRepCheck_NotClosed
×0.0000.0004.6s$0.021
Trellis 3D
kernel error: BRepCheck_NotClosed
×0.0000.00011.2s$0.043
Spline AI×0.8500.0930.0440.0006.6s$0.041