Preprint · cad-bench/v0.5 · sweep 2026-04-12open · MIT
CAD·Benchv0.5
← all tasks
KIN-002 · Kinematic Mechanisms · difficulty 5/5

Four-bar linkage, Grashof crank-rocker

sha256:611ac9802d310bff

§1Prompt verbatim

Planar four-bar linkage with link lengths a=20, b=80, c=60, d=70 mm (Grashof crank-rocker). Build the four bodies and the three revolute joints — output one assembly STEP. Acceptance: simulate one full crank revolution; rocker angular sweep must be 70 ± 1° and no body interpenetrates at any point.

§2Ground-truth spec

shells4
watertighttrue
manifoldtrue
acceptance ε±0.05 mm
featuresrevolute_joint_x4

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentWatert.Manif.FeatRecMating ClearanceParamEditP@1p50latencycost
Human Baseline (Mech-E)0.9570.8850.8030.8390.000808.6s$4.866
Claude Opus 4.7 → CadQuery0.9360.7150.6670.8100.00041.9s$0.401
Zoo Text-to-CAD0.9420.7700.6490.6570.0006.8s$0.149
GPT-5 → CadQuery0.9300.6590.5750.7680.00037.1s$0.217
Adam (CADcrush)0.9270.7340.6340.7010.00010.5s$0.239
Claude Opus 4.7 → OpenSCAD×0.9070.5450.5460.6150.00037.3s$0.359
DeepCAD×0.9000.4440.4490.2980.0005.5s$0.016
Gemini 2.5 Pro → OpenSCAD×0.8940.4870.4530.6150.00035.7s$0.094
Spline AI×0.8540.0970.0440.0430.0007.0s$0.042
Trellis 3D
kernel error: BRepCheck_NotClosed
×0.0000.00010.8s$0.046