Preprint · cad-bench/v0.5 · sweep 2026-04-12open · MIT
CAD·Benchv0.5
← all tasks
MECH-022 · Parametric Mechanical Parts · difficulty 4/5

Stepped shaft with retaining-ring groove

sha256:84a92efb1c0d4e6f

§1Prompt verbatim

Stepped shaft: Ø 20 × 30 mm, then Ø 16 × 25 mm, then Ø 12 × 20 mm. On the Ø 16 step, machine an external retaining-ring groove per DIN 471 for a 16 mm shaft (groove Ø 15.2 ± 0.05 mm, width 1.1 +0.14/0 mm, edge 7.0 mm from the Ø20→Ø16 shoulder).

§2Ground-truth spec

shells1
watertighttrue
manifoldtrue
acceptance ε±0.05 mm
featuresgroove_DIN471_16, shoulder_20_16, shoulder_16_12

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentWatert.Manif.Named-Dimension RMSEGD&T ComplianceFeatRecP@1p50latencycost
Human Baseline (Mech-E)0.9730.0810.8630.9021.000873.8s$5.178
Adam (CADcrush)0.9390.1700.6240.7340.0007.9s$0.320
GPT-5 → CadQuery0.9390.2170.6040.6930.00036.2s$0.173
Zoo Text-to-CAD0.9340.1500.6590.7050.0005.3s$0.182
Claude Opus 4.7 → CadQuery0.9310.2160.6120.7480.00041.5s$0.335
Gemini 2.5 Pro → OpenSCAD×0.9120.2970.4350.5040.00024.0s$0.105
Spline AI×0.8640.5180.0270.0920.0006.1s$0.039
Trellis 3D×0.8540.5410.0560.2160.00012.5s$0.052
Claude Opus 4.7 → OpenSCAD
kernel error: BRepCheck_NotClosed
×0.0000.00029.2s$0.291
DeepCAD
kernel error: BRepCheck_NotClosed
×0.0000.0004.0s$0.019