Preprint · cad-bench/v0.5 · sweep 2026-04-12open · MIT
CAD·Benchv0.5
← all tasks
STD-002 · Standards Compliance · difficulty 4/5

ISO 4762 M8×30 socket-head cap screw

sha256:7af0ce123b984091

§1Prompt verbatim

Model an ISO 4762 M8 × 30 mm socket-head cap screw, property class 12.9, with a fully-formed thread (ISO 261 6g) and a hexagonal socket sized to take a 6 mm hex key. Head Ø 13, head height 8, fillet under-head R 0.4 mm.

§2Ground-truth spec

shells1
watertighttrue
manifoldtrue
acceptance ε±0.05 mm
featuresM8x1.25_thread, hex_socket_6mm, underhead_R0.4

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentWatert.Manif.GD&T ComplianceStandards ComplianceP@1p50latencycost
Human Baseline (Mech-E)0.9520.9000.9440.000518.4s$4.932
Adam (CADcrush)0.9470.6470.7210.0009.7s$0.247
GPT-5 → CadQuery0.9360.5570.6860.00051.2s$0.183
Claude Opus 4.7 → CadQuery0.9370.6690.7200.00041.4s$0.285
Zoo Text-to-CAD0.9320.6420.7520.0005.3s$0.196
Claude Opus 4.7 → OpenSCAD0.9230.4540.5780.00037.5s$0.268
Gemini 2.5 Pro → OpenSCAD×0.9110.4520.5050.00028.3s$0.089
DeepCAD×0.8810.3620.2620.0004.5s$0.024
Trellis 3D×0.8500.0540.0470.00010.4s$0.049
Spline AI×0.8500.0270.0190.0009.6s$0.045