Preprint · cad-bench/v0.5 · sweep 2026-04-12 · open · MIT
CAD·Bench v0.5
PARAM-006 · Constraint Solving & Editability · difficulty 3/5

Editable flange (bolt circle param sweep)

sha256:0ce7b1d445aa9088

§1 Prompt verbatim

Flange: hub Ø 30 × 20 mm, plate Ø 100 × 8 mm, six Ø 7 mm bolt holes on PCD 'D'. Build it once at D = 80 mm, then expose 'D' as a parameter. We will edit D to 70, 75, 85, 90 mm and re-evaluate.
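One way to read "expose 'D' as a parameter" is that every bolt-hole position is derived from D instead of being stored as fixed coordinates. The sketch below is a minimal, kernel-free illustration of that idea. The names `FlangeParams` and `bolt_hole_centers` are hypothetical, and placing the first hole on the +X axis is an assumption; the prompt does not fix the angular start.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FlangeParams:
    """Dimensions from the prompt, in mm. Only pcd ('D') is meant to be edited."""
    hub_d: float = 30.0
    hub_h: float = 20.0
    plate_d: float = 100.0
    plate_t: float = 8.0
    hole_d: float = 7.0
    n_holes: int = 6
    pcd: float = 80.0  # the exposed parameter 'D'


def bolt_hole_centers(p: FlangeParams) -> list[tuple[float, float]]:
    """Return hole centres equally spaced on the pitch circle.

    Assumption: the first hole lies on the +X axis.
    """
    radius = p.pcd / 2.0
    step = 2.0 * math.pi / p.n_holes
    return [
        (radius * math.cos(i * step), radius * math.sin(i * step))
        for i in range(p.n_holes)
    ]


# Re-evaluate the pattern for the build value and each edited value of D.
for d in (80.0, 70.0, 75.0, 85.0, 90.0):
    centers = bolt_hole_centers(FlangeParams(pcd=d))
    print(d, [(round(x, 3), round(y, 3)) for x, y in centers])
```

Because the centres are recomputed from `pcd` on every call, an edit to D needs no other change to the model description.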

§2 Ground-truth spec

shells: 1
watertight: true
manifold: true
acceptance ε: ±0.05 mm
parametric edits:
  D : 80 → 70 (ΔV expected 0 mm³)
  D : 70 → 75 (ΔV expected 0 mm³)
  D : 75 → 85 (ΔV expected 0 mm³)
  D : 85 → 90 (ΔV expected 0 mm³)
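The expected ΔV of 0 mm³ follows from the geometry: moving the holes along a different pitch circle relocates the removed material without changing its amount, provided each hole stays clear of the hub, the plate rim, and its neighbours. A minimal sketch of that check, under stated assumptions: the hub sits on top of the plate (its 20 mm is not taken to include the plate thickness), there is no centre bore, and the holes cut through the plate only. The function names are hypothetical.

```python
import math

# Dimensions from the prompt, in mm.
HUB_D, HUB_H = 30.0, 20.0
PLATE_D, PLATE_T = 100.0, 8.0
HOLE_D, N_HOLES = 7.0, 6

# The four edits listed in the ground-truth spec.
EDITS = [(80.0, 70.0), (70.0, 75.0), (75.0, 85.0), (85.0, 90.0)]


def holes_are_clear(pcd: float) -> bool:
    """True if every hole lies fully in the plate annulus and holes do not overlap."""
    r, hole_r = pcd / 2.0, HOLE_D / 2.0
    clears_hub = r - hole_r > HUB_D / 2.0
    inside_rim = r + hole_r < PLATE_D / 2.0
    # Distance between adjacent hole centres on the pitch circle.
    chord = 2.0 * r * math.sin(math.pi / N_HOLES)
    return clears_hub and inside_rim and chord > HOLE_D


def flange_volume(pcd: float) -> float:
    """Analytic volume in mm^3. Assumes hub on top of plate, no centre bore."""
    if not holes_are_clear(pcd):
        raise ValueError(f"holes collide with hub, rim, or each other at D={pcd}")
    plate = math.pi * (PLATE_D / 2.0) ** 2 * PLATE_T
    hub = math.pi * (HUB_D / 2.0) ** 2 * HUB_H
    holes = N_HOLES * math.pi * (HOLE_D / 2.0) ** 2 * PLATE_T
    return plate + hub - holes


for before, after in EDITS:
    delta_v = flange_volume(after) - flange_volume(before)
    print(f"D : {before:g} -> {after:g}  dV = {delta_v:.3f} mm^3")
```

Under these assumptions all four edits keep the holes between the hub (radius 15 mm) and the rim (radius 50 mm), so the volume term never depends on D. A submission whose volume changes across the sweep has most likely dropped, merged, or clipped a hole.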

§3 Reference render

canonical reference

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).
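For anyone holding a copy of the reference file, the fingerprint can be checked with a standard SHA-256 digest. This is a sketch only: it assumes the 16-hex-digit string shown on this page is a prefix of the full digest, which the page does not state, and `fingerprint_matches` is a hypothetical helper name.

```python
import hashlib


def fingerprint_matches(data: bytes, expected_prefix: str) -> bool:
    """Compare the SHA-256 hex digest of `data` against a (possibly truncated) prefix.

    Assumption: the published fingerprint is the leading part of the hex digest.
    """
    digest = hashlib.sha256(data).hexdigest()
    return digest.startswith(expected_prefix.lower())


# Usage with in-memory bytes; a real check would read the STEP file in binary mode.
payload = b"ISO-10303-21;"
print(fingerprint_matches(payload, hashlib.sha256(payload).hexdigest()[:16]))
```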

§4 Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity: the geometry is not a manifold solid even when the render reads clean.
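Why a clean-looking render can still fail the manifold test is easiest to see on a triangle mesh. The sketch below uses a common necessary condition for a closed manifold surface: every undirected edge is shared by exactly two triangles. It is an illustration, not the benchmark's scorer; it ignores vertex-manifoldness and face orientation, and the function name is hypothetical.

```python
from collections import Counter


def is_closed_edge_manifold(triangles) -> bool:
    """True if every undirected edge is used by exactly two triangles.

    `triangles` is an iterable of (i, j, k) vertex-index triples.
    This is a necessary condition only: vertex fans and orientation are not checked.
    """
    edge_count = Counter()
    for a, b, c in triangles:
        for u, v in ((a, b), (b, c), (c, a)):
            edge_count[(min(u, v), max(u, v))] += 1
    return bool(edge_count) and all(n == 2 for n in edge_count.values())


# A tetrahedron is closed; dropping one face leaves three boundary edges.
tetra = [(0, 1, 2), (0, 3, 1), (1, 3, 2), (0, 2, 3)]
print(is_closed_edge_manifold(tetra))
print(is_closed_edge_manifold(tetra[:-1]))
```

Removing a single face, as in the "non-manifold face removal" degradation described above, is enough to make the check fail even though a shaded render of the mesh may look unchanged from most viewpoints.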

§5 Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
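| Agent | Watert. | Manif. | ParamEdit | ConSolve | Parametric Range Integrity | P@1 | p50 latency | cost |
|---|---|---|---|---|---|---|---|---|
| Adam (CADcrush) | | 0.974 | 0.691 | 0.727 | 0.701 | 1.000 | 9.6s | $0.318 |
| Claude Opus 4.7 → CadQuery | | 0.973 | 0.790 | 0.838 | 0.749 | 1.000 | 33.7s | $0.358 |
| Human Baseline (Mech-E) | | 0.975 | 0.799 | 0.908 | 0.818 | 1.000 | 751.2s | $5.386 |
| GPT-5 → CadQuery | | 0.950 | 0.695 | 0.791 | 0.671 | 0.000 | 53.1s | $0.185 |
| Zoo Text-to-CAD | | 0.939 | 0.731 | 0.734 | 0.631 | 0.000 | 5.6s | $0.209 |
| Claude Opus 4.7 → OpenSCAD | | 0.936 | 0.588 | 0.629 | 0.542 | 0.000 | 29.7s | $0.352 |
| Gemini 2.5 Pro → OpenSCAD | | 0.929 | 0.568 | 0.602 | 0.545 | 0.000 | 27.0s | $0.107 |
| DeepCAD | × | 0.872 | 0.280 | 0.317 | 0.215 | 0.000 | 4.1s | $0.023 |
| Trellis 3D | × | 0.850 | 0.057 | 0.056 | 0.036 | 0.000 | 9.6s | $0.041 |
| Spline AI | × | 0.850 | 0.044 | 0.050 | 0.020 | 0.000 | 6.2s | $0.033 |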
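The ranking is by Vol IoU, which this page does not define. A common reading is intersection-over-union of occupied volume; the sketch below computes it on voxel index sets. The voxel representation, the helper names, and the convention that two empty solids score 1.0 are all assumptions made for illustration.

```python
def box_voxels(x0, x1, y0, y1, z0, z1):
    """Set of integer voxel indices filling the half-open box [x0,x1) x [y0,y1) x [z0,z1)."""
    return {
        (x, y, z)
        for x in range(x0, x1)
        for y in range(y0, y1)
        for z in range(z0, z1)
    }


def volume_iou(a: set, b: set) -> float:
    """Intersection-over-union of two voxel sets (assumed convention: empty vs empty is 1.0)."""
    union = a | b
    if not union:
        return 1.0
    return len(a & b) / len(union)


# A 10x10x10 block against the same block shifted by one voxel in x.
reference = box_voxels(0, 10, 0, 10, 0, 10)
candidate = box_voxels(1, 11, 0, 10, 0, 10)
print(round(volume_iou(reference, candidate), 4))
```

A volumetric overlap of this kind rewards getting the bulk shape right and is blind to surface topology, which is consistent with mesh-only agents keeping a high first-column score while their other metrics collapse.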