Preprint · cad-bench/v0.5 · sweep 2026-04-12 · open · MIT
CAD·Bench v0.5

GPT-5 → CadQuery

A · Engineering-Capable
composite 67.1 [63.7, 70.6] · p5 52.5 · IRT θ 76.8
OpenAI + CadQuery 2.4 · vgpt-5 + cadquery 2.4 · released 2026-03-08 · LLM+CadQuery · emits CadQuery · proprietary · 400k ctx · $0.01/$0.04 per 1k tok

Same scaffold as the Claude pipeline for fair comparison. Self-repair budget capped at 3 attempts.
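The scaffold itself is not shown on this page. As a rough illustration of what a capped self-repair budget could look like, the sketch below runs a generated script, feeds any error text back to the generator, and stops after 3 repairs. The names `generate_with_repair`, `run_script`, and the `generate(prompt, feedback)` callable are hypothetical, and reading "3 attempts" as 3 repairs after the first draft is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, Optional

MAX_REPAIR_ATTEMPTS = 3  # from the card: self-repair budget capped at 3


@dataclass
class RunResult:
    code: str              # last script produced by the generator
    error: Optional[str]   # None means the script executed without raising
    repairs_used: int      # number of repair rounds consumed


def run_script(code: str) -> Optional[str]:
    """Execute a generated script; return the error text, or None on success."""
    try:
        exec(compile(code, "<generated>", "exec"), {})
        return None
    except Exception as exc:  # syntax and runtime errors are both reported
        return f"{type(exc).__name__}: {exc}"


def generate_with_repair(
    prompt: str,
    generate: Callable[[str, Optional[str]], str],  # hypothetical model wrapper
    max_repairs: int = MAX_REPAIR_ATTEMPTS,
) -> RunResult:
    """First draft plus at most `max_repairs` error-driven retries (assumed reading)."""
    code = generate(prompt, None)
    error = run_script(code)
    repairs = 0
    while error is not None and repairs < max_repairs:
        repairs += 1
        code = generate(prompt, error)  # feed the error back to the generator
        error = run_script(code)
    return RunResult(code=code, error=error, repairs_used=repairs)
```

In a real pipeline `generate` would wrap the model call and `run_script` would run in a sandbox and also validate the resulting solid. Both are simplified here.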

[Figure: layer scores (L1 Geom, L2 Eng, L3 Mfg, L4 Cog) vs. human. Solid: agent; dashed: human baseline.]

PER-CATEGORY SCORE (95 % CI)

Geometric Primitives: 87.5
Boolean Robustness: 74.5
BREP Fidelity: 62.6
Free-form Surfaces: 84.5
Parametric Mechanical Parts: 68.7
Assembly & Mating: 53.7
Standards Compliance: 62.1
Sheet-Metal Bodies: 71.2
Sealing-Groove Design: 73.8
Kinematic Mechanisms: 66.7
DFM · 3-Axis CNC: 69.0
DFM · Injection Mould: 57.2
DFM · FDM 3D Print: 76.8
CAM Toolpath Validity: 68.3
Constraint Solving & Editability: 70.8
Reverse Engineering: 66.0
2-D Sketch Constraints: 83.8
Functional Intent · FEA-Gated: 61.4
Paraphrase Robustness: 77.3
Confidence Calibration: 34.7

MEAN METRICS BY LAYER

L1 · Geometry
Columns: Category, Vol IoU, Chamfer, Hausdorff, NormCons, Watert., Manif., Euler, STEP RT, P@1
Geometric Primitives: 0.628, 0.147, 0.944
Boolean Robustness: 0.548, 0.933
BREP Fidelity: 0.918, 0.135
Free-form Surfaces: 0.206, 1.111, 0.771
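The page does not define Vol IoU or Chamfer. The sketch below shows one common reading of each, assuming solids are compared as sets of occupied voxel indices and surfaces as sampled point lists. Both function names are illustrative, and the benchmark's own sampling and normalisation may differ.

```python
import math
from typing import Iterable, Sequence, Set, Tuple

Voxel = Tuple[int, int, int]
Point = Sequence[float]


def volume_iou(a: Set[Voxel], b: Set[Voxel]) -> float:
    """Intersection-over-union of two voxelised solids (assumed representation)."""
    union = a | b
    if not union:
        raise ValueError("both solids are empty")
    return len(a & b) / len(union)


def chamfer_distance(p: Iterable[Point], q: Iterable[Point]) -> float:
    """Symmetric mean nearest-neighbour distance; one of several common variants."""
    p, q = list(p), list(q)
    if not p or not q:
        raise ValueError("point sets must be non-empty")

    def one_way(src, dst):
        # Average distance from each source point to its closest target point.
        return sum(min(math.dist(s, d) for d in dst) for s in src) / len(src)

    return 0.5 * (one_way(p, q) + one_way(q, p))
```

The brute-force nearest-neighbour search is quadratic in the number of points. A spatial index would be the usual choice for dense samples.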
L2 · Engineering
Columns: Category, Dim RMSE, GD&T, FeatRec, Mating, Fit-Cls, Std
Parametric Mechanical Parts: 0.216, 0.574, 0.701
Assembly & Mating: 0.675, 0.560, 0.376
Standards Compliance: 0.557, 0.686
Sheet-Metal Bodies: 0.234, 0.736
Sealing-Groove Design: 0.244, 0.720
Kinematic Mechanisms: 0.659, 0.575
L3 · Manufacturing
Columns: Category, DFM, Draft, Min-Wall, CAM-Reach, Supp, Wall-Unif
DFM · 3-Axis CNC: 80.500, 0.617, 0.647
DFM · Injection Mould: 0.522, 0.607, 0.588
DFM · FDM 3D Print: 79.500, 0.658, 0.295
CAM Toolpath Validity: 0.639
L4 · Cognition
Columns: Category, ParamEdit, ConSolve, P-Range, FEA-σ, Para σ, Seed σ, Brier, EditLat
Constraint Solving & Editability: 0.694, 0.745, 0.686
Reverse Engineering
2-D Sketch Constraints: 0.745
Functional Intent · FEA-Gated: 0.512
Paraphrase Robustness: 0.037, 0.054
Confidence Calibration: 0.153
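The Brier column is not defined on this page. Assuming it follows the standard definition (mean squared difference between a stated confidence and the binary pass/fail outcome, lower is better), a minimal sketch looks like this. The function name and the 0/1 outcome encoding are illustrative, not taken from the benchmark.

```python
from typing import Sequence


def brier_score(confidences: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    if len(confidences) != len(outcomes):
        raise ValueError("confidences and outcomes must have the same length")
    if not confidences:
        raise ValueError("at least one prediction is required")
    # Each term is (p - y)^2, where p is the stated confidence and y is 1 on pass.
    return sum((p - y) ** 2 for p, y in zip(confidences, outcomes)) / len(confidences)
```

Under this definition a model that always answers 0.5 scores 0.25, so values below that indicate confidences that carry some information about the outcome.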

TASK-BY-TASK ARTIFACTS

PRIM-001 · Hollow cylinder (60 × 40 × 100) · 0.678 · 0.00
PRIM-007 · Right hexagonal prism with pitch fillet · 0.578 · 0.00
BOOL-003 · Coplanar-face union (knife-edge stress) · 0.617 · 0.00
BOOL-009 · High-genus subtraction (lattice block) · 0.480 · 0.00
BREP-004 · NURBS-handle goblet (G2 swept loft) · 0.440 · 0.00
SURF-002 · Compressor blade (NACA 65-(12)10) · 0.400 · 0.00
MECH-014 · L-bracket with M6 + slotted hole · 0.625 · 0.00
MECH-022 · Stepped shaft with retaining-ring groove · 0.585 · 0.00
MECH-027 · Planetary-gear carrier plate · 0.693 · 0.00
ASM-005 · Pin-in-hole, H7/g6 sliding fit · 0.582 · 0.00
ASM-011 · Dovetail slide (60° flanks) · 0.507 · 0.00
STD-002 · ISO 4762 M8×30 socket-head cap screw · 0.579 · 0.00
SHEET-003 · U-channel bracket, 1.5 mm Al, k=0.40 · 0.556 · 0.00
SEAL-001 · AS568-214 face-seal groove · 0.731 · 0.00
KIN-002 · Four-bar linkage, Grashof crank-rocker · 0.523 · 0.00
DFMCNC-002 · 3-ax-machinable manifold block · 0.449 · 0.00
DFMMOLD-002 · Injection-mouldable enclosure half · 0.533 · 0.00
DFMFDM-008 · FDM-printable hinge (no support) · 0.571 · 0.00
CAM-001 · 5-pocket plate, Ø3 endmill finish · 0.536 · 0.00
PARAM-006 · Editable flange (bolt circle param sweep) · 0.630 · 0.00
PARAM-013 · Editable bracket (length+30 %, hole→M8) · 0.577 · 0.00
REVENG-002 · Three-view ortho → bracket · 0.442 · 0.00
REVENG-009 · Three-view ortho → housing with cores · 0.550 · 0.00
SKETCH-003 · Tangent-arc transition profile · 0.672 · 0.00
FUNC-001 · Cantilever bracket: 250 N tip load, 6061-T6, SF≥4 · 0.305 · 0.00
FUNC-007 · Heat-sink fin array for 25 W TO-220 · 0.413 · 0.00
PARA-001 · 5× paraphrased L-bracket · 0.657 · 0.00
CAL-003 · Confidence-calibrated planetary carrier · 0.321 · 0.00