OpenAI o4 (reasoning) → CadQuery
A · Engineering-Capablecomposite 71.4 [67.1, 75.2]p5 43.1IRT θ 100.0
OpenAI + CadQuery 2.4· vo4-2026-02 + cadquery 2.4· released 2026-02-26· LLM+CadQuery· emits CadQuery· proprietary· 256k ctx· $0.018/$0.072 per 1k tok
Reasoning model with private chain-of-thought. Self-repair often unnecessary — single-shot pass-rate is ~14 pp higher than GPT-5 on PARAM-* tasks at the cost of 3-4× wall-clock latency.
solid: agent · dashed: human baseline
PER-CATEGORY SCORE (95 % CI)
Geometric Primitives
88.9
Boolean Robustness
65.6
BREP Fidelity
77.5
Free-form Surfaces
87.5
Parametric Mechanical Parts
63.2
Assembly & Mating
63.8
Standards Compliance
68.3
Sheet-Metal Bodies
73.4
Sealing-Groove Design
75.3
Kinematic Mechanisms
75.9
DFM · 3-Axis CNC
74.6
DFM · Injection Mould
64.4
DFM · FDM 3D Print
76.5
CAM Toolpath Validity
70.5
Constraint Solving & Editability
80.2
Reverse Engineering
69.8
2-D Sketch Constraints
89.9
Functional Intent · FEA-Gated
68.3
Paraphrase Robustness
84.5
Confidence Calibration
43.0
MEAN METRICS BY LAYER
L1 · Geometry
| Category | Vol IoU | Chamfer | Hausdorff | NormCons | Watert. | Manif. | Euler | STEP RT | P@1 |
|---|---|---|---|---|---|---|---|---|---|
| Geometric Primitives | 0.673 | 0.138 | — | — | — | 0.951 | — | — | — |
| Boolean Robustness | 0.475 | — | — | — | — | 0.751 | — | — | — |
| BREP Fidelity | — | — | — | — | — | 0.930 | — | 0.111 | — |
| Free-form Surfaces | — | 0.170 | 0.866 | 0.815 | — | — | — | — | — |
L2 · Engineering
| Category | Dim RMSE | GD&T | FeatRec | Mating | Fit-Cls | Std |
|---|---|---|---|---|---|---|
| Parametric Mechanical Parts | 0.205 | 0.666 | 0.749 | — | — | — |
| Assembly & Mating | — | — | 0.769 | 0.659 | 0.486 | — |
| Standards Compliance | — | 0.643 | — | — | — | 0.723 |
| Sheet-Metal Bodies | 0.229 | — | 0.754 | — | — | — |
| Sealing-Groove Design | 0.227 | — | — | — | — | 0.733 |
| Kinematic Mechanisms | — | — | 0.783 | 0.666 | — | — |
L3 · Manufacturing
| Category | DFM | Draft | Min-Wall | CAM-Reach | Supp | Wall-Unif |
|---|---|---|---|---|---|---|
| DFM · 3-Axis CNC | 85.675 | — | 0.688 | 0.694 | — | — |
| DFM · Injection Mould | — | 0.631 | 0.670 | — | — | 0.632 |
| DFM · FDM 3D Print | 82.967 | — | 0.661 | — | 0.388 | — |
| CAM Toolpath Validity | — | — | — | 0.657 | — | — |
L4 · Cognition
| Category | ParamEdit | ConSolve | P-Range | FEA-σ | Para σ | Seed σ | Brier | EditLat |
|---|---|---|---|---|---|---|---|---|
| Constraint Solving & Editability | 0.829 | 0.810 | 0.767 | — | — | — | — | — |
| Reverse Engineering | — | — | — | — | — | — | — | — |
| 2-D Sketch Constraints | — | 0.862 | — | — | — | — | — | — |
| Functional Intent · FEA-Gated | — | — | — | 0.658 | — | — | — | — |
| Paraphrase Robustness | — | — | — | — | 0.026 | 0.036 | — | — |
| Confidence Calibration | — | — | — | — | — | — | 0.070 | — |
TASK-BY-TASK ARTIFACTS
PRIM-001Hollow cylinder (60 × 40 × 100)0.7130.00PRIM-007Right hexagonal prism with pitch fillet0.6880.00BOOL-003Coplanar-face union (knife-edge stress)0.5840.00BOOL-009High-genus subtraction (lattice block)0.0000.00BREP-004NURBS-handle goblet (G2 swept loft)0.5640.00SURF-002Compressor blade (NACA 65-(12)10)0.3820.00MECH-014L-bracket with M6 + slotted hole0.6810.00MECH-022Stepped shaft with retaining-ring groove0.5630.00MECH-027Planetary-gear carrier plate0.6050.00ASM-005Pin-in-hole, H7/g6 sliding fit0.5070.00ASM-011Dovetail slide (60° flanks)0.5040.00STD-002ISO 4762 M8×30 socket-head cap screw0.5770.00SHEET-003U-channel bracket, 1.5 mm Al, k=0.400.6260.00SEAL-001AS568-214 face-seal groove0.6600.00KIN-002Four-bar linkage, Grashof crank-rocker0.6570.00DFMCNC-0023-ax-machinable manifold block0.4800.00DFMMOLD-002Injection-mouldable enclosure half0.6110.00DFMFDM-008FDM-printable hinge (no support)0.5560.00CAM-0015-pocket plate, Ø3 endmill finish0.5930.00PARAM-006Editable flange (bolt circle param sweep)0.8191.00PARAM-013Editable bracket (length+30 %, hole→M8)0.7280.00REVENG-002Three-view ortho → bracket0.5160.00REVENG-009Three-view ortho → housing with cores0.4240.00SKETCH-003Tangent-arc transition profile0.8801.00FUNC-001Cantilever bracket — 250 N tip load, 6061-T6, SF≥40.5740.00FUNC-007Heat-sink fin array for 25 W TO-2200.5700.00PARA-0015× paraphrased L-bracket0.6960.00CAL-003Confidence-calibrated planetary carrier0.5030.00PRIM-002Sphere with planar cap0.7410.00PRIM-003Right truncated cone (frustum)0.6150.00PRIM-004Square-base pyramid frustum0.6140.00PRIM-005Tilted-axis box (30°)0.7060.00PRIM-009Hollow torus (Ø100 mean × Ø8 tube, 1 mm wall)0.6310.00BOOL-001Tangent cylinder onto cube (line-of-contact)0.5960.00BOOL-002Two interpenetrating spheres (lens intersection)0.7460.00BOOL-005ε-offset extrusion (sliver-face stress)0.4480.00BREP-001Periodic-spline cylinder (closed in U)0.4770.00BREP-002G1-only loft (tangent discontinuity)0.5260.00BREP-007Trimmed sphere with hole through pole0.6030.00SURF-001Ergonomic mug handle (revolved spline)0.6661.00SURF-007Mouse top-shell (Class-A)0.4900.00MECH-002Bearing block — two 6204 deep-groove bearings0.0000.00MECH-005Cam-follower lever (eccentric pivot)0.6990.00MECH-018Heat-set insert boss array (4× M3)0.6150.00MECH-031Threaded cap with diamond knurl0.6030.00ASM-001Threaded coupling M16×1.5 (male+female pair)0.6450.00ASM-008Spline shaft + hub (DIN 5480 W25×1.25×18)0.7040.00ASM-013Bayonet quarter-turn mount0.6730.00STD-001ISO 7050 self-tapping screw ST4.2 × 160.7220.00STD-005ISO 8734 dowel pin Ø6 m6 × 300.8661.00STD-008ASME B18.6.3 button-head 1/4-20 × 5/80.5340.00SHEET-001Box-pan with corner relief cuts0.6280.00SHEET-0073-bend electronics chassis0.6010.00SEAL-004AS568-218 piston-type radial groove0.5900.00SEAL-007ISO 3601-2 quad-ring face groove0.6840.00KIN-005Geneva drive — 4 station0.5660.00KIN-008Planetary gearset full mesh (sun + 3 planets + ring)0.5230.00DFMCNC-0054-pocket plate, R 0.5 internal corners0.5870.00DFMCNC-009Long-aspect bracket (vise-fixturable)0.6750.00DFMCNC-0135-axis-only fish-mouth saddle0.5050.00DFMMOLD-005Telephone handset shell0.5680.00DFMFDM-002Bridge-test calibration cube0.5270.00DFMFDM-011Print-in-place hinge0.6030.00CAM-005T-slot pocket array (Ø 8 + Ø 4 endmills)0.6500.00PARAM-009Configurable bottle (height + cap params)0.7671.00REVENG-005ABC dataset stepped pulley (multi-view)0.6350.00SKETCH-014Symmetric four-bar profile0.6260.00FUNC-003Heat-sink for 60 W CoB LED0.5580.00PARA-0055× paraphrased planetary carrier0.7861.00CAL-007Confidence-bracketed mounting flange0.5440.00