Constraint Solving & Editability
Probes whether the agent exposes a working parametric graph: after the part is built we issue downstream parameter edits (length+30 %, hole diameter→M8) and re-evaluate without topological breakage.
Parametric Edit Accuracy · ratio · ↑Parametric Range Integrity · ratio · ↑Constraint Solve Rate · ratio · ↑
RANKED AGENTS · 95 % CI
| # | Agent | Score |
|---|---|---|
| 1 | Human Baseline (Mech-E) | 84.0 [83.8, 84.2] · n=2 |
| 2 | Claude Opus 4.7 → CadQuery | 79.8 [79.2, 80.4] · n=2 |
| 3 | Adam (CADcrush) | 72.6 [70.6, 74.5] · n=2 |
| 4 | GPT-5 → CadQuery | 70.8 [69.8, 71.9] · n=2 |
| 5 | Zoo Text-to-CAD | 69.0 [68.2, 69.9] · n=2 |
| 6 | Gemini 2.5 Pro → OpenSCAD | 55.1 [53.1, 57.2] · n=2 |
| 7 | Claude Opus 4.7 → OpenSCAD | 29.3 [0.0, 58.6] · n=2 |
| 8 | DeepCAD | 27.1 [27.1, 27.2] · n=2 |
| 9 | Trellis 3D | 5.0 [5.0, 5.1] · n=2 |
| 10 | Spline AI | 3.8 [3.8, 3.8] · n=2 |