Preprint · cad-bench/v0.5 · sweep 2026-04-12open · MIT
CAD·Benchv0.5
← all categories

Constraint Solving & Editability

Probes whether the agent exposes a working parametric graph: after the part is built we issue downstream parameter edits (length+30 %, hole diameter→M8) and re-evaluate without topological breakage.

Parametric Edit Accuracy · ratio · Parametric Range Integrity · ratio · Constraint Solve Rate · ratio ·

RANKED AGENTS · 95 % CI

#AgentScore
1Human Baseline (Mech-E)
84.0
[83.8, 84.2] · n=2
2Claude Opus 4.7 → CadQuery
79.8
[79.2, 80.4] · n=2
3Adam (CADcrush)
72.6
[70.6, 74.5] · n=2
4GPT-5 → CadQuery
70.8
[69.8, 71.9] · n=2
5Zoo Text-to-CAD
69.0
[68.2, 69.9] · n=2
6Gemini 2.5 Pro → OpenSCAD
55.1
[53.1, 57.2] · n=2
7Claude Opus 4.7 → OpenSCAD
29.3
[0.0, 58.6] · n=2
8DeepCAD
27.1
[27.1, 27.2] · n=2
9Trellis 3D
5.0
[5.0, 5.1] · n=2
10Spline AI
3.8
[3.8, 3.8] · n=2

TASKS IN THIS CATEGORY