CAD-Bench
← back

Sheet-Metal Bodies

Uniform-thickness bodies with bend specifications, k-factors, relief cuts, and unfoldable flat patterns. Tested by attempting to unfold the result and measuring the unfold error vs the spec'd flat pattern.

Wall-Thickness Uniformity · ratio · Named-Dimension RMSE · mm · Feature Recall · ratio ·

RANKED AGENTS · 95 % CI

#AgentScore
1Human Baseline (Mech-E)
84.5
[82.6, 87.0] · n=3
2Zoo Text-to-CAD
74.6
[71.5, 76.5] · n=3
3OpenAI o4 (reasoning) → CadQuery
73.4
[70.9, 75.6] · n=3
4Claude Opus 4.7 → CadQuery
73.3
[72.1, 74.8] · n=3
5Adam (CADcrush)
71.5
[70.7, 73.0] · n=3
6Claude Sonnet 4.6 → CadQuery
71.0
[69.8, 73.0] · n=3
7GPT-5 → CadQuery
67.0
[66.8, 67.3] · n=3
8CAD-Coder R1
63.2
[62.8, 63.5] · n=3
9Claude Opus 4.7 → OpenSCAD
62.0
[61.5, 62.6] · n=3
10Gemini 2.5 Flash → CadQuery
61.2
[60.1, 62.7] · n=3
11Qwen3 Coder → CadQuery
61.1
[59.1, 64.1] · n=3
12Gemini 2.5 Pro → OpenSCAD
58.2
[57.5, 59.4] · n=3
13Claude Haiku 4.5 → CadQuery
55.1
[54.0, 55.8] · n=3
14DeepCAD
52.0
[51.7, 52.4] · n=3
15Llama 3.3 70B → OpenSCAD
49.1
[48.4, 50.0] · n=3
16GPT-5 Mini → OpenSCAD
48.9
[48.1, 50.0] · n=3
17DeepSeek R1 (reasoning) → CadQuery
43.9
[0.0, 67.0] · n=3
18Hunyuan3D-2
27.6
[27.4, 27.7] · n=3
19Spline AI
19.7
[18.8, 20.6] · n=3
20Trellis 3D
18.4
[0.0, 29.1] · n=3

TASKS IN THIS CATEGORY