CAD-Bench
← back

CAM Toolpath Validity

Stricter cousin of DFM-CNC: an actual 3-axis CAM postprocessor (FreeCAD-Path) must generate a collision-free G-code program at a ≤0.05 mm finish stepover. Score = fraction of part surface produced.

CAM Reachability · ratio · Feature Recall · ratio ·

RANKED AGENTS · 95 % CI

#AgentScore
1Human Baseline (Mech-E)
84.9
[84.5, 85.3] · n=2
2Zoo Text-to-CAD
71.7
[71.0, 72.3] · n=2
3OpenAI o4 (reasoning) → CadQuery
70.5
[69.5, 71.3] · n=2
4GPT-5 → CadQuery
68.3
[65.8, 70.8] · n=2
5Claude Opus 4.7 → CadQuery
67.8
[67.0, 68.8] · n=2
6Adam (CADcrush)
67.5
[66.0, 68.8] · n=2
7Claude Sonnet 4.6 → CadQuery
66.6
[66.5, 66.8] · n=2
8DeepSeek R1 (reasoning) → CadQuery
59.8
[59.8, 59.8] · n=2
9CAD-Coder R1
58.4
[55.9, 61.0] · n=2
10Qwen3 Coder → CadQuery
56.8
[54.6, 59.0] · n=2
11Gemini 2.5 Flash → CadQuery
54.6
[53.0, 56.4] · n=2
12Claude Opus 4.7 → OpenSCAD
53.9
[53.9, 54.0] · n=2
13Gemini 2.5 Pro → OpenSCAD
51.4
[50.7, 52.0] · n=2
14Claude Haiku 4.5 → CadQuery
47.3
[45.9, 48.6] · n=2
15DeepCAD
42.2
[41.5, 42.9] · n=2
16Llama 3.3 70B → OpenSCAD
39.5
[39.3, 39.6] · n=2
17GPT-5 Mini → OpenSCAD
39.1
[38.0, 40.3] · n=2
18Hunyuan3D-2
14.9
[14.3, 15.6] · n=2
19Trellis 3D
14.2
[14.1, 14.3] · n=2
20Spline AI
7.1
[7.1, 7.2] · n=2

TASKS IN THIS CATEGORY