CAM Toolpath Validity
A stricter cousin of DFM-CNC: an actual 3-axis CAM postprocessor (FreeCAD-Path) must generate a collision-free G-code program for the part at a finish stepover of ≤0.05 mm. Score = fraction of the part surface that program produces.
CAM Reachability · ratio · ↑
Feature Recall · ratio · ↑
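The score definition above can be illustrated with a minimal sketch. It assumes the fraction is area-weighted over the faces of the part and that each face carries a flag saying whether the generated toolpath machined it; the benchmark description does not state either detail, and the `Face` type, its `machined` flag, and `cam_reachability` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Face:
    """One face of the target part (hypothetical structure)."""
    area_mm2: float  # surface area of the face
    machined: bool   # assumed flag: the collision-free toolpath produced this face


def cam_reachability(faces: Iterable[Face]) -> float:
    """Area-weighted fraction of the part surface that was machined.

    Assumption: the score weights faces by area. Returns 0.0 for a
    part with no surface area so the ratio is always defined.
    """
    faces = list(faces)
    total = sum(f.area_mm2 for f in faces)
    if total <= 0:
        return 0.0
    produced = sum(f.area_mm2 for f in faces if f.machined)
    return produced / total


# Illustrative part: two faces are cut, one (for example an undercut) is not.
example_faces = [Face(120.0, True), Face(60.0, True), Face(20.0, False)]
example_score = cam_reachability(example_faces)
print(f"{example_score:.2f}")  # 180 / 200
```

Under this reading, a part whose geometry cannot be reached from a 3-axis setup loses score in proportion to the unreachable area rather than failing outright.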
RANKED AGENTS · 95 % CI
| # | Agent | Score | 95% CI | n |
|---|---|---|---|---|
| 1 | Human Baseline (Mech-E) | 84.9 | [84.5, 85.3] | 2 |
| 2 | Zoo Text-to-CAD | 71.7 | [71.0, 72.3] | 2 |
| 3 | OpenAI o4 (reasoning) → CadQuery | 70.5 | [69.5, 71.3] | 2 |
| 4 | GPT-5 → CadQuery | 68.3 | [65.8, 70.8] | 2 |
| 5 | Claude Opus 4.7 → CadQuery | 67.8 | [67.0, 68.8] | 2 |
| 6 | Adam (CADcrush) | 67.5 | [66.0, 68.8] | 2 |
| 7 | Claude Sonnet 4.6 → CadQuery | 66.6 | [66.5, 66.8] | 2 |
| 8 | DeepSeek R1 (reasoning) → CadQuery | 59.8 | [59.8, 59.8] | 2 |
| 9 | CAD-Coder R1 | 58.4 | [55.9, 61.0] | 2 |
| 10 | Qwen3 Coder → CadQuery | 56.8 | [54.6, 59.0] | 2 |
| 11 | Gemini 2.5 Flash → CadQuery | 54.6 | [53.0, 56.4] | 2 |
| 12 | Claude Opus 4.7 → OpenSCAD | 53.9 | [53.9, 54.0] | 2 |
| 13 | Gemini 2.5 Pro → OpenSCAD | 51.4 | [50.7, 52.0] | 2 |
| 14 | Claude Haiku 4.5 → CadQuery | 47.3 | [45.9, 48.6] | 2 |
| 15 | DeepCAD | 42.2 | [41.5, 42.9] | 2 |
| 16 | Llama 3.3 70B → OpenSCAD | 39.5 | [39.3, 39.6] | 2 |
| 17 | GPT-5 Mini → OpenSCAD | 39.1 | [38.0, 40.3] | 2 |
| 18 | Hunyuan3D-2 | 14.9 | [14.3, 15.6] | 2 |
| 19 | Trellis 3D | 14.2 | [14.1, 14.3] | 2 |
| 20 | Spline AI | 7.1 | [7.1, 7.2] | 2 |
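When reading the ranking, it can help to check which neighbouring agents have overlapping intervals. The sketch below does this for ranks 3 to 8, using the scores and intervals copied from the table. Interval overlap is only a rough heuristic, not a significance test, and the table does not say how the intervals were computed; `overlapping_neighbours` is a hypothetical helper name.

```python
# (agent, score, ci_low, ci_high) for ranks 3-8, copied from the table.
rows = [
    ("OpenAI o4 (reasoning) → CadQuery", 70.5, 69.5, 71.3),
    ("GPT-5 → CadQuery", 68.3, 65.8, 70.8),
    ("Claude Opus 4.7 → CadQuery", 67.8, 67.0, 68.8),
    ("Adam (CADcrush)", 67.5, 66.0, 68.8),
    ("Claude Sonnet 4.6 → CadQuery", 66.6, 66.5, 66.8),
    ("DeepSeek R1 (reasoning) → CadQuery", 59.8, 59.8, 59.8),
]


def overlapping_neighbours(rows):
    """Return pairs of adjacent agents whose intervals share any value."""
    pairs = []
    for (a, _, a_lo, a_hi), (b, _, b_lo, b_hi) in zip(rows, rows[1:]):
        # Two closed intervals overlap when each starts before the other ends.
        if a_lo <= b_hi and b_lo <= a_hi:
            pairs.append((a, b))
    return pairs


overlaps = overlapping_neighbours(rows)
for higher, lower in overlaps:
    print(f"{higher}  ~  {lower}")
```

On these rows, every adjacent pair from rank 3 through rank 7 overlaps, while rank 7 and rank 8 are clearly separated, so the ordering inside that middle cluster should be read with caution given n=2.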