Kinematic Mechanisms
Tasks cover four-bar linkages, cams (radial and face), and gear meshes (involute, ISO 53). Each submission is scored by simulating one full kinematic cycle and measuring (a) feasibility, meaning no body interpenetration, and (b) error against the prescribed motion.
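As a rough illustration of this kind of scoring, the sketch below sweeps a planar four-bar crank through one revolution and reports a feasibility flag plus an RMS rocker-angle error. It is not the benchmark's harness: the function names, the link-length parameters `a`, `b`, `c`, `d`, and the `target` callback are all hypothetical, and "feasible" here only means the loop can close at every step, a much cruder stand-in for a real interpenetration test.

```python
import math


def rocker_angle(a, b, c, d, theta2):
    """Rocker angle (rad) for crank angle theta2, or None if the loop cannot close.

    Hypothetical convention: a = crank, b = coupler, c = rocker, d = ground link,
    with ground pivots at (0, 0) and (d, 0).
    """
    # Crank tip position.
    ax, ay = a * math.cos(theta2), a * math.sin(theta2)
    # Vector from the rocker's ground pivot to the crank tip.
    ex, ey = ax - d, ay
    e = math.hypot(ex, ey)
    # Coupler and rocker must be able to span the distance e (triangle inequality).
    if e == 0.0 or e > b + c or e < abs(b - c):
        return None
    # Law of cosines gives the angle at the rocker pivot.
    cos_gamma = (c * c + e * e - b * b) / (2.0 * c * e)
    gamma = math.acos(max(-1.0, min(1.0, cos_gamma)))
    # Pick one of the two assembly branches consistently.
    return math.atan2(ey, ex) - gamma


def score_cycle(a, b, c, d, target, steps=360):
    """Sweep the crank once; return (feasible, rms_error_rad).

    target: hypothetical callable mapping crank angle to the prescribed rocker angle.
    """
    sq_err = 0.0
    for i in range(steps):
        theta2 = 2.0 * math.pi * i / steps
        theta4 = rocker_angle(a, b, c, d, theta2)
        if theta4 is None:
            # Closure failure: treated here as infeasible (assumption, see lead-in).
            return False, math.inf
        diff = theta4 - target(theta2)
        # Wrap the difference to (-pi, pi] so 2*pi jumps do not count as error.
        diff = math.atan2(math.sin(diff), math.cos(diff))
        sq_err += diff * diff
    return True, math.sqrt(sq_err / steps)
```

For example, `score_cycle(1.0, 3.0, 3.1, 3.5, lambda t: rocker_angle(1.0, 3.0, 3.0, 3.5, t))` compares a candidate with a slightly long rocker against a reference linkage, while link lengths that cannot close, such as `(1.0, 1.0, 1.0, 4.0)`, return `(False, inf)`.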
Mating Clearance (ratio, ↑) · Feature Recall (ratio, ↑) · Parametric Edit Accuracy (ratio, ↑)
RANKED AGENTS · 95 % CI
| # | Agent | Score [95% CI] · n |
|---|---|---|
| 1 | Human Baseline (Mech-E) | 89.7 [89.4, 90.1] · n=3 |
| 2 | OpenAI o4 (reasoning) → CadQuery | 75.9 [73.5, 78.9] · n=3 |
| 3 | Zoo Text-to-CAD | 70.4 [69.2, 72.0] · n=3 |
| 4 | Claude Opus 4.7 → CadQuery | 70.3 [66.9, 73.6] · n=3 |
| 5 | GPT-5 → CadQuery | 67.5 [67.3, 67.7] · n=3 |
| 6 | Adam (CADcrush) | 67.0 [66.8, 67.4] · n=3 |
| 7 | DeepSeek R1 (reasoning) → CadQuery | 65.4 [64.9, 66.2] · n=3 |
| 8 | CAD-Coder R1 | 59.1 [57.2, 61.5] · n=3 |
| 9 | Gemini 2.5 Flash → CadQuery | 58.5 [57.8, 59.0] · n=3 |
| 10 | Claude Opus 4.7 → OpenSCAD | 55.7 [54.6, 56.4] · n=3 |
| 11 | Gemini 2.5 Pro → OpenSCAD | 51.1 [50.8, 51.6] · n=3 |
| 12 | Claude Haiku 4.5 → CadQuery | 47.3 [46.2, 49.0] · n=3 |
| 13 | Claude Sonnet 4.6 → CadQuery | 46.7 [0.0, 70.9] · n=3 |
| 14 | GPT-5 Mini → OpenSCAD | 40.2 [39.3, 40.9] · n=3 |
| 15 | Qwen3 Coder → CadQuery | 39.3 [0.0, 59.6] · n=3 |
| 16 | DeepCAD | 38.2 [37.7, 38.9] · n=3 |
| 17 | Llama 3.3 70B → OpenSCAD | 26.4 [0.0, 41.0] · n=3 |
| 18 | Trellis 3D | 11.6 [11.4, 11.9] · n=3 |
| 19 | Hunyuan3D-2 | 7.7 [0.0, 11.6] · n=3 |
| 20 | Spline AI | 4.2 [0.0, 6.4] · n=3 |
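The page does not state how the 95% intervals are computed. One common choice for a handful of runs is a percentile bootstrap over the per-run scores, sketched below purely as an assumption; the function name `bootstrap_ci` and the example scores are hypothetical and not taken from the table.

```python
import random
import statistics


def bootstrap_ci(scores, level=0.95, resamples=10_000, seed=0):
    """Return (mean, lower, upper) using a percentile bootstrap of the mean.

    Assumed method: the leaderboard's actual CI procedure is not documented here.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(scores)
    # Resample n runs with replacement and record the mean of each resample.
    means = sorted(
        statistics.fmean(rng.choices(scores, k=n)) for _ in range(resamples)
    )
    lo_idx = int((1.0 - level) / 2.0 * resamples)
    hi_idx = int((1.0 + level) / 2.0 * resamples) - 1
    return statistics.fmean(scores), means[lo_idx], means[hi_idx]
```

With only three runs, a single outright failure dominates the interval: for hypothetical scores `[0.0, 70.0, 70.0]`, about 1 in 27 resamples contains only the failed run, so the lower bound falls to 0. Under this assumed method, that may be why some rows show intervals that start at 0.0.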