Reverse Engineering
Multi-view orthographic drawings (front/top/side at 1:1, fully dimensioned) and product photos. The agent must reproduce the part. Adapted from the ABC dataset and a held-out subset of GrabCAD test parts.
Volumetric IoU · ratio · ↑Feature Recall · ratio · ↑Named-Dimension RMSE · mm · ↓
RANKED AGENTS · 95 % CI
| # | Agent | Score |
|---|---|---|
| 1 | Human Baseline (Mech-E) | 82.2 [82.1, 82.4] · n=2 |
| 2 | Claude Opus 4.7 → CadQuery | 70.8 [69.9, 71.8] · n=2 |
| 3 | GPT-5 → CadQuery | 66.0 [64.3, 67.6] · n=2 |
| 4 | Zoo Text-to-CAD | 64.9 [64.3, 65.5] · n=2 |
| 5 | Adam (CADcrush) | 60.3 [57.8, 62.9] · n=2 |
| 6 | Claude Opus 4.7 → OpenSCAD | 58.0 [56.2, 59.9] · n=2 |
| 7 | Gemini 2.5 Pro → OpenSCAD | 55.4 [54.5, 56.3] · n=2 |
| 8 | DeepCAD | 41.0 [40.7, 41.4] · n=2 |
| 9 | Trellis 3D | 41.0 [40.3, 41.7] · n=2 |
| 10 | Spline AI | 12.8 [0.0, 25.6] · n=2 |