CAD-Bench
SHEET-007 · Sheet-Metal Bodies · difficulty 4/5

3-bend electronics chassis

sha256:4cb01ef02d3a0091

§1 Prompt (verbatim)

U-shaped electronics chassis from 1.5 mm Al-5052, three 90° bends, internal volume 200 × 120 × 60 mm. Two louvered vent slots 60 × 8 mm on each long side, six M3 PEM nut clearance holes (Ø 4.2) on the base. Bend radius 1.5 mm, k-factor 0.40.
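The bend radius and k-factor in the prompt determine how much flat material each bend consumes. As a rough sketch, the standard bend-allowance formula BA = θ · (R + K · t) can be evaluated for the stated values. It assumes the 1.5 mm bend radius is the inside radius, which the prompt does not say explicitly; the function name is illustrative.

```python
import math

def bend_allowance(angle_deg: float, inside_radius: float,
                   k_factor: float, thickness: float) -> float:
    """Arc length of the neutral axis through one bend (same units as inputs)."""
    return math.radians(angle_deg) * (inside_radius + k_factor * thickness)

# Values from the prompt: 90° bends, R = 1.5 mm, K = 0.40, t = 1.5 mm.
# Assumption: R is the inside bend radius.
ba = bend_allowance(90.0, 1.5, 0.40, 1.5)
print(f"bend allowance per bend: {ba:.3f} mm")  # 3.299 mm
```

With three bends, a flat pattern built this way would include roughly 3 × 3.3 mm of neutral-axis arc in addition to the straight flange lengths.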

§2 Ground-truth spec

shells: 1
watertight: true
manifold: true
acceptance ε: ±0.1 mm
features: bend_x3, louver_x4, PEM_M3_clearance_x6
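The feature tags above follow a `name_xN` pattern, so they can be read as expected counts. The sketch below parses them and computes one plausible recall figure (matched features over expected features). The page does not define how FeatRec is actually scored, so `feature_recall` and the candidate counts are assumptions for illustration only.

```python
import re

def parse_features(spec: str) -> dict[str, int]:
    """Turn 'bend_x3, louver_x4' into {'bend': 3, 'louver': 4}."""
    counts = {}
    for tag in spec.split(","):
        match = re.fullmatch(r"(.+)_x(\d+)", tag.strip())
        if match is None:
            raise ValueError(f"unrecognised feature tag: {tag!r}")
        counts[match.group(1)] = int(match.group(2))
    return counts

def feature_recall(expected: dict[str, int], found: dict[str, int]) -> float:
    """Assumed definition: matched instances / expected instances."""
    total = sum(expected.values())
    hits = sum(min(found.get(name, 0), n) for name, n in expected.items())
    return hits / total

expected = parse_features("bend_x3, louver_x4, PEM_M3_clearance_x6")
# Hypothetical candidate: all bends and holes present, only two of four louvers.
found = {"bend": 3, "louver": 2, "PEM_M3_clearance": 6}
recall = feature_recall(expected, found)
print(expected, round(recall, 3))
```

Capping each feature at its expected count keeps surplus features from inflating the score; a real scorer might also need to check feature position and size.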

§3 Reference render


Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).
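A local copy of a reference file could be checked against the fingerprint shown at the top of the page along these lines. This is a minimal sketch that assumes the 16-character value is the leading portion of the full SHA-256 hex digest; the page does not confirm that, and `fingerprint_matches` is a hypothetical helper.

```python
import hashlib

def fingerprint_matches(data: bytes, fingerprint: str) -> bool:
    """Check data against a 'sha256:<hex prefix>' fingerprint string."""
    algo, _, prefix = fingerprint.partition(":")
    if algo != "sha256" or not prefix:
        raise ValueError(f"unsupported fingerprint: {fingerprint!r}")
    return hashlib.sha256(data).hexdigest().startswith(prefix.lower())

# Self-check on a known input: the SHA-256 of empty input starts with e3b0c44298fc1c14.
ok = fingerprint_matches(b"", "sha256:e3b0c44298fc1c14")
print(ok)

# With a real file (path is hypothetical):
#   from pathlib import Path
#   fingerprint_matches(Path("reference.step").read_bytes(), "sha256:4cb01ef02d3a0091")
```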

§4 Per-agent renders

reference + 20 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold
| Agent | Source | Vol IoU | BREP | Note |
| --- | --- | --- | --- | --- |
| REFERENCE | canonical · ground truth | 1.000 | 100 | |
| Human Baseline (Mech-E) | n=4 senior engineers | 0.725 | 6 | |
| Zoo Text-to-CAD | Zoo (KittyCAD) | 0.699 | 10 | |
| Claude Opus 4.7 → CadQuery | Anthropic + CadQuery 2.4 | 0.672 | 10 | |
| OpenAI o4 (reasoning) → CadQuery | OpenAI + CadQuery 2.4 | 0.601 | 10 | |
| Qwen3 Coder → CadQuery | Alibaba + CadQuery 2.4 | 0.589 | 12 | |
| Gemini 2.5 Flash → CadQuery | Google + CadQuery 2.4 | 0.562 | 10 | |
| GPT-5 → CadQuery | OpenAI + CadQuery 2.4 | 0.545 | 12 | |
| Claude Opus 4.7 → OpenSCAD | Anthropic + OpenSCAD 2024.06 | 0.471 | 0 | |
| CAD-Coder R1 | CAD-Coder Labs (research) | 0.462 | 14 | |
| Adam (CADcrush) | CADcrush | 0.451 | 14 | |
| Claude Sonnet 4.6 → CadQuery | Anthropic + CadQuery 2.4 | 0.434 | 14 | |
| Claude Haiku 4.5 → CadQuery | Anthropic + CadQuery 2.4 | 0.432 | 16 | |
| GPT-5 Mini → OpenSCAD | OpenAI + OpenSCAD 2024.06 | 0.416 | 13 | |
| DeepCAD | Wu et al. 2021 (research) | 0.367 | 16 | |
| Llama 3.3 70B → OpenSCAD | Meta + OpenSCAD 2024.06 | 0.349 | 17 | |
| Gemini 2.5 Pro → OpenSCAD | Google + OpenSCAD 2024.06 | 0.328 | 0 | |
| Trellis 3D | Microsoft Research | 0.138 | 0 | |
| Hunyuan3D-2 | Tencent | 0.104 | 44 | |
| Spline AI | Spline.design | 0.000 | 0 | no manifold solid produced |

DeepSeek R1 (reasoning) → CadQuery · DeepSeek + CadQuery 2.4 · 61

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.
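Vol IoU, the ranking metric, is the volume of the intersection of two solids divided by the volume of their union. The sketch below illustrates the ratio on axis-aligned boxes only, as a stand-in for real solids; how the benchmark evaluates it on STEP geometry is not described on this page, and the candidate dimensions are hypothetical.

```python
from typing import Tuple

Vec3 = Tuple[float, float, float]
Box = Tuple[Vec3, Vec3]  # (min corner, max corner)

def box_volume(box: Box) -> float:
    """Volume of an axis-aligned box; zero if any extent is non-positive."""
    lo, hi = box
    volume = 1.0
    for a, b in zip(lo, hi):
        volume *= max(0.0, b - a)
    return volume

def box_iou(a: Box, b: Box) -> float:
    """Intersection volume over union volume for two axis-aligned boxes."""
    lo = tuple(max(x, y) for x, y in zip(a[0], b[0]))
    hi = tuple(min(x, y) for x, y in zip(a[1], b[1]))
    inter = box_volume((lo, hi))
    union = box_volume(a) + box_volume(b) - inter
    return inter / union if union > 0 else 0.0

# Envelope from the prompt (200 × 120 × 60 mm) vs. a hypothetical candidate
# whose height came out 10% short.
reference = ((0.0, 0.0, 0.0), (200.0, 120.0, 60.0))
candidate = ((0.0, 0.0, 0.0), (200.0, 120.0, 54.0))
iou = box_iou(reference, candidate)
print(round(iou, 3))  # 0.9
```

For thin-walled parts such as this chassis, small offsets between candidate and reference walls reduce the overlap sharply, so the ratio is far more sensitive than this solid-box example suggests.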

§5 Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
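| Agent | Watert. | Manif. | Named-Dimension RMSE | FeatRec | Wall-Thickness Uniformity | P@1 | p50 latency | cost |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Human Baseline (Mech-E) | | 0.969 | 0.080 | 0.875 | 0.816 | 0.000 | 895.1 s | $6.401 |
| Zoo Text-to-CAD | | 0.961 | 0.179 | 0.695 | 0.630 | 0.000 | 8.0 s | $0.187 |
| Claude Opus 4.7 → CadQuery | | 0.944 | 0.197 | 0.705 | 0.656 | 0.000 | 36.4 s | $0.404 |
| OpenAI o4 (reasoning) → CadQuery | | 0.949 | 0.185 | 0.787 | 0.665 | 0.000 | 99.6 s | $1.005 |
| Qwen3 Coder → CadQuery | | 0.934 | 0.288 | 0.575 | 0.486 | 0.000 | 18.1 s | $0.028 |
| Gemini 2.5 Flash → CadQuery | | 0.930 | 0.280 | 0.582 | 0.524 | 0.000 | 8.5 s | $0.018 |
| GPT-5 → CadQuery | | 0.935 | 0.270 | 0.697 | 0.578 | 0.000 | 44.2 s | $0.173 |
| Claude Opus 4.7 → OpenSCAD | | 0.919 | 0.248 | 0.547 | 0.579 | 0.000 | 39.9 s | $0.339 |
| CAD-Coder R1 | | 0.925 | 0.306 | 0.683 | 0.527 | 0.000 | 6.1 s | $0.005 |
| Adam (CADcrush) | | 0.922 | 0.182 | 0.638 | 0.664 | 0.000 | 7.9 s | $0.232 |
| Claude Sonnet 4.6 → CadQuery | | 0.914 | 0.193 | 0.714 | 0.574 | 0.000 | 12.7 s | $0.067 |
| Claude Haiku 4.5 → CadQuery | × | 0.912 | 0.302 | 0.535 | 0.438 | 0.000 | 10.0 s | $0.017 |
| GPT-5 Mini → OpenSCAD | × | 0.912 | 0.310 | 0.396 | 0.413 | 0.000 | 9.8 s | $0.009 |
| DeepCAD | × | 0.907 | 0.310 | 0.469 | 0.403 | 0.000 | 5.3 s | $0.020 |
| Llama 3.3 70B → OpenSCAD | × | 0.900 | 0.304 | 0.445 | 0.360 | 0.000 | 27.6 s | $0.018 |
| Gemini 2.5 Pro → OpenSCAD | × | 0.897 | 0.319 | 0.515 | 0.530 | 0.000 | 30.7 s | $0.084 |
| Trellis 3D | × | 0.870 | 0.465 | 0.208 | 0.131 | 0.000 | 13.0 s | $0.047 |
| Hunyuan3D-2 | × | 0.867 | 0.509 | 0.195 | 0.135 | 0.000 | 35.1 s | $0.072 |
| Spline AI | × | 0.850 | 0.598 | 0.100 | 0.062 | 0.000 | 9.5 s | $0.033 |

DeepSeek R1 (reasoning) → CadQuery · kernel error: BRepCheck_NotClosed · Watert. × · 0.000 · 0.000 · p50 latency 73.7 s · cost $0.038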
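The Named-Dimension RMSE column is not defined on this page. One plausible reading is the root-mean-square deviation between each named dimension of a candidate and its nominal value, as sketched below. The dimension names, the measured values, and the reuse of the ±0.1 mm acceptance band as a per-dimension flag are all assumptions for illustration; the benchmark may normalise or weight the errors differently.

```python
import math

def named_dimension_rmse(nominal: dict[str, float],
                         measured: dict[str, float]) -> float:
    """RMS of (measured - nominal) over every nominal dimension.

    Raises KeyError if a nominal dimension is absent from `measured`.
    """
    errors = [measured[name] - nominal[name] for name in nominal]
    return math.sqrt(sum(e * e for e in errors) / len(errors))

def out_of_tolerance(nominal: dict[str, float], measured: dict[str, float],
                     eps: float = 0.1) -> list[str]:
    """Names of dimensions whose absolute error exceeds eps (mm)."""
    return [n for n in nominal if abs(measured[n] - nominal[n]) > eps]

# Nominal values come from the prompt; the dimension names are hypothetical.
nominal = {"length": 200.0, "width": 120.0, "height": 60.0, "thickness": 1.5}
# Hypothetical measurements of a candidate model.
measured = {"length": 200.3, "width": 119.95, "height": 60.0, "thickness": 1.5}

rmse = named_dimension_rmse(nominal, measured)
flagged = out_of_tolerance(nominal, measured)
print(round(rmse, 3), flagged)
```

An aggregate RMSE can hide a single large miss among many exact dimensions, which is why a per-dimension tolerance check is shown alongside it.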