CAD-Bench
← back
SHEET-003 · Sheet-Metal Bodies · difficulty 3/5

U-channel bracket, 1.5 mm Al, k=0.40

sha256:311fac0bb472ea91

§1Prompt verbatim

Sheet-metal U-channel from 1.5 mm Al-5052: outside dimensions 80 × 30 × 25 mm tall, bend radius 1.5 mm (inside), k-factor 0.40. Two M5 clearance holes (Ø 5.5) on the base, centred 15 mm from each end. Output the folded body and report the unfolded flat pattern dimensions.

§2Ground-truth spec

shells1
watertighttrue
manifoldtrue
acceptance ε±0.1 mm
featuresbend_x2, M5_clearance_x2

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold
canonical reference
REFERENCE
canonical · ground truth
1.000100
Human Baseline (Mech-E)
Human Baseline (Mech-E)
n=4 senior engineers
0.8486
Claude Opus 4.7 → CadQuery
Claude Opus 4.7 → CadQuery
Anthropic + CadQuery 2.4
0.77011
Claude Opus 4.7 → OpenSCAD
Claude Opus 4.7 → OpenSCAD
Anthropic + OpenSCAD 2024.06
0.6840
Claude Sonnet 4.6 → CadQuery
Claude Sonnet 4.6 → CadQuery
Anthropic + CadQuery 2.4
0.6737
Adam (CADcrush)
Adam (CADcrush)
CADcrush
0.6609
OpenAI o4 (reasoning) → CadQuery
OpenAI o4 (reasoning) → CadQuery
OpenAI + CadQuery 2.4
0.62610
CAD-Coder R1
CAD-Coder R1
CAD-Coder Labs (research)
0.62010
DeepSeek R1 (reasoning) → CadQuery
DeepSeek R1 (reasoning) → CadQuery
DeepSeek + CadQuery 2.4
0.6038
Zoo Text-to-CAD
Zoo Text-to-CAD
Zoo (KittyCAD)
0.5728
Gemini 2.5 Flash → CadQuery
Gemini 2.5 Flash → CadQuery
Google + CadQuery 2.4
0.5599
GPT-5 → CadQuery
GPT-5 → CadQuery
OpenAI + CadQuery 2.4
0.5579
Claude Haiku 4.5 → CadQuery
Claude Haiku 4.5 → CadQuery
Anthropic + CadQuery 2.4
0.49414
Qwen3 Coder → CadQuery
Qwen3 Coder → CadQuery
Alibaba + CadQuery 2.4
0.49213
Gemini 2.5 Pro → OpenSCAD
Gemini 2.5 Pro → OpenSCAD
Google + OpenSCAD 2024.06
0.3790
DeepCAD
DeepCAD
Wu et al. 2021 (research)
0.37513
Llama 3.3 70B → OpenSCAD
Llama 3.3 70B → OpenSCAD
Meta + OpenSCAD 2024.06
0.33216
GPT-5 Mini → OpenSCAD
GPT-5 Mini → OpenSCAD
OpenAI + OpenSCAD 2024.06
0.31120
Hunyuan3D-2
Hunyuan3D-2
Tencent
0.11844
Trellis 3D
Trellis 3D
Microsoft Research
0.0990
Spline AI
Spline AI
Spline.design
0.0000

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentWatert.Manif.Named-Dimension RMSEFeatRecWall-Thickness UniformityP@1p50latencycost
Human Baseline (Mech-E)0.9790.1120.8240.8021.000666.8s$6.217
Claude Opus 4.7 → CadQuery0.9600.1930.7870.6491.00030.4s$0.382
Claude Opus 4.7 → OpenSCAD0.9470.2650.5370.5860.00025.4s$0.299
Claude Sonnet 4.6 → CadQuery0.9580.2000.7390.6500.00022.4s$0.065
Adam (CADcrush)0.9420.1640.6400.6490.00010.5s$0.257
OpenAI o4 (reasoning) → CadQuery0.9420.2480.7220.6530.000106.8s$1.028
CAD-Coder R10.9480.2660.6730.4960.0007.5s$0.005
DeepSeek R1 (reasoning) → CadQuery0.9390.2570.6840.5820.00074.1s$0.039
Zoo Text-to-CAD0.9420.1410.7300.7060.0007.6s$0.201
Gemini 2.5 Flash → CadQuery0.9370.3020.5800.5260.00011.6s$0.021
GPT-5 → CadQuery0.9380.2530.6520.6110.00039.3s$0.239
Claude Haiku 4.5 → CadQuery0.9250.3550.5410.4330.0009.0s$0.016
Qwen3 Coder → CadQuery0.9250.2680.5940.4760.00014.2s$0.033
Gemini 2.5 Pro → OpenSCAD×0.9120.2660.5230.4740.00035.7s$0.077
DeepCAD×0.9080.3100.4910.3690.0003.5s$0.024
Llama 3.3 70B → OpenSCAD×0.8990.3810.4420.4020.00016.7s$0.022
GPT-5 Mini → OpenSCAD×0.8960.3510.3860.4070.00010.8s$0.009
Hunyuan3D-2×0.8680.4850.1900.1240.00037.4s$0.059
Trellis 3D×0.8650.5560.2040.1310.00011.8s$0.041
Spline AI×0.8500.5410.0920.0660.0008.5s$0.039