CAD-Bench
← back
FUNC-007 · Functional Intent · FEA-Gated · difficulty 5/5

Heat-sink fin array for 25 W TO-220

sha256:a012b3e44c510fd0

§1Prompt verbatim

Design a fin-array heat sink in 6063-T5 aluminium that holds a TO-220 device (mounting hole pattern given) below 95 °C junction at 25 W in still air at 25 °C ambient. Footprint ≤ 60 × 60 mm, height ≤ 35 mm. Output STEP.

§2Ground-truth spec

shells1
watertighttrue
manifoldtrue
acceptance ε±0.1 mm
featuresTO220_mount, fin_array

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold
canonical reference
REFERENCE
canonical · ground truth
1.000100
Human Baseline (Mech-E)
Human Baseline (Mech-E)
n=4 senior engineers
0.6338
OpenAI o4 (reasoning) → CadQuery
OpenAI o4 (reasoning) → CadQuery
OpenAI + CadQuery 2.4
0.57013
Zoo Text-to-CAD
Zoo Text-to-CAD
Zoo (KittyCAD)
0.48111
Claude Sonnet 4.6 → CadQuery
Claude Sonnet 4.6 → CadQuery
Anthropic + CadQuery 2.4
0.38814
Claude Opus 4.7 → CadQuery
Claude Opus 4.7 → CadQuery
Anthropic + CadQuery 2.4
0.38512
GPT-5 → CadQuery
GPT-5 → CadQuery
OpenAI + CadQuery 2.4
0.38117
DeepSeek R1 (reasoning) → CadQuery
DeepSeek R1 (reasoning) → CadQuery
DeepSeek + CadQuery 2.4
0.35613
Gemini 2.5 Flash → CadQuery
Gemini 2.5 Flash → CadQuery
Google + CadQuery 2.4
0.24823
Claude Opus 4.7 → OpenSCAD
Claude Opus 4.7 → OpenSCAD
Anthropic + OpenSCAD 2024.06
0.2440
Gemini 2.5 Pro → OpenSCAD
Gemini 2.5 Pro → OpenSCAD
Google + OpenSCAD 2024.06
0.1960
Qwen3 Coder → CadQuery
Qwen3 Coder → CadQuery
Alibaba + CadQuery 2.4
0.17430
CAD-Coder R1
CAD-Coder R1
CAD-Coder Labs (research)
0.15232
Claude Haiku 4.5 → CadQuery
Claude Haiku 4.5 → CadQuery
Anthropic + CadQuery 2.4
0.12840
DeepCAD
DeepCAD
Wu et al. 2021 (research)
0.06273
no manifold solid produced
Adam (CADcrush)
Adam (CADcrush)
CADcrush
79
Trellis 3D
Trellis 3D
Microsoft Research
0.0000
Spline AI
Spline AI
Spline.design
0.0000
GPT-5 Mini → OpenSCAD
GPT-5 Mini → OpenSCAD
OpenAI + OpenSCAD 2024.06
0.000105
no manifold solid produced
Llama 3.3 70B → OpenSCAD
Llama 3.3 70B → OpenSCAD
Meta + OpenSCAD 2024.06
14
Hunyuan3D-2
Hunyuan3D-2
Tencent
0.000102

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentWatert.Manif.FeatRecMin-Wall ComplianceFEA-Yield PassP@1p50latencycost
Human Baseline (Mech-E)0.9480.9570.8580.8410.000730.0s$4.929
OpenAI o4 (reasoning) → CadQuery0.9350.7220.7140.6220.000131.2s$0.916
Zoo Text-to-CAD0.9180.8070.6910.4790.0006.7s$0.194
Claude Sonnet 4.6 → CadQuery×0.9130.7330.6470.4740.00018.8s$0.072
Claude Opus 4.7 → CadQuery×0.9110.6790.6520.5800.00049.1s$0.291
GPT-5 → CadQuery×0.9050.7060.6590.4750.00030.7s$0.200
DeepSeek R1 (reasoning) → CadQuery×0.9060.6710.5890.5450.000119.6s$0.040
Gemini 2.5 Flash → CadQuery×0.8870.5750.5020.3980.00012.5s$0.020
Claude Opus 4.7 → OpenSCAD×0.8860.6090.5980.3580.00030.3s$0.261
Gemini 2.5 Pro → OpenSCAD×0.8790.4920.5370.3850.00034.2s$0.076
Qwen3 Coder → CadQuery×0.8750.6250.4980.3940.00018.9s$0.025
CAD-Coder R1×0.8750.6720.5170.2950.0006.1s$0.004
Claude Haiku 4.5 → CadQuery×0.8690.4910.4550.2930.0009.0s$0.019
DeepCAD×0.8600.4860.3590.1980.0004.0s$0.017
Adam (CADcrush)
kernel error: BRepCheck_NotClosed
×0.0000.00011.7s$0.255
Trellis 3D×0.8500.2140.2170.1000.00011.7s$0.040
Spline AI×0.8500.0900.0960.0440.0007.1s$0.047
GPT-5 Mini → OpenSCAD×0.8500.4120.4500.2440.00012.7s$0.010
Llama 3.3 70B → OpenSCAD
kernel error: BRepCheck_NotClosed
×0.0000.00026.5s$0.023
Hunyuan3D-2×0.8500.2100.2350.0970.00029.2s$0.070