CAD-Bench
← back
PRIM-001 · Geometric Primitives · difficulty 1/5

Hollow cylinder (60 × 40 × 100)

sha256:e3b0c44298fc1c14

§1Prompt verbatim

Model a hollow cylinder with outer diameter 60 mm, inner diameter 40 mm, height 100 mm. Origin at the centroid of the bottom face. Output a watertight solid.

§2Ground-truth spec

volume157080.0 mm³
surface area34558.0 mm²
bbox60.0 × 60.0 × 100.0 mm
shells1
V−E+F0
genus1
watertighttrue
manifoldtrue
acceptance ε±0.05 mm

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold
canonical reference
REFERENCE
canonical · ground truth
1.000100
Human Baseline (Mech-E)
Human Baseline (Mech-E)
n=4 senior engineers
0.8646
Zoo Text-to-CAD
Zoo Text-to-CAD
Zoo (KittyCAD)
0.8347
CAD-Coder R1
CAD-Coder R1
CAD-Coder Labs (research)
0.77710
Claude Opus 4.7 → CadQuery
Claude Opus 4.7 → CadQuery
Anthropic + CadQuery 2.4
0.71812
DeepSeek R1 (reasoning) → CadQuery
DeepSeek R1 (reasoning) → CadQuery
DeepSeek + CadQuery 2.4
0.7188
OpenAI o4 (reasoning) → CadQuery
OpenAI o4 (reasoning) → CadQuery
OpenAI + CadQuery 2.4
0.7139
DeepCAD
DeepCAD
Wu et al. 2021 (research)
0.6937
GPT-5 → CadQuery
GPT-5 → CadQuery
OpenAI + CadQuery 2.4
0.6367
Claude Sonnet 4.6 → CadQuery
Claude Sonnet 4.6 → CadQuery
Anthropic + CadQuery 2.4
0.6198
Trellis 3D
Trellis 3D
Microsoft Research
0.6130
Gemini 2.5 Pro → OpenSCAD
Gemini 2.5 Pro → OpenSCAD
Google + OpenSCAD 2024.06
0.5930
GPT-5 Mini → OpenSCAD
GPT-5 Mini → OpenSCAD
OpenAI + OpenSCAD 2024.06
0.58210
Claude Opus 4.7 → OpenSCAD
Claude Opus 4.7 → OpenSCAD
Anthropic + OpenSCAD 2024.06
0.5750
Claude Haiku 4.5 → CadQuery
Claude Haiku 4.5 → CadQuery
Anthropic + CadQuery 2.4
0.57310
Gemini 2.5 Flash → CadQuery
Gemini 2.5 Flash → CadQuery
Google + CadQuery 2.4
0.55613
Llama 3.3 70B → OpenSCAD
Llama 3.3 70B → OpenSCAD
Meta + OpenSCAD 2024.06
0.53411
Qwen3 Coder → CadQuery
Qwen3 Coder → CadQuery
Alibaba + CadQuery 2.4
0.50813
Hunyuan3D-2
Hunyuan3D-2
Tencent
0.47312
Spline AI
Spline AI
Spline.design
0.3970
no manifold solid produced
Adam (CADcrush)
Adam (CADcrush)
CADcrush
79

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentVol IoUBidirectional ChamferWatert.Manif.P@1p50latencycost
Human Baseline (Mech-E)0.8640.1280.9901.000504.6s$6.616
Zoo Text-to-CAD0.8340.1270.9690.0006.3s$0.146
CAD-Coder R10.7770.1240.9630.0004.8s$0.005
Claude Opus 4.7 → CadQuery0.7180.1100.9520.00039.0s$0.372
DeepSeek R1 (reasoning) → CadQuery0.7180.1510.9570.000105.4s$0.040
OpenAI o4 (reasoning) → CadQuery0.7130.1080.9550.000133.4s$1.291
DeepCAD0.6930.1270.9590.0005.5s$0.019
GPT-5 → CadQuery0.6360.1370.9470.00034.2s$0.233
Claude Sonnet 4.6 → CadQuery0.6190.1450.9500.00020.1s$0.058
Trellis 3D0.6130.1530.9370.0009.4s$0.043
Gemini 2.5 Pro → OpenSCAD0.5930.1790.9350.00036.2s$0.083
GPT-5 Mini → OpenSCAD0.5820.1440.9410.00016.6s$0.010
Claude Opus 4.7 → OpenSCAD0.5750.1300.9450.00022.9s$0.319
Claude Haiku 4.5 → CadQuery0.5730.1570.9410.0006.7s$0.019
Gemini 2.5 Flash → CadQuery0.5560.1860.9300.00013.7s$0.018
Llama 3.3 70B → OpenSCAD0.5340.1740.9260.00021.3s$0.016
Qwen3 Coder → CadQuery0.5080.1380.9330.00013.2s$0.035
Hunyuan3D-20.4730.1600.9260.00039.6s$0.078
Spline AI0.3970.227×0.9070.0005.6s$0.034
Adam (CADcrush)
kernel error: BRepCheck_NotClosed
0.000×0.0000.0008.2s$0.308