CAD-Bench
← back
PRIM-002 · Geometric Primitives · difficulty 1/5

Sphere with planar cap

sha256:12bf4e0c44a1aa01

§1Prompt verbatim

Solid sphere of radius 25 mm cut by the plane z = 18 mm; keep the −z portion. Origin at sphere centre. Output a watertight solid.

§2Ground-truth spec

bbox50.0 × 50.0 × 43.0 mm
shells1
V−E+F2
genus0
watertighttrue
manifoldtrue
acceptance ε±0.05 mm

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold
canonical reference
REFERENCE
canonical · ground truth
1.000100
Zoo Text-to-CAD
Zoo Text-to-CAD
Zoo (KittyCAD)
0.88410
CAD-Coder R1
CAD-Coder R1
CAD-Coder Labs (research)
0.83810
DeepCAD
DeepCAD
Wu et al. 2021 (research)
0.7788
Human Baseline (Mech-E)
Human Baseline (Mech-E)
n=4 senior engineers
0.7758
Claude Sonnet 4.6 → CadQuery
Claude Sonnet 4.6 → CadQuery
Anthropic + CadQuery 2.4
0.7617
OpenAI o4 (reasoning) → CadQuery
OpenAI o4 (reasoning) → CadQuery
OpenAI + CadQuery 2.4
0.7417
GPT-5 → CadQuery
GPT-5 → CadQuery
OpenAI + CadQuery 2.4
0.6968
Adam (CADcrush)
Adam (CADcrush)
CADcrush
0.6637
Claude Opus 4.7 → CadQuery
Claude Opus 4.7 → CadQuery
Anthropic + CadQuery 2.4
0.6598
Gemini 2.5 Flash → CadQuery
Gemini 2.5 Flash → CadQuery
Google + CadQuery 2.4
0.62910
Llama 3.3 70B → OpenSCAD
Llama 3.3 70B → OpenSCAD
Meta + OpenSCAD 2024.06
0.60510
Hunyuan3D-2
Hunyuan3D-2
Tencent
0.59811
Gemini 2.5 Pro → OpenSCAD
Gemini 2.5 Pro → OpenSCAD
Google + OpenSCAD 2024.06
0.5930
DeepSeek R1 (reasoning) → CadQuery
DeepSeek R1 (reasoning) → CadQuery
DeepSeek + CadQuery 2.4
0.56912
Qwen3 Coder → CadQuery
Qwen3 Coder → CadQuery
Alibaba + CadQuery 2.4
0.54912
Claude Opus 4.7 → OpenSCAD
Claude Opus 4.7 → OpenSCAD
Anthropic + OpenSCAD 2024.06
0.5370
GPT-5 Mini → OpenSCAD
GPT-5 Mini → OpenSCAD
OpenAI + OpenSCAD 2024.06
0.52012
Claude Haiku 4.5 → CadQuery
Claude Haiku 4.5 → CadQuery
Anthropic + CadQuery 2.4
0.47712
Spline AI
Spline AI
Spline.design
0.4160
Trellis 3D
Trellis 3D
Microsoft Research
0.4030

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentVol IoUBidirectional ChamferWatert.Manif.P@1p50latencycost
Zoo Text-to-CAD0.8840.1350.9751.0007.3s$0.150
CAD-Coder R10.8380.1360.9710.0005.9s$0.006
DeepCAD0.7780.1000.9640.0005.1s$0.020
Human Baseline (Mech-E)0.7750.1290.9660.000697.9s$6.817
Claude Sonnet 4.6 → CadQuery0.7610.1480.9590.00020.8s$0.065
OpenAI o4 (reasoning) → CadQuery0.7410.1300.9590.000116.0s$1.134
GPT-5 → CadQuery0.6960.1580.9530.00035.9s$0.237
Adam (CADcrush)0.6630.1390.9590.0008.2s$0.219
Claude Opus 4.7 → CadQuery0.6590.1530.9550.00042.1s$0.347
Gemini 2.5 Flash → CadQuery0.6290.1320.9510.00012.6s$0.023
Llama 3.3 70B → OpenSCAD0.6050.1380.9410.00015.9s$0.024
Hunyuan3D-20.5980.1390.9360.00044.0s$0.078
Gemini 2.5 Pro → OpenSCAD0.5930.1410.9410.00029.6s$0.088
DeepSeek R1 (reasoning) → CadQuery0.5690.1720.9410.00073.3s$0.048
Qwen3 Coder → CadQuery0.5490.1400.9360.00013.0s$0.025
Claude Opus 4.7 → OpenSCAD0.5370.1850.9310.00038.5s$0.271
GPT-5 Mini → OpenSCAD0.5200.1900.9290.00010.2s$0.008
Claude Haiku 4.5 → CadQuery0.4770.1740.9250.0009.2s$0.018
Spline AI0.4160.2070.9170.0008.1s$0.042
Trellis 3D0.4030.1900.9130.00011.9s$0.054