CAD-Bench
← back
PRIM-003 · Geometric Primitives · difficulty 1/5

Right truncated cone (frustum)

sha256:2c3fa9001ee04d80

§1Prompt verbatim

Right circular frustum: bottom Ø 60 mm, top Ø 30 mm, height 50 mm. Origin at the centroid of the bottom face. Solid, manifold.

§2Ground-truth spec

bbox60.0 × 60.0 × 50.0 mm
shells1
V−E+F2
genus0
watertighttrue
manifoldtrue
acceptance ε±0.03 mm

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold
canonical reference
REFERENCE
canonical · ground truth
1.000100
Human Baseline (Mech-E)
Human Baseline (Mech-E)
n=4 senior engineers
0.8029
Adam (CADcrush)
Adam (CADcrush)
CADcrush
0.7738
Zoo Text-to-CAD
Zoo Text-to-CAD
Zoo (KittyCAD)
0.76210
Gemini 2.5 Pro → OpenSCAD
Gemini 2.5 Pro → OpenSCAD
Google + OpenSCAD 2024.06
0.7070
CAD-Coder R1
CAD-Coder R1
CAD-Coder Labs (research)
0.67511
DeepSeek R1 (reasoning) → CadQuery
DeepSeek R1 (reasoning) → CadQuery
DeepSeek + CadQuery 2.4
0.64212
DeepCAD
DeepCAD
Wu et al. 2021 (research)
0.63312
Qwen3 Coder → CadQuery
Qwen3 Coder → CadQuery
Alibaba + CadQuery 2.4
0.6319
Claude Opus 4.7 → CadQuery
Claude Opus 4.7 → CadQuery
Anthropic + CadQuery 2.4
0.61610
OpenAI o4 (reasoning) → CadQuery
OpenAI o4 (reasoning) → CadQuery
OpenAI + CadQuery 2.4
0.6157
Claude Sonnet 4.6 → CadQuery
Claude Sonnet 4.6 → CadQuery
Anthropic + CadQuery 2.4
0.5779
GPT-5 → CadQuery
GPT-5 → CadQuery
OpenAI + CadQuery 2.4
0.5738
Gemini 2.5 Flash → CadQuery
Gemini 2.5 Flash → CadQuery
Google + CadQuery 2.4
0.5479
Claude Haiku 4.5 → CadQuery
Claude Haiku 4.5 → CadQuery
Anthropic + CadQuery 2.4
0.53713
Trellis 3D
Trellis 3D
Microsoft Research
0.5190
Llama 3.3 70B → OpenSCAD
Llama 3.3 70B → OpenSCAD
Meta + OpenSCAD 2024.06
0.47211
Spline AI
Spline AI
Spline.design
0.4700
GPT-5 Mini → OpenSCAD
GPT-5 Mini → OpenSCAD
OpenAI + OpenSCAD 2024.06
0.46414
Hunyuan3D-2
Hunyuan3D-2
Tencent
0.42212
no manifold solid produced
Claude Opus 4.7 → OpenSCAD
Claude Opus 4.7 → OpenSCAD
Anthropic + OpenSCAD 2024.06
18

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentVol IoUBidirectional ChamferWatert.Manif.P@1p50latencycost
Human Baseline (Mech-E)0.8020.1070.9750.000698.4s$6.222
Adam (CADcrush)0.7730.1440.9590.00010.5s$0.312
Zoo Text-to-CAD0.7620.1270.9690.0006.2s$0.172
Gemini 2.5 Pro → OpenSCAD0.7070.1500.9510.00034.6s$0.102
CAD-Coder R10.6750.1520.9500.0006.5s$0.006
DeepSeek R1 (reasoning) → CadQuery0.6420.1290.9430.00096.6s$0.036
DeepCAD0.6330.1570.9490.0004.1s$0.024
Qwen3 Coder → CadQuery0.6310.1300.9500.00022.3s$0.032
Claude Opus 4.7 → CadQuery0.6160.1480.9430.00041.0s$0.299
OpenAI o4 (reasoning) → CadQuery0.6150.1220.9450.000130.8s$0.901
Claude Sonnet 4.6 → CadQuery0.5770.1650.9450.00021.7s$0.080
GPT-5 → CadQuery0.5730.1710.9380.00031.4s$0.220
Gemini 2.5 Flash → CadQuery0.5470.1280.9390.00012.6s$0.022
Claude Haiku 4.5 → CadQuery0.5370.1460.9270.0009.1s$0.023
Trellis 3D0.5190.1590.9290.00015.5s$0.046
Llama 3.3 70B → OpenSCAD0.4720.1820.9160.00018.8s$0.021
Spline AI0.4700.2140.9170.0009.8s$0.040
GPT-5 Mini → OpenSCAD0.4640.2020.9180.00011.3s$0.009
Hunyuan3D-20.4220.1780.9150.00029.1s$0.076
Claude Opus 4.7 → OpenSCAD
kernel error: BRepCheck_NotClosed
0.000×0.0000.00037.7s$0.341