CAD-Bench
PRIM-009 · Geometric Primitives · difficulty 4/5

Hollow torus (Ø100 mean × Ø8 tube, 1 mm wall)

sha256:fe1c0a8b22ff44b3

§1 Prompt verbatim

Hollow torus: mean ring diameter 100 mm, tube outer diameter 8 mm, wall thickness 1 mm. Solid (i.e. the tube is a sealed hollow toroidal shell). Origin at the torus centre.
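The prompt's three dimensions fix the whole geometry, and the arithmetic is easy to get wrong (diameter versus radius, wall measured inward from the outer surface). The sketch below derives the radii, bounding box, and wall-material volume. It assumes "mean ring diameter" is the diameter of the circle traced by the tube centre and that the 1 mm wall is measured inward from the tube's outer surface; the function and key names are illustrative and not part of the benchmark.

```python
import math

# Dimensions from the prompt, in millimetres.
MEAN_RING_DIAMETER = 100.0   # assumed: circle traced by the tube centre
TUBE_OUTER_DIAMETER = 8.0
WALL_THICKNESS = 1.0         # assumed: measured inward from the outer surface


def torus_volume(major_radius: float, minor_radius: float) -> float:
    """Volume of a solid torus, V = 2 * pi^2 * R * r^2 (Pappus' theorem)."""
    return 2.0 * math.pi ** 2 * major_radius * minor_radius ** 2


def hollow_torus_dims() -> dict:
    """Derive radii, bounding box, and wall volume for the hollow torus."""
    major = MEAN_RING_DIAMETER / 2.0      # R
    outer = TUBE_OUTER_DIAMETER / 2.0     # tube outer radius
    inner = outer - WALL_THICKNESS        # cavity radius
    wall_volume = torus_volume(major, outer) - torus_volume(major, inner)
    return {
        "major_radius": major,
        "tube_outer_radius": outer,
        "tube_inner_radius": inner,
        # Extent along x, y, z with the ring lying in the XY plane.
        "bounding_box": (2 * (major + outer), 2 * (major + outer), 2 * outer),
        "wall_volume": wall_volume,
    }


if __name__ == "__main__":
    for key, value in hollow_torus_dims().items():
        print(key, value)
```

Under these assumptions the radii are R = 50 mm, 4 mm, and 3 mm, and the wall volume reduces to 700π² mm³, roughly 6909 mm³.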

§2 Ground-truth spec

shells: 1
V−E+F: 0
genus: 2
watertight: true
manifold: true
acceptance ε: ±0.05 mm
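The spec lists V−E+F as 0 and genus as 2. One reading under which the two agree, offered here as an assumption rather than something the page states, is that genus is summed over the two closed boundary surfaces of the tube (outer skin and cavity wall), each a torus with χ = 2 − 2g = 0. The sketch below checks that reading on a quad-grid tessellation and adds a tolerance helper for the ±0.05 mm acceptance band; all function names are hypothetical.

```python
def grid_torus_counts(n_major: int, n_minor: int) -> tuple:
    """(V, E, F) for a torus meshed as a periodic n_major x n_minor quad grid."""
    vertices = n_major * n_minor
    edges = 2 * n_major * n_minor      # one "ring" and one "tube" edge per vertex
    faces = n_major * n_minor
    return vertices, edges, faces


def euler_characteristic(vertices: int, edges: int, faces: int) -> int:
    return vertices - edges + faces


def genus_from_euler(chi: int) -> int:
    """Genus of one closed orientable surface, from chi = 2 - 2g."""
    return (2 - chi) // 2


def within_tolerance(measured: float, nominal: float, eps: float = 0.05) -> bool:
    """True if a measured dimension lies inside nominal +/- eps (mm)."""
    return abs(measured - nominal) <= eps


if __name__ == "__main__":
    # Outer skin and cavity wall, each tessellated independently.
    surfaces = [grid_torus_counts(64, 16), grid_torus_counts(64, 12)]
    chis = [euler_characteristic(*s) for s in surfaces]
    print("total V-E+F:", sum(chis))
    print("summed genus:", sum(genus_from_euler(c) for c in chis))
    print("8.04 mm tube accepted:", within_tolerance(8.04, 8.0))
```

The grid resolution does not matter: any closed torus tessellation gives χ = 0, so the totals stay the same.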

§3 Reference render

canonical reference

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).
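A fingerprint like the one shown on this page can be checked against a file's bytes with a standard SHA-256 digest. The sketch below assumes the 16 hex characters shown are a truncated prefix of the full 64-character digest, which the page does not confirm. The reference STEP file is held out, so the usage line runs on placeholder bytes; the helper names are hypothetical.

```python
import hashlib


def sha256_fingerprint(data: bytes, length: int = 16) -> str:
    """First `length` hex characters of the SHA-256 digest of `data`."""
    return hashlib.sha256(data).hexdigest()[:length]


def matches_fingerprint(data: bytes, published: str) -> bool:
    """Compare data against a published 'sha256:<hex prefix>' string."""
    prefix = published.split(":", 1)[-1].strip().lower()
    return hashlib.sha256(data).hexdigest().startswith(prefix)


if __name__ == "__main__":
    # Placeholder bytes; a real check would read the STEP file in binary mode.
    candidate = b""
    print(sha256_fingerprint(candidate))
    print(matches_fingerprint(candidate, "sha256:fe1c0a8b22ff44b3"))
```

Reading the file in binary mode matters: line-ending conversion changes the bytes and therefore the digest.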

§4 Per-agent renders

reference + 20 agent outputs · scored against the held-out STEP file
| Agent | Source | Vol IoU | BREP |
|---|---|---|---|
| REFERENCE | canonical · ground truth | 1.000 | 100 |
| Human Baseline (Mech-E) | n=4 senior engineers | 0.793 | 7 |
| CAD-Coder R1 | CAD-Coder Labs (research) | 0.735 | 7 |
| Zoo Text-to-CAD | Zoo (KittyCAD) | 0.718 | 11 |
| Claude Opus 4.7 → CadQuery | Anthropic + CadQuery 2.4 | 0.718 | 8 |
| DeepCAD | Wu et al. 2021 (research) | 0.688 | 11 |
| OpenAI o4 (reasoning) → CadQuery | OpenAI + CadQuery 2.4 | 0.631 | 12 |
| GPT-5 → CadQuery | OpenAI + CadQuery 2.4 | 0.564 | 12 |
| Adam (CADcrush) | CADcrush | 0.563 | 11 |
| Claude Opus 4.7 → OpenSCAD | Anthropic + OpenSCAD 2024.06 | 0.550 | 0 |
| DeepSeek R1 (reasoning) → CadQuery | DeepSeek + CadQuery 2.4 | 0.510 | 12 |
| Claude Haiku 4.5 → CadQuery | Anthropic + CadQuery 2.4 | 0.506 | 10 |
| Gemini 2.5 Pro → OpenSCAD | Google + OpenSCAD 2024.06 | 0.449 | 0 |
| Claude Sonnet 4.6 → CadQuery | Anthropic + CadQuery 2.4 | 0.449 | 15 |
| Hunyuan3D-2 | Tencent | 0.443 | 11 |
| GPT-5 Mini → OpenSCAD | OpenAI + OpenSCAD 2024.06 | 0.406 | 16 |
| Trellis 3D | Microsoft Research | 0.382 | 0 |
| Gemini 2.5 Flash → CadQuery | Google + CadQuery 2.4 | 0.374 | 13 |
| Qwen3 Coder → CadQuery | Alibaba + CadQuery 2.4 | 0.331 | 14 |
| Spline AI | Spline.design | 0.307 | 0 |
| Llama 3.3 70B → OpenSCAD | Meta + OpenSCAD 2024.06 | 0.299 | 19 |

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.
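To give a feel for how a "missing features" degradation moves volumetric IoU on this part, the sketch below works two idealised cases in closed form. When one solid lies entirely inside the other, IoU is the ratio of the smaller volume to the larger. Both cases are hypothetical illustrations built from the prompt's dimensions; they are not reconstructions of any agent's output or of the benchmark's scoring code.

```python
import math

MAJOR_RADIUS = 50.0   # mm, half the 100 mm mean ring diameter
TUBE_OUTER = 4.0      # mm, half the 8 mm tube outer diameter
TUBE_INNER = 3.0      # mm, outer radius minus the 1 mm wall


def torus_volume(major_radius: float, minor_radius: float) -> float:
    """Volume of a solid torus, V = 2 * pi^2 * R * r^2."""
    return 2.0 * math.pi ** 2 * major_radius * minor_radius ** 2


def shell_volume(major_radius: float, outer: float, inner: float) -> float:
    """Wall-material volume of a hollow torus."""
    return torus_volume(major_radius, outer) - torus_volume(major_radius, inner)


def nested_iou(volume_a: float, volume_b: float) -> float:
    """IoU of two solids where one fully contains the other."""
    return min(volume_a, volume_b) / max(volume_a, volume_b)


reference = shell_volume(MAJOR_RADIUS, TUBE_OUTER, TUBE_INNER)

# Case 1: cavity omitted entirely, so the output is a solid torus.
iou_solid = nested_iou(reference, torus_volume(MAJOR_RADIUS, TUBE_OUTER))

# Case 2: wall modelled 2 mm thick instead of 1 mm (cavity radius 2 mm).
iou_thick = nested_iou(reference, shell_volume(MAJOR_RADIUS, TUBE_OUTER, 2.0))

if __name__ == "__main__":
    print(f"solid torus vs reference:  {iou_solid:.4f}")   # 7/16
    print(f"2 mm wall vs reference:    {iou_thick:.4f}")   # 7/12
```

Because the wall is thin relative to the tube, dropping the cavity more than doubles the material volume, so an output that looks identical from outside can lose over half its IoU.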

§5 Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
| Agent | Vol IoU | Bidirectional Chamfer | Watert. | Manif. | P@1 | p50 latency | cost |
|---|---|---|---|---|---|---|---|
| Human Baseline (Mech-E) | 0.793 | 0.110 |  | 0.973 | 0.000 | 524.4s | $5.991 |
| CAD-Coder R1 | 0.735 | 0.123 |  | 0.953 | 0.000 | 4.6s | $0.004 |
| Zoo Text-to-CAD | 0.718 | 0.110 |  | 0.962 | 0.000 | 6.3s | $0.198 |
| Claude Opus 4.7 → CadQuery | 0.718 | 0.116 |  | 0.966 | 0.000 | 29.2s | $0.323 |
| DeepCAD | 0.688 | 0.159 |  | 0.951 | 0.000 | 4.7s | $0.023 |
| OpenAI o4 (reasoning) → CadQuery | 0.631 | 0.157 |  | 0.940 | 0.000 | 121.8s | $0.952 |
| GPT-5 → CadQuery | 0.564 | 0.154 |  | 0.942 | 0.000 | 37.5s | $0.244 |
| Adam (CADcrush) | 0.563 | 0.158 |  | 0.933 | 0.000 | 11.6s | $0.284 |
| Claude Opus 4.7 → OpenSCAD | 0.550 | 0.169 |  | 0.932 | 0.000 | 39.3s | $0.306 |
| DeepSeek R1 (reasoning) → CadQuery | 0.510 | 0.150 |  | 0.931 | 0.000 | 85.7s | $0.040 |
| Claude Haiku 4.5 → CadQuery | 0.506 | 0.162 |  | 0.922 | 0.000 | 7.7s | $0.020 |
| Gemini 2.5 Pro → OpenSCAD | 0.449 | 0.187 |  | 0.920 | 0.000 | 36.4s | $0.095 |
| Claude Sonnet 4.6 → CadQuery | 0.449 | 0.177 |  | 0.921 | 0.000 | 23.0s | $0.080 |
| Hunyuan3D-2 | 0.443 | 0.181 | × | 0.913 | 0.000 | 29.7s | $0.080 |
| GPT-5 Mini → OpenSCAD | 0.406 | 0.240 | × | 0.908 | 0.000 | 14.6s | $0.011 |
| Trellis 3D | 0.382 | 0.217 | × | 0.911 | 0.000 | 14.9s | $0.042 |
| Gemini 2.5 Flash → CadQuery | 0.374 | 0.229 | × | 0.909 | 0.000 | 9.9s | $0.017 |
| Qwen3 Coder → CadQuery | 0.331 | 0.245 | × | 0.902 | 0.000 | 21.5s | $0.034 |
| Spline AI | 0.307 | 0.267 | × | 0.894 | 0.000 | 10.1s | $0.047 |
| Llama 3.3 70B → OpenSCAD | 0.299 | 0.266 | × | 0.895 | 0.000 | 25.0s | $0.017 |
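The Bidirectional Chamfer column measures how far apart two surfaces are when each is sampled as a point cloud. The page does not define the exact variant it uses (squared or unsquared distances, sum or mean of the two directions, sampling density, units), so the sketch below picks one common form as an assumption: the mean of the two directed mean nearest-neighbour distances, unsquared. It is a brute-force O(n·m) illustration, not the benchmark's scorer.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def directed_mean_nearest(src: Sequence[Point], dst: Sequence[Point]) -> float:
    """Mean distance from each point in src to its nearest neighbour in dst."""
    return sum(min(math.dist(p, q) for q in dst) for p in src) / len(src)


def bidirectional_chamfer(a: Sequence[Point], b: Sequence[Point]) -> float:
    """Average of the a->b and b->a directed distances (assumed variant)."""
    return 0.5 * (directed_mean_nearest(a, b) + directed_mean_nearest(b, a))


if __name__ == "__main__":
    cloud_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    cloud_b = [(0.0, 0.0, 0.0), (1.0, 0.0, 1.0)]
    print(bidirectional_chamfer(cloud_a, cloud_b))
    print(bidirectional_chamfer(cloud_a, cloud_a))
```

Taking both directions matters for this part: a solid torus covers every point of the reference outer skin, so the reference-to-output direction alone would hide the missing cavity wall.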