Preprint · cad-bench/v0.5 · sweep 2026-04-12 · open · MIT
CAD·Bench v0.5
BREP-004 · BREP Fidelity · difficulty 4/5

NURBS-handle goblet (G2 swept loft)

sha256:9bb19af0271ac46e

§1 Prompt · verbatim

Goblet: cup is a swept-revolved NURBS surface (12 control points along generatrix), stem is a 10 mm chamfered cylinder, base is a Ø 70 × 5 mm disc. Cup-to-stem and stem-to-base junctions must be G2 continuous. Export AP242.

§2 Ground-truth spec

shells: 1
watertight: true
manifold: true
acceptance ε: ±0.05 mm
features: g2_continuity_cup_stem, g2_continuity_stem_base
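
The two g2_continuity features require that position, tangent direction, and curvature all agree across a junction. A minimal sketch of such a check on a planar profile curve follows, assuming each side of the junction supplies its point, first derivative, and second derivative at the join. The function names and the tolerance are hypothetical and are not taken from the benchmark's scorer.

```python
import math

def curvature(d1, d2):
    """Signed curvature of a planar curve from its first and second derivatives."""
    (x1, y1), (x2, y2) = d1, d2
    return (x1 * y2 - y1 * x2) / math.hypot(x1, y1) ** 3

def unit(v):
    """Normalise a 2D vector so tangents are compared by direction only."""
    n = math.hypot(*v)
    return (v[0] / n, v[1] / n)

def is_g2(left, right, tol=1e-6):
    """left/right: (point, first_derivative, second_derivative) at the junction."""
    (p_l, d1_l, d2_l), (p_r, d1_r, d2_r) = left, right
    if math.dist(p_l, p_r) > tol:              # G0: positions coincide
        return False
    if math.dist(unit(d1_l), unit(d1_r)) > tol:  # G1: tangent directions coincide
        return False
    # G2: curvatures coincide (independent of parametrisation speed)
    return abs(curvature(d1_l, d2_l) - curvature(d1_r, d2_r)) <= tol

# Straight segment along x, evaluated at its end point.
line_end = ((0.0, 0.0), (1.0, 0.0), (0.0, 0.0))
# Curve (t + t^2, t^3) at t = 0: accelerates along the tangent, curvature 0.
blend_start = ((0.0, 0.0), (1.0, 0.0), (2.0, 0.0))
# Circular arc (5 sin t, 5 - 5 cos t) at t = 0: radius 5, curvature 1/5.
arc_start = ((0.0, 0.0), (5.0, 0.0), (0.0, 5.0))

print(is_g2(line_end, blend_start))  # True
print(is_g2(line_end, arc_start))    # False: tangent matches, curvature jumps
```

The line-to-arc case is the typical failure mode of a plain fillet: it is tangent-continuous (G1), but the curvature jumps from 0 to 1/r at the join.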

§3 Reference render

canonical reference

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).
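
The fingerprint shown for this task is 16 hex characters, shorter than a full 64-character SHA-256 digest. The sketch below shows how someone holding the reference file might check it, assuming (this is not confirmed by the page) that the displayed value is a prefix of the hex digest. The helper names are hypothetical.

```python
import hashlib

def sha256_hex(path, chunk_size=1 << 20):
    """Hex SHA-256 digest of a file, read in chunks to bound memory use."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

def matches_fingerprint(path, fingerprint):
    """True if the file digest starts with the (possibly truncated) fingerprint.

    Assumption: a short fingerprint is a prefix of the full hex digest.
    """
    prefix = fingerprint.removeprefix("sha256:").lower()
    return sha256_hex(path).startswith(prefix)
```

A 16-character prefix carries 64 bits, which is enough to catch accidental file mix-ups but is weaker than the full digest against deliberate collisions.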

§4 Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity: the geometry is not a manifold solid even when the render reads clean.
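
To make the vol IoU score concrete, here is a minimal sketch of volumetric intersection-over-union, assuming both solids have already been voxelised into sets of occupied integer cells. The page does not describe how the benchmark voxelises or samples the solids, so `volume_iou` and `box` are illustrative names only.

```python
def volume_iou(a, b):
    """Intersection-over-union of two solids given as sets of voxel indices."""
    union = len(a | b)
    if union == 0:
        return 1.0  # convention chosen here: two empty solids count as identical
    return len(a & b) / union

def box(nx, ny, nz):
    """Axis-aligned block of occupied voxels anchored at the origin."""
    return {(i, j, k) for i in range(nx) for j in range(ny) for k in range(nz)}

reference = box(10, 10, 10)  # 1000 voxels
jittered = box(10, 10, 9)    # same block, 10% short along one axis

print(volume_iou(reference, jittered))  # 0.9
```

In this toy case a 10% error along a single axis already costs 0.1 IoU, which gives a rough sense of how sensitive the score is to dimension scale jitter.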

§5 Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
Agent · Watert. · Manif. · Euler-Poincaré Compliance · STEP RT · FeatRec · P@1 · p50 latency · cost
Zoo Text-to-CAD · 0.963 · 0.097 · 0.722 · 1.000 · 6.2s · $0.188
Human Baseline (Mech-E) · 0.949 · 0.113 · 0.896 · 0.000 · 568.6s · $6.847
Adam (CADcrush) · 0.944 · 0.124 · 0.713 · 0.000 · 9.6s · $0.298
DeepCAD · 0.931 · 0.109 · 0.447 · 0.000 · 4.2s · $0.024
Claude Opus 4.7 → CadQuery · 0.930 · 0.088 · 0.760 · 0.000 · 44.9s · $0.347
GPT-5 → CadQuery · 0.918 · × · 0.135 · 0.680 · 0.000 · 40.2s · $0.174
Claude Opus 4.7 → OpenSCAD · × · 0.862 · × · 0.523 · 0.000 · 37.8s · $0.344
Trellis 3D · × · 0.852 · × · 0.201 · 0.000 · 12.2s · $0.056
Gemini 2.5 Pro → OpenSCAD · × · 0.850 · × · 0.480 · 0.000 · 26.1s · $0.075
Spline AI · × · 0.850 · × · 0.099 · 0.000 · 7.1s · $0.043
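
The Watert., Manif., and Euler-Poincaré Compliance columns refer to topological validity. The sketch below shows the analogous checks on a triangle mesh; this is an assumption made for illustration, since the benchmark scores BREP solids and its exact tests are not given here. An edge used by only one face is an open boundary (not watertight), an edge used by more than two faces is non-manifold, and a single closed genus-0 shell such as the goblet should satisfy V − E + F = 2.

```python
from collections import Counter

def edge_counts(triangles):
    """Count how many triangles use each undirected edge."""
    counts = Counter()
    for a, b, c in triangles:
        for u, v in ((a, b), (b, c), (c, a)):
            counts[frozenset((u, v))] += 1
    return counts

def is_watertight(triangles):
    """No boundary edges: every edge is shared by at least two faces."""
    return all(n >= 2 for n in edge_counts(triangles).values())

def is_edge_manifold(triangles):
    """No edge is shared by more than two faces."""
    return all(n <= 2 for n in edge_counts(triangles).values())

def euler_characteristic(triangles):
    """V - E + F for the mesh; equals 2 for one closed genus-0 shell."""
    vertices = {v for tri in triangles for v in tri}
    return len(vertices) - len(edge_counts(triangles)) + len(triangles)

tetra = [(0, 1, 2), (0, 3, 1), (1, 3, 2), (2, 3, 0)]
print(is_watertight(tetra), is_edge_manifold(tetra), euler_characteristic(tetra))
# True True 2

opened = tetra[:-1]              # drop one face: three boundary edges appear
finned = tetra + [(0, 1, 4)]     # extra fin: edge {0, 1} now has three faces
print(is_watertight(opened), is_edge_manifold(finned))
# False False
```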