Preprint · cad-bench/v0.5 · sweep 2026-04-12open · MIT
CAD·Benchv0.5
← all tasks
SURF-002 · Free-form Surfaces · difficulty 5/5

Compressor blade (NACA 65-(12)10)

sha256:8de14b209c01ac72

§1Prompt verbatim

Single compressor blade: 60 mm chord, 80 mm span, 12° twist root-to-tip, NACA-65-(12)10 thickness distribution along the camber line. G2 continuous suction and pressure surfaces, sharp trailing edge at 0.3 mm.

§2Ground-truth spec

shells1
watertighttrue
manifoldtrue
acceptance ε±0.05 mm

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentBidirectional ChamferHausdorff p95NormConsWatert.Manif.P@1p50latencycost
Human Baseline (Mech-E)0.1260.6130.9120.9611.000773.4s$6.746
Trellis 3D0.1400.8390.8400.9340.00011.2s$0.041
Spline AI0.1530.7910.8310.9280.0005.6s$0.039
Zoo Text-to-CAD0.1730.8540.8060.9210.0006.6s$0.202
GPT-5 → CadQuery0.2061.1110.771×0.9060.00046.9s$0.174
Claude Opus 4.7 → OpenSCAD0.2161.0390.758×0.9060.00037.6s$0.306
Gemini 2.5 Pro → OpenSCAD0.2271.2960.748×0.9000.00025.9s$0.086
Claude Opus 4.7 → CadQuery0.3021.3590.731×0.8940.00038.6s$0.320
Adam (CADcrush)0.4322.0570.682×0.8790.00011.4s$0.315
DeepCAD0.8114.0650.643×0.8640.0003.6s$0.018