Preprint · cad-bench/v0.5 · sweep 2026-04-12 · open · MIT
CAD·Bench v0.5
ASM-011 · Assembly & Mating · difficulty 4/5

Dovetail slide (60° flanks)

sha256:773bb55ecd0c1a2e

§1 Prompt verbatim

Male dovetail per ANSI B5.50: 30 mm tall, 50 mm wide at the base, 60° flank angle, 100 mm long. Held-out female slot is 0.04 mm wider on each flank for free running. Assembled clearance must be 0.04 ± 0.01 mm normal to each flank.
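The prompt's dimensions fix the cross-section only once two conventions are chosen. The sketch below assumes, because the prompt does not say, that the 50 mm base is the wider face and that the 60° flank angle is measured from the base plane. The function names are illustrative and not part of the benchmark. It also converts a clearance measured normal to a flank into the horizontal flank shift that would produce it.

```python
import math

def dovetail_profile(base_width=50.0, height=30.0, flank_deg=60.0):
    """Corner points (x, y) in mm of the male dovetail cross-section.

    Assumptions (not stated in the prompt): the base is the wider face,
    it lies on y = 0, and the flank angle is measured from the base plane.
    """
    # Horizontal run covered by one flank over the full height.
    run = height / math.tan(math.radians(flank_deg))
    top_width = base_width - 2.0 * run
    if top_width <= 0:
        raise ValueError("flanks meet before reaching the stated height")
    half_b, half_t = base_width / 2.0, top_width / 2.0
    return [(-half_b, 0.0), (half_b, 0.0), (half_t, height), (-half_t, height)]

def horizontal_shift(normal_clearance=0.04, flank_deg=60.0):
    """Horizontal flank shift (mm) giving the requested gap normal to the flank."""
    return normal_clearance / math.sin(math.radians(flank_deg))

profile = dovetail_profile()
top_width = profile[2][0] - profile[3][0]
print(f"top width        = {top_width:.3f} mm")           # 15.359 mm
print(f"horizontal shift = {horizontal_shift():.4f} mm")  # 0.0462 mm
```

Under these assumptions a 0.04 mm gap normal to a 60° flank corresponds to a horizontal shift of about 0.046 mm per flank, so the two readings of "0.04 mm wider on each flank" are not the same geometry.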

§2 Ground-truth spec

shells: 1
watertight: true
manifold: true
acceptance ε: ±0.01 mm
clearance: [0.03, 0.05] mm
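The clearance band above is the nominal 0.04 mm with the ±0.01 mm acceptance ε applied. A minimal sketch of that check follows, assuming the gap on each flank has already been measured normal to the flank. The helper names and the small floating-point guard `eps` are illustrative choices, not the benchmark's scorer.

```python
import math

CLEARANCE_BAND_MM = (0.03, 0.05)  # band taken from the ground-truth spec

def flank_passes(gap_mm, band=CLEARANCE_BAND_MM, eps=1e-9):
    """True if one flank's normal gap lies inside the band (eps guards rounding)."""
    lo, hi = band
    return lo - eps <= gap_mm <= hi + eps

def assembly_passes(gaps_mm):
    """True only if at least one gap is given and every flank gap is in band."""
    gaps = list(gaps_mm)
    return bool(gaps) and all(flank_passes(g) for g in gaps)

# A slot whose flanks were shifted 0.04 mm horizontally instead of normally
# leaves a smaller normal gap: 0.04 * sin(60 deg).
horizontal_reading = 0.04 * math.sin(math.radians(60.0))

print(assembly_passes([0.04, 0.04]))             # True
print(assembly_passes([0.04, 0.06]))             # False
print(assembly_passes([horizontal_reading] * 2)) # True
```

In this sketch the horizontally shifted slot still passes, because its normal gap of roughly 0.035 mm sits inside the band.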

§3 Reference render

canonical reference

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4 Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and then degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity: the geometry is not a manifold solid even when the render reads clean.
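To give a feel for how dimension scale jitter alone moves a volume IoU score, the sketch below uses a geometric fact rather than the benchmark's scorer, which this page does not describe beyond scoring against the held-out STEP file. For a convex solid such as a trapezoidal prism, a copy scaled uniformly by a factor s about an interior point either contains the original or is contained in it, so the IoU reduces to the ratio of the two volumes. The function name is hypothetical.

```python
def iou_uniform_scale(s):
    """Volume IoU between a convex solid and a copy scaled by s about an interior point.

    The smaller of the two solids lies inside the larger, so
    IoU = V_small / V_large = min(s, 1/s) ** 3.
    """
    if s <= 0:
        raise ValueError("scale factor must be positive")
    ratio = min(s, 1.0 / s)
    return ratio ** 3

for s in (1.0, 0.99, 0.98, 1.05):
    print(f"scale {s:.2f} -> IoU {iou_uniform_scale(s):.4f}")
```

Under this idealisation a 2 % uniform shrink already costs about six points of IoU, while leaving the part watertight and manifold.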

§5 Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
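| Agent | Watert. | Manif. | FeatRec | Mating Clearance | Fit-Class Compliance | P@1 | p50 latency | cost |
|---|---|---|---|---|---|---|---|---|
| Human Baseline (Mech-E) | | 0.971 | 0.935 | 0.745 | 0.824 | 1.000 | 782.9s | $5.630 |
| Zoo Text-to-CAD | | 0.948 | 0.790 | 0.720 | 0.566 | 0.000 | 6.2s | $0.208 |
| Claude Opus 4.7 → CadQuery | | 0.942 | 0.708 | 0.589 | 0.433 | 0.000 | 42.2s | $0.390 |
| Adam (CADcrush) | | 0.931 | 0.686 | 0.664 | 0.461 | 0.000 | 9.0s | $0.305 |
| GPT-5 → CadQuery | | 0.923 | 0.657 | 0.544 | 0.375 | 0.000 | 45.5s | $0.211 |
| Claude Opus 4.7 → OpenSCAD | | 0.917 | 0.582 | 0.460 | 0.259 | 0.000 | 41.4s | $0.372 |
| DeepCAD | × | 0.902 | 0.486 | 0.385 | 0.173 | 0.000 | 5.2s | $0.022 |
| Gemini 2.5 Pro → OpenSCAD | × | 0.894 | 0.510 | 0.454 | 0.196 | 0.000 | 27.1s | $0.095 |
| Trellis 3D | × | 0.856 | 0.190 | 0.085 | 0.005 | 0.000 | 14.0s | $0.041 |
| Spline AI | × | 0.853 | 0.098 | 0.043 | 0.001 | 0.000 | 6.1s | $0.039 |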