Preprint · cad-bench/v0.5 · sweep 2026-04-12 · open · MIT
CAD·Bench v0.5
REVENG-009 · Reverse Engineering · difficulty 5/5

Three-view ortho → housing with cores

sha256:60189cae3b771acc

§1 Prompt verbatim

Reproduce the 80 × 60 × 40 mm housing from the supplied multi-view drawing including all M4 tapped holes, draft, and ribs. Drawing follows ASME Y14.5-2018 third-angle convention.

§2 Ground-truth spec

shells: 1
watertight: true
manifold: true
acceptance ε: ±0.1 mm
features: thread_M4_x6, rib_x4, draft_1deg
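
The spec fields above lend themselves to a mechanical acceptance check. The following is a minimal sketch, assuming that ε is an absolute per-dimension tolerance in millimetres. The `GroundTruthSpec` class, the `within_tolerance` helper, and the dimension names `length`, `width`, `height` are hypothetical and are not taken from the benchmark harness.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GroundTruthSpec:
    """Hypothetical container mirroring the spec fields listed above."""
    shells: int = 1
    watertight: bool = True
    manifold: bool = True
    epsilon_mm: float = 0.1
    features: tuple = ("thread_M4_x6", "rib_x4", "draft_1deg")


# Nominal envelope from the task prompt (80 x 60 x 40 mm); key names are assumed.
NOMINAL_MM = {"length": 80.0, "width": 60.0, "height": 40.0}


def within_tolerance(measured_mm, spec, nominal_mm=NOMINAL_MM):
    """Return per-dimension pass/fail under an absolute +/- epsilon rule (assumption)."""
    return {
        name: abs(measured_mm[name] - nominal) <= spec.epsilon_mm
        for name, nominal in nominal_mm.items()
    }


spec = GroundTruthSpec()
print(within_tolerance({"length": 80.05, "width": 60.3, "height": 40.0}, spec))
```

Under this reading, a width of 60.3 mm fails because it is 0.3 mm off nominal, while 80.05 mm and 40.0 mm pass.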

§3 Reference render

[Figure: canonical reference]

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).
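
A fingerprint like the one above can be checked against a local copy of a file with the standard library alone. This is a minimal sketch, assuming the 16 hex characters shown on this page are the leading prefix of the full SHA-256 digest. The helper names and the idea of comparing by prefix are illustrative assumptions, not the benchmark's documented procedure.

```python
import hashlib

# Truncated digest as displayed on this page (assumed to be a leading prefix).
TASK_FINGERPRINT = "60189cae3b771acc"


def sha256_hex(path, chunk_size=1 << 20):
    """Stream a file through SHA-256 and return the full hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_fingerprint(path, fingerprint=TASK_FINGERPRINT):
    """True if the file's digest starts with the (possibly truncated) fingerprint."""
    return sha256_hex(path).startswith(fingerprint.lower())
```

A prefix match on 16 hex characters covers 64 bits of the digest, which is enough to catch accidental file mix-ups but is weaker than comparing the full hash.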

§4 Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.
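
To make the effect of dimension scale jitter on volume IoU concrete, here is a minimal voxel-based sketch. It assumes IoU is approximated by counting voxel centres on a regular grid. The benchmark's actual scoring runs against the held-out STEP file and may use exact boolean operations instead. The `voxelize` and `volume_iou` helpers, the 2 mm pitch, and the 10% uniform overscale are all illustrative assumptions.

```python
from itertools import product


def voxelize(inside, bounds_mm, pitch_mm):
    """Return the set of grid indices whose voxel centre satisfies inside(x, y, z)."""
    (x0, x1), (y0, y1), (z0, z1) = bounds_mm
    nx = int(round((x1 - x0) / pitch_mm))
    ny = int(round((y1 - y0) / pitch_mm))
    nz = int(round((z1 - z0) / pitch_mm))
    cells = set()
    for i, j, k in product(range(nx), range(ny), range(nz)):
        x = x0 + (i + 0.5) * pitch_mm
        y = y0 + (j + 0.5) * pitch_mm
        z = z0 + (k + 0.5) * pitch_mm
        if inside(x, y, z):
            cells.add((i, j, k))
    return cells


def volume_iou(a, b):
    """Intersection over union of two voxel sets; 0.0 if both are empty."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def box(lx, ly, lz):
    """Predicate for an axis-aligned box with one corner at the origin."""
    return lambda x, y, z: 0 <= x <= lx and 0 <= y <= ly and 0 <= z <= lz


bounds = ((0.0, 100.0), (0.0, 100.0), (0.0, 100.0))
reference = voxelize(box(80, 60, 40), bounds, pitch_mm=2.0)
jittered = voxelize(box(88, 66, 44), bounds, pitch_mm=2.0)  # 10% uniform overscale
print(round(volume_iou(reference, jittered), 3))  # → 0.751
```

Even with every feature present, a 10% uniform overscale of the bare envelope already drops the IoU to roughly 0.75, since the reference is fully contained in the scaled box and the ratio reduces to 1/1.1³.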

§5 Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
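| Agent | Vol IoU | Watertight | Manifold | Named-Dimension RMSE | FeatRec | P@1 | p50 latency | Cost |
|---|---|---|---|---|---|---|---|---|
| Human Baseline (Mech-E) | 0.603 | | 0.946 | 0.073 | 0.932 | 0.000 | 556.1s | $6.762 |
| GPT-5 → CadQuery | 0.550 | | 0.938 | 0.226 | 0.704 | 0.000 | 38.1s | $0.205 |
| Trellis 3D | 0.528 | | 0.925 | 0.526 | 0.208 | 0.000 | 14.3s | $0.048 |
| Claude Opus 4.7 → CadQuery | 0.518 | | 0.925 | 0.189 | 0.767 | 0.000 | 26.8s | $0.305 |
| Gemini 2.5 Pro → OpenSCAD | 0.449 | | 0.921 | 0.248 | 0.487 | 0.000 | 25.6s | $0.094 |
| Zoo Text-to-CAD | 0.377 | × | 0.910 | 0.182 | 0.769 | 0.000 | 6.3s | $0.209 |
| Claude Opus 4.7 → OpenSCAD | 0.314 | × | 0.898 | 0.210 | 0.582 | 0.000 | 39.0s | $0.360 |
| Adam (CADcrush) | 0.282 | × | 0.895 | 0.254 | 0.705 | 0.000 | 9.0s | $0.231 |
| Spline AI | 0.204 | × | 0.881 | 0.521 | 0.086 | 0.000 | 7.8s | $0.032 |
| DeepCAD | 0.099 | × | 0.866 | 0.303 | 0.446 | 0.000 | 4.3s | $0.024 |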
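
The page does not define how Named-Dimension RMSE and FeatRec are computed. The sketch below shows one plausible reading: RMSE over the signed errors of each named dimension, and feature recall as the fraction of ground-truth feature tags recovered. Both definitions, the function names, and the sample measurements are assumptions for illustration and are not data from this sweep.

```python
import math


def named_dimension_rmse(reference, measured):
    """Root-mean-square error over the dimension names in `reference`.

    Raises KeyError if `measured` lacks a named dimension (assumed behaviour).
    """
    errors = [measured[name] - value for name, value in reference.items()]
    return math.sqrt(sum(e * e for e in errors) / len(errors))


def feature_recall(expected, found):
    """Fraction of expected feature tags that appear in `found`."""
    expected = set(expected)
    return len(expected & set(found)) / len(expected) if expected else 0.0


# Illustrative inputs only: nominal envelope from the prompt, made-up measurements.
reference_mm = {"length": 80.0, "width": 60.0, "height": 40.0}
measured_mm = {"length": 80.3, "width": 59.6, "height": 40.0}
print(round(named_dimension_rmse(reference_mm, measured_mm), 4))

expected_tags = ["thread_M4_x6", "rib_x4", "draft_1deg"]
print(round(feature_recall(expected_tags, ["rib_x4", "draft_1deg"]), 3))
```

Treating each feature tag as a single unit is a simplification. A harness could equally count the six tapped holes and four ribs as individual instances, which would change the recall value.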