Preprint · cad-bench/v0.5 · sweep 2026-04-12open · MIT
CAD·Benchv0.5
← all tasks
REVENG-002 · Reverse Engineering · difficulty 4/5

Three-view ortho → bracket

sha256:30d72b41ae9c0014

§1Prompt verbatim

From the supplied 1:1 front/top/side dimensioned drawing (PNG, 600 dpi) reproduce the part. All dimensions and tolerances on the drawing are authoritative.

§2Ground-truth spec

shells1
watertighttrue
manifoldtrue
acceptance ε±0.1 mm

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentVol IoUWatert.Manif.Named-Dimension RMSEFeatRecP@1p50latencycost
Claude Opus 4.7 → CadQuery0.6760.9560.2320.7090.00037.2s$0.328
Human Baseline (Mech-E)0.6460.9430.1110.9370.000844.1s$6.605
Trellis 3D0.5500.9280.5080.2080.00010.1s$0.055
GPT-5 → CadQuery0.4420.9170.2450.7330.00042.0s$0.229
Claude Opus 4.7 → OpenSCAD0.4150.9130.2180.5990.00034.2s$0.341
Gemini 2.5 Pro → OpenSCAD0.4030.9130.2760.5080.00025.9s$0.101
Zoo Text-to-CAD0.392×0.9060.2010.7380.0005.6s$0.188
Adam (CADcrush)0.316×0.8960.1650.7360.0006.7s$0.228
DeepCAD0.092×0.8640.3000.4280.0004.1s$0.021
Spline AI
kernel error: BRepCheck_NotClosed
0.000×0.0000.0008.3s$0.036