CAD-Bench
← back
MECH-014 · Parametric Mechanical Parts · difficulty 3/5

L-bracket with M6 + slotted hole

sha256:1f3a5e90c44b2210

§1Prompt verbatim

Right-angle L-bracket, leg lengths 60 mm and 40 mm, thickness 5 mm. Through-hole Ø 6.6 mm with 1.5 × 45° chamfer on the long leg, centred 30 mm from the bend. Slotted hole 8 × 16 mm on the short leg, centred 20 mm from the bend, slot major axis parallel to the bend. Position tolerance ±0.1 mm. Output STEP AP242.

§2Ground-truth spec

shells1
genus2
watertighttrue
manifoldtrue
acceptance ε±0.1 mm
featuresbend_R5, thru_hole_M6_clearance, chamfer_1.5x45, slot_8x16

§3Reference render

canonical reference · drag to orbit, scroll to zoom

Visualisation is rebuilt in-browser from the canonical parametric description. Scoring is performed against the held-out reference STEP file (sha-256 fingerprint above).

§4Per-agent renders

reference + 10 agent outputs · scored against the held-out STEP
vol IoU · BREP · manifold
canonical reference
REFERENCE
canonical · ground truth
1.000100
Human Baseline (Mech-E)
Human Baseline (Mech-E)
n=4 senior engineers
0.7637
OpenAI o4 (reasoning) → CadQuery
OpenAI o4 (reasoning) → CadQuery
OpenAI + CadQuery 2.4
0.68111
GPT-5 → CadQuery
GPT-5 → CadQuery
OpenAI + CadQuery 2.4
0.66310
Zoo Text-to-CAD
Zoo Text-to-CAD
Zoo (KittyCAD)
0.63612
Gemini 2.5 Flash → CadQuery
Gemini 2.5 Flash → CadQuery
Google + CadQuery 2.4
0.63011
Adam (CADcrush)
Adam (CADcrush)
CADcrush
0.5919
Claude Opus 4.7 → OpenSCAD
Claude Opus 4.7 → OpenSCAD
Anthropic + OpenSCAD 2024.06
0.5600
Claude Haiku 4.5 → CadQuery
Claude Haiku 4.5 → CadQuery
Anthropic + CadQuery 2.4
0.51310
CAD-Coder R1
CAD-Coder R1
CAD-Coder Labs (research)
0.50111
Claude Opus 4.7 → CadQuery
Claude Opus 4.7 → CadQuery
Anthropic + CadQuery 2.4
0.50014
Gemini 2.5 Pro → OpenSCAD
Gemini 2.5 Pro → OpenSCAD
Google + OpenSCAD 2024.06
0.4550
DeepSeek R1 (reasoning) → CadQuery
DeepSeek R1 (reasoning) → CadQuery
DeepSeek + CadQuery 2.4
0.44410
Qwen3 Coder → CadQuery
Qwen3 Coder → CadQuery
Alibaba + CadQuery 2.4
0.42711
Llama 3.3 70B → OpenSCAD
Llama 3.3 70B → OpenSCAD
Meta + OpenSCAD 2024.06
0.39713
GPT-5 Mini → OpenSCAD
GPT-5 Mini → OpenSCAD
OpenAI + OpenSCAD 2024.06
0.39115
DeepCAD
DeepCAD
Wu et al. 2021 (research)
0.28322
Trellis 3D
Trellis 3D
Microsoft Research
0.2030
Spline AI
Spline AI
Spline.design
0.0000
no manifold solid produced
Claude Sonnet 4.6 → CadQuery
Claude Sonnet 4.6 → CadQuery
Anthropic + CadQuery 2.4
66
no manifold solid produced
Hunyuan3D-2
Hunyuan3D-2
Tencent
5

Each tile is rebuilt from the canonical parametric description and degraded to match the agent's scored profile (tessellation, non-manifold face removal, dimension scale jitter, missing features). Image-only diffusion models render visually plausible meshes but score in the single digits on BREP fidelity — the geometry is not a manifold solid even when the render reads clean.

§5Per-agent metrics

ranked by Vol IoU · same data as the leaderboard, restricted to this task
AgentWatert.Manif.Named-Dimension RMSEGD&T ComplianceFeatRecP@1p50latencycost
Human Baseline (Mech-E)0.9600.1290.8290.9431.000769.7s$5.218
OpenAI o4 (reasoning) → CadQuery0.9540.2480.6930.7850.00086.5s$1.319
GPT-5 → CadQuery0.9490.2050.5890.6300.00051.2s$0.179
Zoo Text-to-CAD0.9390.1400.6370.7090.0004.9s$0.170
Gemini 2.5 Flash → CadQuery0.9440.2900.4400.5350.00012.2s$0.023
Adam (CADcrush)0.9410.1860.6100.7030.0009.8s$0.323
Claude Opus 4.7 → OpenSCAD0.9380.2690.4610.5830.00023.3s$0.319
Claude Haiku 4.5 → CadQuery0.9290.3620.3690.4830.0009.7s$0.017
CAD-Coder R10.9280.3160.4240.6360.0007.0s$0.005
Claude Opus 4.7 → CadQuery0.9270.2430.6220.7700.00029.4s$0.366
Gemini 2.5 Pro → OpenSCAD0.9200.3200.4030.5310.00026.0s$0.095
DeepSeek R1 (reasoning) → CadQuery0.9190.2840.5060.6320.00089.9s$0.043
Qwen3 Coder → CadQuery0.9140.2540.4070.6280.00014.2s$0.029
Llama 3.3 70B → OpenSCAD×0.9070.3740.2810.4440.00018.0s$0.018
GPT-5 Mini → OpenSCAD×0.9130.3160.2870.4080.00017.1s$0.011
DeepCAD×0.8920.3030.3450.4870.0003.4s$0.018
Trellis 3D×0.8810.5580.0540.2130.00013.3s$0.047
Spline AI×0.8500.5790.0270.0850.0008.8s$0.044
Claude Sonnet 4.6 → CadQuery
kernel error: BRepCheck_NotClosed
×0.0000.00015.5s$0.062
Hunyuan3D-2
kernel error: BRepCheck_NotClosed
×0.0000.00040.4s$0.078