Model Security Scorecards
Attack success rate analysis across scanned models
Models Scanned: 23
Avg Attack Success Rate (ASR): 18.76%
Total Scans: 527
| ASR | Vendor | Model | Status | Risk Level | Scans | Reports | Probes | Last Scan | Vulns | Risk Score |
|---|---|---|---|---|---|---|---|---|---|---|
| 63.7% | Meta | meta-llama-4-scout | | Critical Risk | 29 | 369 | 502,542 | about 2 months ago | 402 | 50.19% |
| 54.4% | Meta | meta-llama-3-3 | Retired | Critical Risk | 29 | 365 | 504,430 | about 2 months ago | 333 | 41.57% |
| 39.6% | OpenAI | openai-gpt-4-1 | Retired | High Risk | 28 | 36 | 59,148 | about 2 months ago | 188 | 23.47% |
| 37.0% | Twitter / X | x-grok-3 | Retired | High Risk | 29 | 124 | 171,368 | about 2 months ago | 52 | 6.49% |
| 36.1% | Meta | meta-llama-4-maverick | | High Risk | 29 | 369 | 503,046 | about 2 months ago | 348 | 43.45% |
| 32.0% | OpenAI | openai-gpt-5-2 | | High Risk | 29 | 284 | 398,476 | about 2 months ago | 47 | 5.87% |
| 29.9% | | qwen-3-235b | | High Risk | 4 | 4 | 6,572 | 3 months ago | 0 | 0.0% |
| 18.9% | | grok-4-20 | | Moderate Risk | 13 | 13 | 21,359 | about 2 months ago | 0 | 0.0% |
| 18.9% | | grok-4-20-beta | | Moderate Risk | 15 | 15 | 24,645 | 2 months ago | 0 | 0.0% |
| 17.1% | OpenAI | openai-gpt-5-mini | | Moderate Risk | 29 | 125 | 172,750 | about 2 months ago | 138 | 17.23% |
| 16.6% | | google-gemma-4-31b | | Moderate Risk | 8 | 19 | 530,854 | about 2 months ago | 0 | 0.0% |
| 13.8% | Anthropic | claude-4-5-haiku | | Moderate Risk | 29 | 405 | 560,103 | about 2 months ago | 100 | 12.48% |
| 13.8% | Twitter / X | x-grok-4 | | Moderate Risk | 29 | 244 | 368,528 | about 2 months ago | 279 | 34.83% |
| 7.2% | | claude-sonnet-4-6 | | Low Risk | 13 | 13 | 21,359 | about 2 months ago | 0 | 0.0% |
| 7.0% | OpenAI | openai-o4 | Retired | Low Risk | 9 | 15 | 20,730 | 2 months ago | 14 | 1.75% |
| 5.9% | Anthropic | claude-4-6-opus | | Low Risk | 29 | 261 | 364,095 | about 2 months ago | 1 | 0.12% |
| 4.9% | Anthropic | claude-4-5-sonnet | | Low Risk | 29 | 372 | 514,104 | about 2 months ago | 49 | 6.12% |
| 4.3% | OpenAI | openai-gpt-5-pro | | Low Risk | 29 | 107 | 147,874 | about 2 months ago | 2 | 0.25% |
| 4.2% | Anthropic | claude-4-opus | Retired | Low Risk | 20 | 20 | 24,640 | 2 months ago | 14 | 1.75% |
| 3.4% | Anthropic | claude-4-5-opus | | Low Risk | 29 | 124 | 171,368 | about 2 months ago | 40 | 4.99% |
| 2.9% | OpenAI | openai-gpt-5-nano | | Low Risk | 29 | 120 | 165,840 | about 2 months ago | 111 | 13.86% |
| 0.0% | | openrouter-openai-compatible | | Low Risk | 20 | 20 | 20 | 2 months ago | 0 | 0.0% |
| 0.0% | OpenAI | openai-o4-mini-high | Retired | Low Risk | 20 | 20 | 40 | 2 months ago | 99 | 12.36% |
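
The headline figures can be cross-checked against the per-model values listed above. The sketch below is only an illustration: it assumes the leading percentage on each scorecard is that model's ASR and that the dashboard's average is an unweighted mean over models, neither of which the page states explicitly. The names `SCORECARDS` and `summarize` are hypothetical helpers, not part of any scanner API.

```python
# (model, ASR %, scans) copied from the scorecards above.
# Assumption: the first percentage on each card is the model's ASR.
SCORECARDS = [
    ("meta-llama-4-scout", 63.7, 29),
    ("meta-llama-3-3", 54.4, 29),
    ("openai-gpt-4-1", 39.6, 28),
    ("x-grok-3", 37.0, 29),
    ("meta-llama-4-maverick", 36.1, 29),
    ("openai-gpt-5-2", 32.0, 29),
    ("qwen-3-235b", 29.9, 4),
    ("grok-4-20", 18.9, 13),
    ("grok-4-20-beta", 18.9, 15),
    ("openai-gpt-5-mini", 17.1, 29),
    ("google-gemma-4-31b", 16.6, 8),
    ("claude-4-5-haiku", 13.8, 29),
    ("x-grok-4", 13.8, 29),
    ("claude-sonnet-4-6", 7.2, 13),
    ("openai-o4", 7.0, 9),
    ("claude-4-6-opus", 5.9, 29),
    ("claude-4-5-sonnet", 4.9, 29),
    ("openai-gpt-5-pro", 4.3, 29),
    ("claude-4-opus", 4.2, 20),
    ("claude-4-5-opus", 3.4, 29),
    ("openai-gpt-5-nano", 2.9, 29),
    ("openrouter-openai-compatible", 0.0, 20),
    ("openai-o4-mini-high", 0.0, 20),
]


def summarize(cards):
    """Return (model count, unweighted mean ASR in %, total scans)."""
    n_models = len(cards)
    avg_asr = sum(asr for _, asr, _ in cards) / n_models
    total_scans = sum(scans for _, _, scans in cards)
    return n_models, avg_asr, total_scans


if __name__ == "__main__":
    n_models, avg_asr, total_scans = summarize(SCORECARDS)
    print(f"Models Scanned: {n_models}")
    print(f"Avg ASR: {avg_asr:.2f}%")
    print(f"Total Scans: {total_scans}")
```

Under these assumptions the model count (23) and total scans (527) match the summary exactly. The mean computed from the one-decimal card values comes out within about 0.01 percentage points of the reported 18.76%, a gap that is plausibly due to rounding of the per-card figures.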