Detection accuracy of PixelPrism's 16-detector forensic ensemble against every major AI image generator, measured on a held-out canary set the model never sees during training.
| Generator | Accuracy | Last retrain |
|---|---|---|
DALL-E 3 OpenAI | 100.0% | May 5 2026 |
HunyuanDiT Tencent | 100.0% | May 5 2026 |
PixArt-Sigma PixArt-alpha | 100.0% | May 5 2026 |
Recraft V3 Recraft | 99.6% | May 5 2026 |
FLUX 1.1 Black Forest Labs | 97.8% | May 5 2026 |
Nano Banana Google Gemini Image | 97.4% | May 5 2026 |
Midjourney v6 Midjourney | 96.8% | May 5 2026 |
Ideogram 2.0 Ideogram | 94.5% | May 5 2026 |
Imagen 3 Google Vertex AI | 93.3% | May 5 2026 |
Stability SD 3.5 Stability AI | 92.3% | May 5 2026 |
Grok Imagine xAI | 91.0% | May 5 2026 |
● ≥ 95% ● 80–94% ● < 80%
canary set the detector never sees during training.P(AI) ≥ 0.5.
These numbers are auto-generated from data/eval_post_retrain_v8_2026-05.json on lilliemae. Data and methodology are open by design — when our detector misses something, you'll see it here. We do not cherry-pick.