Public GenAI Vulnerability Disclosures

Description

0DIN, the 0Day Investigative Network, was founded by Mozilla in 2024 to reward responsible researchers for their efforts in securing GenAI models. This dataset is a weekly export of the public 0DIN disclosures published at https://0din.ai/disclosures, in accordance with the 0DIN Research Terms and Disclosure Policy. Each record corresponds to a single validated, published disclosure and is grounded in 0DIN's public research frameworks: Security Boundaries (prompt extraction, guardrail jailbreak, interpreter jailbreak, content manipulation, weights/layers disclosure, prompt injection), the Jailbreak Taxonomy (category → strategy → technique), the Social Impact Score (SIS, levels 1–5), and the Nude Imagery Rating System (NIRS, levels 1–5). Records include title, summary, severity, security boundary, taxonomy triplets, affected models and vendors, visible test results, SIS and NIRS scores when assessed, public researcher credit when supplied, reference URLs, and disclosure/publication timestamps. Prompts, model responses, attack payloads, detection signatures, variant prompts, and submitter PII (other than a self-supplied public credit string) are intentionally excluded. Use the data to study GenAI vulnerability trends, train and evaluate safety classifiers, build detection pipelines, or inform model-card transparency on known weaknesses.

Specifics

Licensing

Creative Commons Attribution 4.0 International (CC-BY-4.0)

https://spdx.org/licenses/CC-BY-4.0.html

Description

Specifics

Considerations

Processes

Metadata