Tenable Jailbreaks GPT-5, Gets It To Generate Dangerous Info Despite OpenAI’s New Safety Tech

Keren Katz

August 11, 2025

2 分鐘快讀

Tenable Jailbreaks GPT-5, Gets It To Generate Dangerous Info Despite OpenAI’s New Safety Tech

Within just 24 hours of the release of OpenAI’s GPT-5, Tenable Research successfully managed to jailbreak the model by getting it to share detailed instructions for how to build an explosive. Our finding is concerning, given that OpenAI described GPT-5's prompt safety technology as significantly more sophisticated than the one used by its predecessors.

OpenAI officially unveiled GPT-5 on August 7, 2025, positioning it as their most advanced language model to date and a “significant step along the path to AGI.” Boasting expert-level capabilities across writing, coding, math and science, OpenAI says GPT-5 delivers enhanced speed, accuracy, and contextual understanding, while significantly reducing hallucinations.

OpenAI also promised significant enhancements in the area of prompt safety, saying that GPT-5 has a more sophisticated method of assessing whether and how to respond to prompts. AI models such as GPT-5 are designed with built-in guardrails to ensure responsible usage, including preventing the model from being used for illegal or harmful purposes.

Specifically, GPT-5 no longer uses its predecessors’ “refusal-based safety training” but rather a new method called “safe completions” that OpenAI describes as more nuanced and smarter, as explained in OpenAI’s article “From hard refusals to safe-completions: toward output-centric safety training” and its eponymous research paper.

With that being promised, just 24 hours after its release we were able to successfully jailbreak this “more responsible” model and convince it to provide detailed instructions for how to build a Molotov cocktail.

That’s why our prompt below and the resulting advice from GPT-5 raises alarm bells.

Screenshot showing Tenable's jailbreaking of GPT-5

Using an implementation of the crescendo technique, we managed to get the full recipe for a Molotov cocktail using just four questions.

This is how it happened:

We became a “history student.”
We showed interest in the historical aspects of it.
Now we got super interested in the recipe itself and we got the details on the materials needed.
We wanted to be an expert and that’s how we got the full step-by-step guide to mixing a Molotov cocktail.

Unfortunately, it’s clear that misusing ChatGPT-5 for malicious purposes isn’t that complicated, despite OpenAI’s attempts to beef up the product’s safety features. Our successful jailbreak of GPT-5 is far from the only one. Multiple other researchers and regular users in recent days have documented a variety of problems with the quality of GPT-5’s prompt responses, including jailbreaks and hallucinations.

In response, OpenAI has said that it is implementing fixes. However, your employees may already be using the model and potentially introducing risk into your organization.

This provides further evidence that solutions like Tenable AI Exposure are vitally important for getting control over the AI tools your organization uses, consumes and builds in-house -- in order to ensure your AI use is responsible, secure, ethical and compliant with regulations and laws around the world.

Click here to learn more about Tenable AI Exposure

Keren Katz

Senior Group Manager of Product, Threat Research, and AI, Tenable

Keren Katz is a leader in AI and cybersecurity, specializing in generative AI threat detection. She is currently a Senior Group Manager of Product, Threat Research, and AI at Tenable, following the acquisition of Apex, where she previously led security detection. Keren also led product at Sygnia - an IR and MXDR security company. She has a background in special operations deep-tech, software engineering, and in founding an ML-based startup. Keren is also a key contributor to the OWASP Foundation and a global speaker on AI, security and product management.

Cloud

Tenable One 曝險管理平台

平台類別

平台功能

雲端曝險

弱點曝險

AI Exposure

OT/IoT 曝險

身分曝險

企業需求

產業

合規性

公家機關

Tenable 與眾不同之處

將 Tenable 與下列產品相較：

資源

研究

Tenable Assure 合作夥伴

其他合作夥伴商機

支援

服務

Tenable Trust

關於我們

媒體

交流聯繫

加入我們

Tenable Jailbreaks GPT-5, Gets It To Generate Dangerous Info Despite OpenAI’s New Safety Tech

Keren Katz

Senior Group Manager of Product, Threat Research, and AI, Tenable

Tenable Jailbreaks GPT-5, Gets It To Generate Dangerous Info Despite OpenAI’s New Safety Tech

Keren Katz

Senior Group Manager of Product, Threat Research, and AI, Tenable

相關文章

Cybersecurity Snapshot: What Security Pros’ Are Saying About Exposure Management, Risk Prioritization, Tool Sprawl and More

FAQ About Sha1-Hulud 2.0: The "Second Coming" of the npm Supply-Chain Campaign

Cybersecurity Snapshot: Global Agencies Target Criminal “Bulletproof” Hosts, as CSA Unveils Agentic AI Risk Framework

您可以利用的網路安全最新消息

感謝訂閱！

Tenable Vulnerability Management

Tenable Vulnerability Management

感謝您

Tenable Vulnerability Management

Tenable Vulnerability Management

謝謝您

Tenable Vulnerability Management

Tenable Vulnerability Management

謝謝您

試用 Tenable Web App Scanning

購買 Tenable Web App Scanning

謝謝您

Tenable Security Center

Tenable OT Security

Tenable Identity Exposure

Tenable Cloud Security

Tenable One

Tenable AI Exposure

Tenable Attack Surface Management

Tenable Enclave Security

感謝您

免費試用 Tenable Nessus Professional

購買 Tenable Nessus Professional

免費試用 Tenable Nessus Expert

購買 Tenable Nessus Expert

瞭解 Tenable 如何協助達成 SLCGP 網路安全計畫要求

Tenable Patch Management

聯絡業務代表