GPT-5.2 Can Develop Zero-Day Exploits: Study unveils

infosecbulletin Wednesday , January 21 2026 Vulnerabilities

Recent research shows that AI systems can now handle complex exploit development tasks that used to need specialized human skills. The agents had to create exploits while facing realistic challenges like modern security measures, unknown heap conditions, and restrictions on hardcoded memory addresses.

In six scenarios focused on tasks like spawning shells, writing files, and creating command-and-control connections, the agents produced over 40 unique exploits. GPT-5.2 successfully handled all scenarios, while Opus 4.5 solved all except two.

Meta’s louisiana data center to exceed 250 billion price tag

By infosecbulletin / Tuesday , July 14 2026

Meta announced on Monday that its data center in Richland Parish, Louisiana, will grow to 5 gigawatts of computing power....

Meta’s louisiana data center to exceed 250 billion price tag

Ransomware Crisis in 2026: 5,064 Organizations Affected in 135 Countries

By infosecbulletin / Sunday , July 12 2026

Global ransomware attacks stayed very high in the first seven months of 2026. There were 5,064 confirmed victims in 135...

Ransomware Crisis in 2026: 5,064 Organizations Affected in 135 Countries

Palo Alto Networks Addresses 13 Vulnerabilities

By infosecbulletin / Sunday , July 12 2026

Palo Alto Networks shared warnings on Wednesday about over twelve security issues in its products. The new warnings include 13 security...

Palo Alto Networks Addresses 13 Vulnerabilities

Critical Dell BIOS & Zimbra Flaws Expose Enterprise Systems

By infosecbulletin / Sunday , July 12 2026

A critical flaw with how Dell saves BIOS passwords lets anyone quickly recover these passwords from a flash dump without...

Critical Dell BIOS & Zimbra Flaws Expose Enterprise Systems

CoLoCity Launches New 1.0 MW Data Center Facility at Gulshan

By infosecbulletin / Saturday , July 11 2026

CoLoCity is proud to launch a new Data Center in Gulshan-2. It is designed to meet the growing demand for...

CoLoCity Launches New 1.0 MW Data Center Facility at Gulshan

Daily Cyber security update for 10. 07. 2026

By infosecbulletin / Friday , July 10 2026

Cyberattacks are rising around the world, including ransomware, malware, data leaks, and hacked websites. These events show how complex and...

Daily Cyber security update for 10. 07. 2026

How Hacker Compromise AWS Cloud Environment Using AI in 72 Hours

By infosecbulletin / Friday , July 10 2026

A major AWS attack shows how attackers with AI can connect known cloud strategies to go from first access to...

How Hacker Compromise AWS Cloud Environment Using AI in 72 Hours

Mycelium Framework: First AI-as-a-Service Botnet

By infosecbulletin / Thursday , July 9 2026

A new cybercrime ad is catching attention in the security world. It talks about a botnet that doesn't just get...

Mycelium Framework: First AI-as-a-Service Botnet

CrowdStrike Shows 5 New Prompt Injection Techniques for AI Agents

By infosecbulletin / Wednesday , July 8 2026

CrowdStrike has shared five new ways to inject prompts, showing the rising danger to AI agents as more organizations use...

CrowdStrike Shows 5 New Prompt Injection Techniques for AI Agents

Critical GCP Dialogflow Vulnerability Allows Malicious Code Injection

By infosecbulletin / Wednesday , July 8 2026

A critical flaw in Google Cloud Platform’s Dialogflow CX lets attackers add harmful code to a company's AI chatbot system....

Critical GCP Dialogflow Vulnerability Allows Malicious Code Injection

Security researcher Sean Heelan tested AI agents based on Anthropic’s Opus 4.5 and OpenAI’s GPT-5.2 against a new vulnerability in the QuickJS JavaScript interpreter.

Agents showcased advanced skills by turning raw vulnerabilities into a working API to read and modify the memory of a target process. The AI systems analyzed source code, debugged, and learned through trial and error without human help.

Most challenges were resolved in less than an hour and at low costs, with a standard successful agent run using about 30 million tokens for approximately $30 USD with Opus 4.5.

Modern Security Protections:

The toughest scenario tested GPT-5.2’s ability to write a specific string to disk while multiple enterprise-level protections were active, including address space layout randomization, non-executable memory regions, full RELRO linking protections, fine-grained control-flow integrity, hardware-enforced shadow stacks, and a seccomp sandbox preventing shell execution.

An AI agent created a new solution that used seven function calls with glibc’s exit handler to overcome defenses. Developing this exploit took 50 million tokens over three hours, costing about $50 for that agent run.

The researcher highlighted two key limitations of the experiments. First, QuickJS, although a valid JavaScript interpreter, is much simpler and less complex than browser engines like Chrome’s V8 or Firefox’s SpiderMonkey.

The exploits used existing security gaps and flaws instead of introducing new bypass methods, similar to what human developers do in real situations. The exploit chains were newly designed to target a previously unknown vulnerability.

Implications:

The research suggests the cybersecurity industry should prepare for the “industrialization” of offensive operations, where an organization’s hacking capabilities become limited by computational token throughput rather than skilled personnel availability.

Heelan believes that AI automation is perfect for exploit development due to its clear verification methods, defined tools, and specific areas for agents to explore.

The experimental code, technical documentation, and raw agent outputs are available on GitHub for verification and reproduction.

The researcher urges the security community to test AI capabilities in real-world scenarios using zero-day vulnerabilities instead of just relying on capture-the-flag competitions or synthetic datasets.

InfoSecBulletin Cybersecurity for mankind

GPT-5.2 Can Develop Zero-Day Exploits: Study unveils

Meta’s louisiana data center to exceed 250 billion price tag

Ransomware Crisis in 2026: 5,064 Organizations Affected in 135 Countries

Palo Alto Networks Addresses 13 Vulnerabilities

Critical Dell BIOS & Zimbra Flaws Expose Enterprise Systems

CoLoCity Launches New 1.0 MW Data Center Facility at Gulshan

Daily Cyber security update for 10. 07. 2026

How Hacker Compromise AWS Cloud Environment Using AI in 72 Hours

Mycelium Framework: First AI-as-a-Service Botnet

CrowdStrike Shows 5 New Prompt Injection Techniques for AI Agents

Critical GCP Dialogflow Vulnerability Allows Malicious Code Injection

Related Articles

Check Also

Critical Wazuh Vuln Enables Alert Tampering and Evidence Deletion