OpenAI Codex Security Finds 10,561 High-Severity Vulnerabilities

OpenAI Launches Codex Security for Enhanced Code Vulnerability Detection

OpenAI made headlines on Friday by introducing Codex Security, an AI-powered security agent designed to automatically find, validate, and propose fixes for software vulnerabilities. The tool, now available as a research preview for ChatGPT Pro, Enterprise, Business, and Edu customers, can be accessed through the Codex web interface with free usage for the next month.

The company describes Codex Security as an agent that deeply understands the unique context of each project, identifying complex vulnerabilities that other tools often miss. By focusing on higher-confidence findings and actionable fixes, it aims to improve the overall security of systems while minimizing distracting reports from insignificant bugs.

Codex Security: Built on Proven Foundations

Codex Security is the next evolution of Aardvark, OpenAI’s earlier tool launched in private beta in October 2025. Aardvark was created to help developers and security teams identify and remediate security issues at scale. Over the past 30 days, Codex Security has scanned more than 1.2 million code commits across external repositories during its beta phase, uncovering 792 critical findings and 10,561 high-severity findings.

The vulnerabilities identified span a variety of well-known open-source projects, including OpenSSH, GnuTLS, GOGS, Thorium, libssh, PHP, and Chromium. Some notable vulnerabilities discovered include:

  • GnuPG – CVE-2026-24881, CVE-2026-24882
  • GnuTLS – CVE-2025-32988, CVE-2025-32989
  • GOGS – CVE-2025-64175, CVE-2026-25242
  • Thorium – CVE-2025-35430 through CVE-2025-35436

Advanced AI Capabilities Powering Security

According to OpenAI, this latest iteration leverages the advanced reasoning capabilities of its most sophisticated frontier models, supplementing them with automated validation processes. This approach effectively reduces the risk of false positives and ensures that the proposed fixes are both meaningful and actionable.

Continuous scans over time have shown substantial improvements in precision and a marked decline in false positive rates—down by over 50% across all tested repositories.

How Codex Security Works

OpenAI outlined a three-step process for how Codex Security operates:

  1. Repository Analysis and Threat Modeling: The agent begins by analyzing a repository to understand the security-relevant structure of the system. It creates an editable threat model that highlights what the project does and where it’s most vulnerable.
  2. Vulnerability Identification and Validation: Using the constructed system context, Codex Security identifies vulnerabilities and classifies them based on their real-world impact. Each flagged issue is then pressure-tested in a sandboxed environment to validate its existence and severity.
  3. Actionable Fixes: After validation, the agent proposes fixes that align with the system’s behavior, reducing the risk of introducing new bugs and making it easier for teams to review and deploy patches.

When Codex Security is configured for a specific project environment, it can validate issues directly in the context of the running system. This deeper validation further reduces false positives and can even generate working proofs-of-concept, providing security teams with concrete evidence and clear remediation paths.

Industry Context: Automated Security Agents on the Rise

The introduction of Codex Security comes amid growing competition in the AI-powered code security space. Recently, Anthropic also launched Claude Code Security, another tool aimed at helping users scan codebases for vulnerabilities and suggest effective patches.

These developments reflect a broader trend toward integrating advanced AI capabilities into cybersecurity workflows, enabling organizations to address vulnerabilities at greater scale and speed than ever before.

The Future of DevSecOps with AI

The successful deployment of Codex Security signals a major shift in how developers and security teams can leverage AI for proactive threat detection and remediation. By combining context-aware analysis, sophisticated threat modeling, and automated validation, tools like Codex Security are poised to become invaluable assets in securing modern software development pipelines.


This article is inspired by content from Original Source. It has been rephrased for originality. Images are credited to the original source.

Subscribe to our Newsletter