Hack The Box Launches AI Range Platform to Benchmark Autonomous Security Agents Against Human Cyber Teams in Realistic Threat Environments

December 15, 2025

|

Cybersecurity training provider Hack The Box has launched HTB AI Range, a simulation platform enabling enterprises to test autonomous AI security agents alongside human defenders under realistic operational conditions. The platform addresses the urgent need to continuously validate AI systems in realistic operational contexts where stakes are high and human oversight remains vital Cryptopolitan, as organizations prepare for AI-powered threat environments where attackers deploy automated reconnaissance and exploitation at unprecedented scale.

In a recent AI versus human capture the flag exercise, autonomous AI agents solved 19 out of 20 basic challenges, but in multi-step challenges in more complex environments, human teams significantly outperformed the AI agents Cryptopolitan. AI teams achieved a 95% success rate on easy-tier tasks but faced substantial limitations on final multi-step challenges where humans far outperformed AI capabilities OpenAI.

The AI Range simulates enterprise complexity with thousands of offensive and defensive targets that are continuously updated, supporting mapping to established cyber frameworks including MITRE ATT&CK, NIST/NICE guidelines, and OWASP Top 10 Cryptopolitan. Attackers are already using AI to scale activity to send thousands of automated requests, often multiple per second, targeting large tech, financial, manufacturing and government institutions OpenAI.

Vulnerabilities in AI models add to those already present in traditional IT infrastructure, so before agentic or AI-based cybersecurity tools can be deployed operationally, testing environments where AI agents and human defenders can work together under realistic pressure become essential Cryptopolitan. The platform represents a strategic shift from static security audits toward continuous threat exposure management models.

In a separate 10-day AI red teaming CTF run by HTB and HackerOne, only 43% of registrants completed a single challenge, signaling a significant skills gap OpenAI in the workforce's ability to understand and defend against AI-enabled threats. Adversaries have already demonstrated the ability to perform attacks at 10 times previous speeds, reinforcing existing ransomware and social engineering tactics Tekedia.

The launch coincides with HTB's announcement of an AI Red Teamer Certification available in Q1 2026, developed with Google to align with Google's Secure AI Framework, establishing the first industry credential for end-to-end AI system security assessment.

Haris Pylarinos, CEO and founder of Hack The Box, stated: "For over two years, we've been advancing AI-driven learning paths, labs, and research where machines and humans compete, collaborate, and co-evolve. With HTB AI Range, we're not reacting to AI's rise in cyber; we're defining how defence evolves alongside it" Cryptopolitan.

Dawn-Marie Vaughan, Global Offering Lead for Cybersecurity at DXC, commented: "AI is fundamentally reshaping the threat landscape. Early research is already showing how AI can automate reconnaissance and link potential exploit paths in ways that were extremely difficult just a year ago. As these capabilities mature, defenders will need teams trained to operate under more dynamic, real-world conditions" OpenAI.

The company suggests AI struggles with complexity and multi-stage operations, pointing to the continuing value of human expertise, especially in high-stakes or complex work

Enterprises can use the AI Range to validate whether existing security measures work under AI-powered attacks, give their cybersecurity teams experience of AI-powered threats, and develop more resilient cybersecurity tools based on agentic AI Cryptopolitan. Such exercises could be used to justify cybersecurity investment to financial decision-makers Cryptopolitan, translating technical readiness into business risk metrics.

Continuous testing and validation of cybersecurity defences proves more effective long-term than static audits or penetration testing exercises, aligning closer to Continuous Threat Exposure Management models Cryptopolitan. Organizations deploying AI security agents must establish governance frameworks determining which defensive operations can be fully autonomous, which require human oversight, and which must remain entirely human-controlled before adversaries exploit these systems' inherent limitations in complex, multi-stage attack scenarios.

As AI matures and frameworks like MITRE ATLAS gain traction, tools like HTB's AI Range may become standard components in enterprise security programmes Cryptopolitan. Decision-makers should monitor whether hybrid human-AI defensive teams demonstrate measurable improvements in mean time to detect and respond compared to purely human or fully autonomous approaches. The platform's ability to benchmark AI agent performance against human expertise will likely inform regulatory frameworks governing autonomous security tool deployment, particularly as threat actors weaponize similar AI capabilities for offensive operations at unprecedented velocity and scale.

Source & Date

Source: Artificial Intelligence News, Hack The Box, Business Wire, SiliconANGLE, Morningstar
Date: December 3, 2025

Featured tools

Scalenut AI

Free

Scalenut AI is an all-in-one SEO content platform that combines AI-driven writing, keyword research, competitor insights, and optimization tools to help you plan, create, and rank content.

#

SEO

Learn more

Kreateable AI

Free

Kreateable AI is a white-label, AI-driven design platform that enables logo generation, social media posts, ads, and more for businesses, agencies, and service providers.

#

Logo Generator

Learn more

Learn more about future of AI

Join 80,000+ Ai enthusiast getting weekly updates on exciting AI tools.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Promote Your Tool

Copy Embed Code

Similar Blogs

July 31, 2026

|

Dodo MIDI Enhances Music Production Workflows

Dodo MIDI is part of the music technology ecosystem, focusing on MIDI-based tools that help users create, control, and manage digital musical compositions.

July 31, 2026

|

Polar Cloud Advances Secure Data Management

Polar Cloud is positioned within the cloud computing ecosystem, offering users technology solutions focused on managing and accessing digital resources through cloud-based environments.

July 31, 2026

|

Metastream Enhances Shared Digital Entertainment

Metastream is a digital platform designed to enable synchronized media playback, allowing multiple users to watch online content together from different locations.

July 31, 2026

|

Deep Realms Advances Immersive Digital Experiences

Deep Realms is positioned within the category of immersive digital experiences, offering users a platform focused on exploration, creativity, and interactive engagement.

July 31, 2026

|

Starbackpage Evolves Digital Marketplace Platforms

Starbackpage is part of the online classified marketplace category, where users can create listings, discover services, and interact through digital platforms.

July 31, 2026

|

Calyx VPN Strengthens Digital Privacy Security

Calyx VPN is a privacy-focused virtual private network solution designed to provide users with secure internet connections and enhanced online privacy.

View Blogs