Hack The Box Launches AI Range Platform to Benchmark Autonomous Security Agents Against Human Cyber Teams in Realistic Threat Environments

December 15, 2025
|

Cybersecurity training provider Hack The Box has launched HTB AI Range, a simulation platform enabling enterprises to test autonomous AI security agents alongside human defenders under realistic operational conditions. The platform addresses the urgent need to continuously validate AI systems in realistic operational contexts where stakes are high and human oversight remains vital Cryptopolitan, as organizations prepare for AI-powered threat environments where attackers deploy automated reconnaissance and exploitation at unprecedented scale.

In a recent AI versus human capture the flag exercise, autonomous AI agents solved 19 out of 20 basic challenges, but in multi-step challenges in more complex environments, human teams significantly outperformed the AI agents Cryptopolitan. AI teams achieved a 95% success rate on easy-tier tasks but faced substantial limitations on final multi-step challenges where humans far outperformed AI capabilities OpenAI.

The AI Range simulates enterprise complexity with thousands of offensive and defensive targets that are continuously updated, supporting mapping to established cyber frameworks including MITRE ATT&CK, NIST/NICE guidelines, and OWASP Top 10 Cryptopolitan. Attackers are already using AI to scale activity to send thousands of automated requests, often multiple per second, targeting large tech, financial, manufacturing and government institutions OpenAI.

Vulnerabilities in AI models add to those already present in traditional IT infrastructure, so before agentic or AI-based cybersecurity tools can be deployed operationally, testing environments where AI agents and human defenders can work together under realistic pressure become essential Cryptopolitan. The platform represents a strategic shift from static security audits toward continuous threat exposure management models.

In a separate 10-day AI red teaming CTF run by HTB and HackerOne, only 43% of registrants completed a single challenge, signaling a significant skills gap OpenAI in the workforce's ability to understand and defend against AI-enabled threats. Adversaries have already demonstrated the ability to perform attacks at 10 times previous speeds, reinforcing existing ransomware and social engineering tactics Tekedia.

The launch coincides with HTB's announcement of an AI Red Teamer Certification available in Q1 2026, developed with Google to align with Google's Secure AI Framework, establishing the first industry credential for end-to-end AI system security assessment.

Haris Pylarinos, CEO and founder of Hack The Box, stated: "For over two years, we've been advancing AI-driven learning paths, labs, and research where machines and humans compete, collaborate, and co-evolve. With HTB AI Range, we're not reacting to AI's rise in cyber; we're defining how defence evolves alongside it" Cryptopolitan.

Dawn-Marie Vaughan, Global Offering Lead for Cybersecurity at DXC, commented: "AI is fundamentally reshaping the threat landscape. Early research is already showing how AI can automate reconnaissance and link potential exploit paths in ways that were extremely difficult just a year ago. As these capabilities mature, defenders will need teams trained to operate under more dynamic, real-world conditions" OpenAI.

The company suggests AI struggles with complexity and multi-stage operations, pointing to the continuing value of human expertise, especially in high-stakes or complex work

Enterprises can use the AI Range to validate whether existing security measures work under AI-powered attacks, give their cybersecurity teams experience of AI-powered threats, and develop more resilient cybersecurity tools based on agentic AI Cryptopolitan. Such exercises could be used to justify cybersecurity investment to financial decision-makers Cryptopolitan, translating technical readiness into business risk metrics.

Continuous testing and validation of cybersecurity defences proves more effective long-term than static audits or penetration testing exercises, aligning closer to Continuous Threat Exposure Management models Cryptopolitan. Organizations deploying AI security agents must establish governance frameworks determining which defensive operations can be fully autonomous, which require human oversight, and which must remain entirely human-controlled before adversaries exploit these systems' inherent limitations in complex, multi-stage attack scenarios.

As AI matures and frameworks like MITRE ATLAS gain traction, tools like HTB's AI Range may become standard components in enterprise security programmes Cryptopolitan. Decision-makers should monitor whether hybrid human-AI defensive teams demonstrate measurable improvements in mean time to detect and respond compared to purely human or fully autonomous approaches. The platform's ability to benchmark AI agent performance against human expertise will likely inform regulatory frameworks governing autonomous security tool deployment, particularly as threat actors weaponize similar AI capabilities for offensive operations at unprecedented velocity and scale.

Source & Date

Source: Artificial Intelligence News, Hack The Box, Business Wire, SiliconANGLE, Morningstar
Date: December 3, 2025

  • Featured tools
Hostinger Website Builder
Paid

Hostinger Website Builder is a drag-and-drop website creator bundled with hosting and AI-powered tools, designed for businesses, blogs and small shops with minimal technical effort.It makes launching a site fast and affordable, with templates, responsive design and built-in hosting all in one.

#
Productivity
#
Startup Tools
#
Ecommerce
Learn more
Neuron AI
Free

Neuron AI is an AI-driven content optimization platform that helps creators produce SEO-friendly content by combining semantic SEO, competitor analysis, and AI-assisted writing workflows.

#
SEO
Learn more

Learn more about future of AI

Join 80,000+ Ai enthusiast getting weekly updates on exciting AI tools.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Hack The Box Launches AI Range Platform to Benchmark Autonomous Security Agents Against Human Cyber Teams in Realistic Threat Environments

December 15, 2025

Cybersecurity training provider Hack The Box has launched HTB AI Range, a simulation platform enabling enterprises to test autonomous AI security agents alongside human defenders under realistic operational conditions. The platform addresses the urgent need to continuously validate AI systems in realistic operational contexts where stakes are high and human oversight remains vital Cryptopolitan, as organizations prepare for AI-powered threat environments where attackers deploy automated reconnaissance and exploitation at unprecedented scale.

In a recent AI versus human capture the flag exercise, autonomous AI agents solved 19 out of 20 basic challenges, but in multi-step challenges in more complex environments, human teams significantly outperformed the AI agents Cryptopolitan. AI teams achieved a 95% success rate on easy-tier tasks but faced substantial limitations on final multi-step challenges where humans far outperformed AI capabilities OpenAI.

The AI Range simulates enterprise complexity with thousands of offensive and defensive targets that are continuously updated, supporting mapping to established cyber frameworks including MITRE ATT&CK, NIST/NICE guidelines, and OWASP Top 10 Cryptopolitan. Attackers are already using AI to scale activity to send thousands of automated requests, often multiple per second, targeting large tech, financial, manufacturing and government institutions OpenAI.

Vulnerabilities in AI models add to those already present in traditional IT infrastructure, so before agentic or AI-based cybersecurity tools can be deployed operationally, testing environments where AI agents and human defenders can work together under realistic pressure become essential Cryptopolitan. The platform represents a strategic shift from static security audits toward continuous threat exposure management models.

In a separate 10-day AI red teaming CTF run by HTB and HackerOne, only 43% of registrants completed a single challenge, signaling a significant skills gap OpenAI in the workforce's ability to understand and defend against AI-enabled threats. Adversaries have already demonstrated the ability to perform attacks at 10 times previous speeds, reinforcing existing ransomware and social engineering tactics Tekedia.

The launch coincides with HTB's announcement of an AI Red Teamer Certification available in Q1 2026, developed with Google to align with Google's Secure AI Framework, establishing the first industry credential for end-to-end AI system security assessment.

Haris Pylarinos, CEO and founder of Hack The Box, stated: "For over two years, we've been advancing AI-driven learning paths, labs, and research where machines and humans compete, collaborate, and co-evolve. With HTB AI Range, we're not reacting to AI's rise in cyber; we're defining how defence evolves alongside it" Cryptopolitan.

Dawn-Marie Vaughan, Global Offering Lead for Cybersecurity at DXC, commented: "AI is fundamentally reshaping the threat landscape. Early research is already showing how AI can automate reconnaissance and link potential exploit paths in ways that were extremely difficult just a year ago. As these capabilities mature, defenders will need teams trained to operate under more dynamic, real-world conditions" OpenAI.

The company suggests AI struggles with complexity and multi-stage operations, pointing to the continuing value of human expertise, especially in high-stakes or complex work

Enterprises can use the AI Range to validate whether existing security measures work under AI-powered attacks, give their cybersecurity teams experience of AI-powered threats, and develop more resilient cybersecurity tools based on agentic AI Cryptopolitan. Such exercises could be used to justify cybersecurity investment to financial decision-makers Cryptopolitan, translating technical readiness into business risk metrics.

Continuous testing and validation of cybersecurity defences proves more effective long-term than static audits or penetration testing exercises, aligning closer to Continuous Threat Exposure Management models Cryptopolitan. Organizations deploying AI security agents must establish governance frameworks determining which defensive operations can be fully autonomous, which require human oversight, and which must remain entirely human-controlled before adversaries exploit these systems' inherent limitations in complex, multi-stage attack scenarios.

As AI matures and frameworks like MITRE ATLAS gain traction, tools like HTB's AI Range may become standard components in enterprise security programmes Cryptopolitan. Decision-makers should monitor whether hybrid human-AI defensive teams demonstrate measurable improvements in mean time to detect and respond compared to purely human or fully autonomous approaches. The platform's ability to benchmark AI agent performance against human expertise will likely inform regulatory frameworks governing autonomous security tool deployment, particularly as threat actors weaponize similar AI capabilities for offensive operations at unprecedented velocity and scale.

Source & Date

Source: Artificial Intelligence News, Hack The Box, Business Wire, SiliconANGLE, Morningstar
Date: December 3, 2025

Promote Your Tool

Copy Embed Code

Similar Blogs

December 15, 2025
|

Industry Leaders Declare 2026 the End of Experimental AI Era as Autonomous Agentic Systems Replace Chatbots, Energy Constraints Replace Model Parameters as Primary Bottleneck

2026 will lose the focus on model parameters and be about agency, energy efficiency, and the ability to navigate complex industrial environments with the next twelve months representing a departure from chatbots.
Read more
December 15, 2025
|

BBVA Deploys ChatGPT Enterprise to 120,000 Employees Across 25 Countries in One of Finance Industry's Largest AI Transformations, Saving Three Hours Weekly per Worker

Read more
December 15, 2025
|

Microsoft's 37.5 Million Copilot Conversation Analysis Reveals Dual Identity: Desktop Productivity Tool by Day, Mobile Confidant for Health and Philosophy by Night

Microsoft's AI research team analyzed 37.5 million anonymized conversations revealing distinct AI use patterns following surprisingly human rhythms from late-night philosophical querie.
Read more
December 15, 2025
|

Microsoft Launches Promptions Framework to Eliminate AI Trial & Error Loop, Replacing Natural Language Prompts with Dynamic UI Controls for Enterprise Precision

Microsoft has released Promptions (prompt + options), an open-source UI framework designed to address inefficiency where AI prompts are given, responses miss the mark.
Read more
December 15, 2025
|

How US Regulations Are Shaping AI Adoption in 2026

Artificial intelligence has become essential to American business growth, powering everything from automation and analytics to customer service and supply chain optimization.
Read more
December 15, 2025
|

AI Security Risks Every American Business Owner Should Watch For

Artificial intelligence has become a powerful engine for growth in American businesses streamlining operations, improving customer service, and unlocking data-driven insights.
Read more