< ciso
brief />
AI and Security Pulse Banner

All news in category “AI and Security Pulse

957 articles · page 2 of 48

Defense in Depth for Autonomous AI Agents

🛡️ Microsoft Security explains how rising agentic autonomy reorients security from models to how agents are assembled, constrained, and governed inside applications. The post identifies amplified risks—agent hijacking, intent breaking, data leakage, supply chain compromise—and shows why the application layer is decisive because builders fully control permissions, tool access, and failure handling. It recommends concrete design patterns: agents as microservices, least permissions, deterministic human-in-the-loop, and distinct agent identity to limit blast radius and preserve auditability.
read more →

AI Hallucinations Introduce Critical Security Risks

⚠️ AI hallucinations—confident but incorrect outputs—are increasingly driving risky decisions in critical infrastructure and cybersecurity operations, exploiting human trust in authoritative-sounding responses. A 2025 AA-Omniscience benchmark of 40 models found most systems were more likely to offer a confident wrong answer on difficult questions, underscoring that AI outputs must be treated as potential vulnerabilities until vetted. Effective controls include enforced human review before sensitive actions, treating training data as a security asset, strict least-privilege for AI systems, and prompt-engineering training to reduce ambiguous inputs.
read more →

Assessing the Risks of Anthropic’s Mythos AI Capabilities

🔍 Anthropic’s announcement that Claude Mythos Preview will not be released publicly underscores both genuine capability and strategic constraint. Independent testing and reproductions suggest similar performance from OpenAI’s GPT-5.5 and smaller community models, while Mythos’ cost and corporate incentives shape access. These generative systems dramatically improve automated vulnerability discovery, empowering both attackers and defenders. Mozilla’s use found 271 flaws, but many devices remain unpatchable, so organizations must adapt quickly.
read more →

Most Organizations Deploy AI Agents Despite Identity Risks

🔒 Semperis finds that 93% of global organizations use or plan to use AI agents for security tasks such as password resets and VPN access, while 92% report AI on endpoints with SSH and encryption key access. The survey of 1,100 organizations warns of over‑permissioned and abandoned 'zombie' non‑human identities that increase hijack risk. Semperis recommends treating agents as NHIs, enforcing least‑privilege, and improving observability and recovery readiness.
read more →

Defender's Guide: Frontier AI Impact on Cybersecurity

🔒 Palo Alto Networks reports ongoing testing of frontier AI models, including Anthropic and OpenAI, finding they rapidly surface code vulnerabilities and potential exploit paths. In the May 'Patch Wednesday' advisories the majority of findings originated from these AI scans, prompting broad rescanning and remediation. The company warns of a narrow three-to-five-month window before AI-driven exploits spread and offers Unit 42 services to help organizations respond.
read more →

Microsoft MDASH: Multi-Model AI for Vulnerability Discovery

🛡️ Microsoft introduced MDASH (multi-model agentic scanning harness), a model-agnostic AI system in limited private preview designed to discover, validate, and prove exploitable defects in large codebases. The system orchestrates more than 100 specialized agents across frontier and distilled models in a structured pipeline that builds threat models, runs auditor and debater stages, groups equivalent findings, and proves vulnerabilities. Microsoft reports MDASH uncovered 16 issues fixed in this month’s Patch Tuesday, including two critical Windows networking and authentication flaws.
read more →

When China's AI Catches Up: Mythos and Global Risks

🔒 Anthropic's Mythos Preview, shared last month with a limited set of security partners, has demonstrated the ability to autonomously find zero-day vulnerabilities across major operating systems and browsers. Anthropic paired the release with Project Glasswing and $100 million in usage credits to help defenders, but reports of unauthorized access and denied requests from Chinese entities have already emerged. The development challenges the assumption of a durable US lead and has injected cybersecurity into high-level US–China summit talks, prompting urgent questions about access, regulation, and international cooperation.
read more →

Autonomous Validation: Closing the AI-Speed Breach Gap

🛡️ In a post-Mythos environment, AI-driven attacks can weaponize vulnerabilities within hours or minutes, outpacing traditional defensive cycles. Picus Security argues defenders must pair continuous Breach and Attack Simulation (BAS) with autonomous pentesting to validate controls and reveal genuine attack paths. Operational friction — the "spaghetti handoff" between tools and teams —, not tooling alone, is the main cause of delayed response, so validation must be automated end-to-end.
read more →

GPT-5.5 Matches Mythos in Security Vulnerability Tests

🔍 The UK’s AI Security Institute evaluated GPT-5.5’s ability to identify software security vulnerabilities and concluded it performs comparably to Claude Mythos, based on a series of red-team style tests and benchmark prompts. The assessment highlights that GPT-5.5 is generally available from OpenAI, making high-quality automated vulnerability detection more accessible to organizations and researchers. The Institute also analyzed a smaller, cheaper model which, when given additional prompting scaffolding and careful supervision, delivered similar detection performance. Overall, the study suggests parity among leading LLMs for initial vulnerability discovery, with differences largely hinging on prompt engineering and deployment context.
read more →

AI-Assisted Synthetic Attack Logs to Accelerate Detection

🔒 Microsoft researchers describe an AI-driven pipeline that translates attacker TTPs into realistic, structured security logs to accelerate detection engineering. The approach uses prompt engineering, collaborative agentic refinement, and data augmentation to generate semantically accurate telemetry (command lines, process ancestry, fields) without exposing sensitive customer data. Evaluation across multiple datasets shows agentic workflows and reasoning models notably improve recall and fidelity compared to prompt-only methods.
read more →

AI Coding Agents Expand Developer Threat Surface Risks

🔍 AI coding agents now operate across IDEs, terminals, and extension runtimes, so defenders must expand focus beyond source code to repository files, instruction and runtime settings, and third‑party extensions that shape agent behavior. VirusTotal Code Insight and agentic threat intelligence apply semantic analysis to detect malicious intent in syntactically valid artifacts and link findings to broader campaigns and supply‑chain risks. Examples—weaponized tasks.json, malicious Skill.md, redirected settings.json endpoints, and sabotaged extensions—illustrate how semantics can enable exfiltration, privilege escalation, and stealthy attacker control.
read more →

OpenAI Daybreak: Secure-by-Design LLMs for Developers

🔒 OpenAI has launched Daybreak, an initiative built on its frontier LLMs and the Codex assistant to help developers embed security throughout the software development lifecycle. Announced on May 12, Daybreak extends the Trusted Access for Cyber (TAC) program and includes GPT‑5.5, TAC-enabled GPT‑5.5, GPT‑5.5‑Cyber and a Codex Security research preview. The initiative supports code scanning, vulnerability triage, automated detection and response while pairing defensive capabilities with verification, proportional safeguards and accountability.
read more →

Agentic AI: The Next Blindspot for Security Teams and Risk

🔐 Agentic AI is already operating across enterprises, executing tasks and taking actions often without meaningful security involvement. Security teams must develop hands‑on fluency — build and test agents, understand integrations like the Model Context Protocol, and enforce scoped configurations — because policy alone won't close the gap. The piece distinguishes three agent classes (productivity, MCP‑connected vendor agents, and custom user agents) and emphasizes configuration, access scoping, and training such as SANS SEC545 to reduce exposure.
read more →

CISOs Step into AI Spotlight: Risk, Governance and Trust

🔒 CISOs are shifting from a primarily technical control function to strategic business partners as AI reshapes risk, operations, and product delivery. Leaders such as Barry Hensley, Shaun Khalfan, and Jeff Trudeau stress publishing AI security frameworks, embedding security early in development, and aligning controls to business outcomes. They warn of AI-enabled threats — including advanced phishing, voice/video impersonation, and automated vulnerability discovery — and call for continuous controls, stronger identity and data governance, and near-real-time patching. Growing board engagement and changing reporting lines reflect the elevated role of security in enterprise strategy.
read more →

OpenAI launches Daybreak to harden software defenses

🛡️ OpenAI announced Daybreak, a cybersecurity initiative that combines GPT-5.5 family models with Codex Security to identify, test, and propose fixes for vulnerabilities before attackers exploit them. Daybreak builds editable threat models, runs isolated vulnerability tests, and suggests prioritized remediation and patch validation. Access is tightly controlled and available by request, and major vendors are integrating under Trusted Access for Cyber.
read more →

AI-Driven Exploitation: Evolving Threats and Access Risks

🔍 Google Threat Intelligence Group (GTIG) reports a rapid shift from nascent AI-enabled operations to industrial-scale use of generative models by threat actors. Based on Mandiant incident response, Gemini telemetry, and GTIG research, the report documents AI-assisted zero-day exploit development, autonomous malware like PROMPTSPY, and advanced obfuscation techniques. It highlights supply chain targeting of AI environments, anonymized premium LLM access, and specific interest from PRC- and DPRK-linked clusters. The report also outlines mitigations and defensive AI uses.
read more →

Lyrie.ai Joins Anthropic CVP, Releases Open ATP Standard

🔒 OTT Cybersecurity LLC — the team behind Lyrie.ai — announced acceptance into Anthropic’s Cyber Verification Program and the public release of the Agent Trust Protocol (ATP). ATP is an open cryptographic standard that enables real-time verification of an AI agent’s identity, authorized scope, attestation status, delegation, and revocation. The protocol is royalty-free, slated for IETF submission, and a reference implementation is published under an MIT license. Lyrie positions itself as the security layer for autonomous AI agents operating on the internet.
read more →

AI-Enabled Attack: First Recorded AI-Driven Zero-Day

🔍 Google’s Threat Intelligence Group (GTIG) reports the first observed case of cybercriminals using AI to discover and weaponize a zero-day, targeting a popular open-source web-based system administration tool to bypass two-factor authentication. GTIG worked with the vendor to close the flaw and disrupt the campaign. Forensic analysis of the Python exploit showed AI-like traits—structured docstrings, Pythonic formatting, and a hallucinated CVSS score. Google noted the attackers did not use Gemini or Anthropic Mythos.
read more →

Google Finds AI-Crafted Zero-Day Exploit in Wild, Reported

🔍 The Google Threat Intelligence Group (GTIG) reported the first confirmed instance of an AI-crafted zero-day exploit observed in the wild. The researchers identified a Python-based exploit that bypasses two-factor authentication in an open-source web administration tool and disclosed the flaw to the vendor to limit mass exploitation. GTIG found artifacts in the code—help text, a hallucinated CVSS score and textbook LLM-style constructs—consistent with large language model generation, and noted broader AI abuse by threat actors including misuse of Gemini and agentic tooling.
read more →

Autonomous Purple Teaming: Closing the Exploitation Gap

🛡️ Traditional purple teaming is failing because human handoffs and siloed toolchains make detection-to-fix cycles far slower than modern attackers. The author documents a collapse in the vulnerability-to-exploit window—from 56 days in 2024 to roughly 10 hours in early 2026 across CISA KEV, VulnCheck KEV, and ExploitDB—and warns that AI-assisted adversaries can act in seconds. Autonomous purple teaming pairs automated penetration testing, Breach and Attack Simulation, and AI-powered mobilization agents to close the loop at machine speed, converting red findings into blue tests and auto-deploying low-risk fixes while keeping every step auditable.
read more →