All news with #llm security tag

249 articles · page 3 of 13

April 1, 2026

Nine Practical Steps for CISOs to Prevent AI Hallucinations

🔍 CISOs should treat AI outputs as drafts, keep humans in the loop for high‑stakes decisions, and demand traceability from vendors before accepting compliance or control assessments. The story cites practitioners who stress-test models for consistency, measure hallucination and drift rates over time, and validate AI findings against scanners and penetration testing. It warns against automated regulatory mapping without technical verification and emphasizes audit trails, human signoff, and vendor proof as essential controls.

AI Risk Management LLM Security AI Governance

March 27, 2026

Five Techniques to Optimize LLM Inference Efficiency

⚡ Karl Weinmeister frames LLM inference as an efficient frontier that trades latency against throughput and argues production systems often sit below this curve. He presents five actionable optimizations—semantic model routing, prefill/decode disaggregation, modern quantization, context-aware L7 routing with prefix caching, and speculative decoding—and explains their practical tradeoffs. A Vertex AI case study reports 35% faster time-to-first-token and doubled prefix cache hit rates after deploying GKE Inference Gateway.

LLM Security Vertex AI

March 26, 2026

How CISOs Should Respond to Shadow AI Risks and Governance

🔒 Shadow AI — the unapproved use of AI tools and embedded AI features — is proliferating as employees seek productivity gains and vendors quietly enable capabilities. CISOs should first assess data sensitivity, storage practices and whether corporate inputs are being used to train models. After evaluating risk, organizations must choose to block or formally integrate tools and apply mitigations such as filtering, acceptable-use policies and targeted employee education. Clear governance, cross-functional review and simple approval pathways help balance innovation with security without unduly punishing productive behavior.

AI Governance LLM Security Data Security Opinion

March 26, 2026

AI Named Top Cybersecurity Priority as Threats Rise

🔒 A PwC report finds AI is now the top cybersecurity investment priority for defenders as criminals rapidly weaponize generative models. The firm's Annual Threat Dynamics 2026 study warns adversaries are using AI to accelerate malware development, automate reconnaissance and scale social engineering, including via dark‑web LLMs. PwC cites agentic tools like ReaperAI being repurposed in real campaigns, but also stresses that AI can empower defenders with faster detection, automated containment and intelligence‑led decision‑making when embedded into security strategies.

AI Security LLM Security Threat Report Agentic AI

March 25, 2026

AI Is Breaking Security Models — Where They Fail First

🤖 AI-assisted triage is changing vulnerability workflows and forcing organizations to redesign ownership and decision-making. By enriching findings with exploitability indicators, ownership metadata and business-impact signals, AI platforms accelerate detection and reduce manual triage. Security teams must shift from routine investigation to governing models, defining owners, and maintaining human checkpoints for high‑risk actions. Treat AI-driven features as first-class risk surfaces and assign clear owners for model behavior, prompt safety and misuse prevention.

AI Security Vulnerability Management LLM Security

March 24, 2026

Kubernetes as AI Infrastructure: llm-d Joins CNCF Sandbox

🚀 Google Cloud and partners announced that llm-d has been accepted into the CNCF Sandbox to promote open, accelerator-agnostic standards for distributed LLM inference. As a founding contributor alongside Red Hat, IBM Research, CoreWeave, and NVIDIA, Google emphasizes running any model on any accelerator in any cloud without vendor lock-in. GKE Inference Gateway now integrates the llm-d Endpoint Picker (EPP) to enable model-aware routing that optimizes for KV-cache hits, inflight requests, and queue depth, yielding concrete production gains in Vertex AI tests. Complementary work on the Kubernetes LeaderWorkerSet (LWS) API and vLLM extensions for Cloud TPUs targets scalable multi-node orchestration and up to 5x throughput improvements.

Google Kubernetes Nvidia LLM Security

March 24, 2026

Why CISOs Should Embrace AI-Powered Honeypots Today

🛡️ AI-driven honeypots pair large language models with deception servers to create dynamic, realistic environments that keep attackers engaged longer and collect richer threat intelligence. Academic research by Dr. M. Abdullah Canbaz and others showed LLMs can parse traffic and handle complex Linux commands, prompting open-source and commercial efforts such as Beelzebub and Deutsche Telekom’s T-Pot. These systems significantly lower the cost and engineering effort of high-interaction deception while enabling deployment in novel locations like APIs and AI agents. However, defenders must balance benefits with risks—attackers are using AI to automate attacks and may develop countermeasures such as deception-detection services or data poisoning—so CISOs should view AI honeypots as a complement to existing sensors and an important tool for improved visibility and hunting.

Threat Hunting LLM Security AI Security

March 19, 2026

Cloudflare Workers AI Adds Frontier Open-Source Models

🤖 Cloudflare’s Workers AI now hosts frontier open-source models, beginning with Kimi K2.5, a 256k-context model that supports multi-turn tool calling, vision inputs, and structured outputs. The release enables organizations to run full agent lifecycles on Cloudflare’s Developer Platform, leveraging primitives like Durable Objects and Workflows. Cloudflare emphasizes improved price-performance, prefix caching, a session-affinity header, and a redesigned asynchronous API to lower latency and inference costs for agentic workloads.

Cloudflare Agentic AI LLM Security Product Launch

March 19, 2026

AWS adds NIXL with EFA to accelerate LLM inference at scale

⚡ AWS now supports NVIDIA Inference Xfer Library (NIXL) with Elastic Fabric Adapter (EFA) on all EFA-enabled EC2 instances and regions. This integration accelerates disaggregated LLM inference by increasing KV-cache throughput, lowering inter-token latency, and optimizing KV-cache memory use between prefill and decode nodes. NIXL interoperates with frameworks such as NVIDIA Dynamo, SGLang, and vLLM. Supported versions are NIXL 1.0.0+ and EFA installer 1.47.0+, available at no extra cost.

AWS EC2 Nvidia LLM Security AI Runtime Security

March 18, 2026

Custom AI Apps to Dominate Incident Response Workloads

🛡️ Gartner warns custom-built AI applications will increasingly strain security teams unless defenders are engaged early. It predicts that by 2028 at least half of enterprise incident response work will handle fallout from AI app security issues. Analysts urge teams to "shift left" to embed controls during development, and expect AI security platforms to be widely adopted within two years to enforce guardrails and mitigate prompt injection, data misuse and related threats.

LLM Security AI Security Prompt Injection Attack Incident Response

March 18, 2026

CISOs Reevaluate Data Protection Amid Rapid AI Use

🔐 CISOs are updating data protection strategies as employees rapidly adopt AI tools that access and expose sensitive information. Leaders such as Scott Kopcha at Goodwin Procter and experts from SANS and Health-ISAC warn that traditional controls and many DLP tools are insufficient for the multiple ways AI can interact with data. Organizations are prioritizing data classification, identity and access management, continual monitoring, zero-trust, and ongoing vendor evaluations to close gaps and show due diligence.

Data Loss Prevention Data Security AI Security LLM Security

March 18, 2026

Amazon OpenSearch Service Adds Support for OpenSearch 3.5

🚀 Amazon OpenSearch Service now supports OpenSearch 3.5, introducing agentic AI enhancements, improved search relevance tooling, and expanded observability capabilities. The update adds persistent agentic conversation memory to capture multi-turn context and tool reasoning, plus context management that automatically truncates and summarizes inputs to reduce LLM token costs. A redesigned no-code agent interface with MCP integration and expanded search relevance workbench—including LLM-powered evaluation and scheduled experiments—helps teams tune and validate agent-driven search experiences without code.

AWS Agentic AI MCP Product Update

March 17, 2026

CISOs Struggle to Secure AI as Adoption Outpaces Defenses

🔒 The Pentera AI and Adversarial Testing Benchmark Report 2026, based on a survey of 300 US CISOs and senior security leaders, finds that most security teams lack the tools and skills to secure AI systems. 67% of respondents report limited visibility into AI usage, while half cite a lack of internal expertise. Organizations largely extend legacy security controls—75%—and only 11% use AI-specific tools.

AI Security LLM Security Threat Report

March 12, 2026

AI-Generated Slopoly Backdoor Used in Interlock Attack

🔒 A PowerShell backdoor called Slopoly, likely generated with an LLM, was used in an Interlock ransomware intrusion that allowed attackers to persist on a compromised server for over a week and exfiltrate data. IBM X-Force observed developer-style comments, structured logging, clear variable names, and robust error handling that suggest AI-assisted creation. Deployed to C:\ProgramData\Microsoft\Windows\Runtime\, Slopoly beacons to a C2 endpoint, polls for commands, executes them via cmd.exe, and establishes persistence as a scheduled task.

LLM Security Malware Ransomware Incident

March 12, 2026

AWS Open-Source LZA MCP Server Enables AI-Assisted

🤖 The open-source Landing Zone Accelerator (LZA) Model Context Protocol (MCP) Server enables management of LZA deployments via natural-language conversations with AI assistants. The containerized MCP endpoint provides 20 specialized tools to search documentation across LZA versions, manage configurations, monitor pipelines, and surface actionable failure insights. It integrates with IDEs such as Kiro, Amazon Q Developer, and Claude Code, uses temporary credentials per AWS security best practices, and is available now in supported commercial and GovCloud regions.

AWS MCP LLM Security

March 12, 2026

Hive0163 Deploys AI-Assisted Slopoly in Ransomware Ops

🛡️ IBM X-Force researchers have linked a PowerShell backdoor called Slopoly to financially motivated group Hive0163 and report indicators that portions of the script were likely produced with a large language model. The builder-delivered payload establishes persistence via a scheduled task named Runtime Broker and was used to maintain access for more than a week in a 2026 ransomware incident. Slopoly beacons system details every 30 seconds, polls for commands every 50 seconds, executes via cmd.exe and returns results to a C2 server. Although the script lacks true self-modifying polymorphism, its comments, logging and naming conventions demonstrate how AI can accelerate malware development.

IBM LLM Security Malware Ransomware

March 12, 2026

Reduce 429 Errors and Build Resilient Vertex AI Apps

⚠️ Building LLM applications on Vertex AI can trigger 429 errors when request rates exceed available throughput, degrading user experience and increasing retries. This article explains consumption options—Standard and Priority PayGo, Provisioned Throughput, Flex PayGo, and Batch—and prescribes five operational practices: smart retries, global model routing, context caching, prompt optimization, and traffic shaping. Combining these approaches (for example PT for critical real-time traffic and Batch for latency-tolerant jobs) helps preserve performance and control costs.

Google Vertex AI LLM Security How-To

March 11, 2026

Researchers Find Major Security Flaws in LLM Guardrails

🔒 Researchers at Unit 42, Palo Alto Networks' lab, have demonstrated that LLM-based safety and evaluation systems — called AI Judges — can be manipulated via prompt-injection-style token sequences. Their custom fuzzer, AdvJudge-Zero, probes models in a black-box manner, finding low-perplexity formatting tokens that shift internal attention and increase the likelihood of an 'allow' decision. Unit 42 recorded a 99% bypass rate across multiple architectures, and showed that adversarial retraining on fuzzer-discovered examples can reduce that success rate to near zero.

Unit 42 Palo Alto Networks LLM Security AI Guardrails

March 11, 2026

Cyber-Attacks on UK Firms Rise Nearly Fourfold YoY

📈 The February 2026 Check Point Global Threat Intelligence report found UK organisations saw fewer weekly attacks per organisation (1,504) than the global average (2,086), but a 36% year‑on‑year increase — nearly four times the global 9.8% rise. Education, energy & utilities, government, healthcare and financial services were among the most frequently targeted UK sectors. Ransomware remained acute, with 49 active groups and a plurality of victims attributed to Qilin, Clop and The Gentlemen. The report also warned that widespread, unmanaged GenAI use is elevating inadvertent data‑exposure risk, with one in 31 prompts judged high risk.

Check Point Threat Report Ransomware LLM Security

March 11, 2026

Agentic AI Security: Assessing Risks and Defenses Now

🛡️ Organizations are adopting agentic AI—autonomous, task-driven systems powered by LLMs—to streamline processes and boost throughput. These agents can plan, act, and iterate, but their non-deterministic behavior creates gaps in traceability, auditability, and access control. Apply strong role-based access, threat modeling, and oversight (human or independent evaluators) to limit exposure and ensure safe deployment.

Agentic AI Agent Security LLM Security AI Safety