All news with the #model poisoning tag
Tue, August 26, 2025
Block Unsafe LLM Prompts with Firewall for AI at the Edge
🛡️ Cloudflare has integrated unsafe content moderation into Firewall for AI, using Llama Guard 3 to detect and block harmful prompts in real time at the network edge. The model-agnostic filter identifies categories including hate, violence, sexual content, criminal planning, and self-harm, and lets teams block or log flagged prompts without changing application code. Detection runs on Workers AI across Cloudflare's GPU fleet with a 2-second analysis cutoff, and logs record categories but not raw prompt text. The feature is available in beta to existing customers.
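The block-or-log behavior described above can be sketched in a few lines. This is an illustrative sketch only, not Cloudflare's implementation: the category names and the `gate` function are assumptions chosen to mirror the categories listed in the summary, and the classifier itself (Llama Guard 3 on Workers AI) is replaced by a plain list of flagged labels.

```python
# Hypothetical moderation gate: maps Llama Guard-style category labels
# to an allow / block / log decision. Category names are assumptions.
BLOCKED_CATEGORIES = {
    "hate", "violence", "sexual_content", "criminal_planning", "self_harm",
}

def gate(flagged_categories, mode="block"):
    """Decide what to do with a prompt given its flagged categories.

    mode="block" rejects unsafe prompts; mode="log" only records them,
    mirroring the block-or-log choice described in the summary.
    """
    hits = BLOCKED_CATEGORIES & set(flagged_categories)
    if not hits:
        return {"action": "allow", "categories": []}
    # Record category labels only, never raw prompt text.
    return {
        "action": "block" if mode == "block" else "log",
        "categories": sorted(hits),
    }

print(gate(["hate", "spam"]))           # blocked: "hate" matched
print(gate(["spam"]))                   # allowed: no category matched
print(gate(["self_harm"], mode="log"))  # logged only, prompt passes through
```

The key design point echoed here is that the log record carries only category labels, so no raw prompt text leaves the decision path.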
Sun, August 24, 2025
Cloudflare AI Week 2025: Securing AI, Protecting Content
🔒 Cloudflare this week outlined a multi-pronged plan to help organizations build secure, production-grade AI experiences while protecting original content and infrastructure. The company will roll out controls to detect Shadow AI, enforce approved AI toolchains, and harden models against poisoning or misuse. It is also expanding Crawl Control for content owners and enhancing the AI Gateway with caching, observability, and framework integrations to reduce risk and operational cost.