< ciso
brief />
Tag Banner

All news with #google cloud tag

379 articles · page 9 of 19

Scaling MoE Inference with NVIDIA Dynamo on A4X Rack-Scale

🚀 This post describes two validated deployment recipes for serving large Mixture-of-Experts (MoE) models on Google Cloud's A4X machines using NVIDIA Dynamo. The recipes provide throughput- and latency-optimized configurations that exploit the 72‑GPU GB200 NVL72 rack fabric, WideEP/DeepEP parallelism, global KV cache, and GKE-aware rack-level scheduling. Performance validation reports >6K tokens/sec/GPU for the throughput recipe and a 10ms median inter-token latency for the latency-optimized recipe.
read more →

Google Cloud's New RaMP Incentives for Cloud Migration

🚀 Google Cloud has refreshed the Rapid Migration and Modernization Program (RaMP) to incentivize cloud migrations with service credits tied to incremental usage and funded partner and professional services. The program offers enhanced rewards for advanced workloads—SAP, Oracle, VMware, NetApp and data analytics—to help offset higher technical costs. RaMP is positioned to reduce technical debt, accelerate AI readiness by making data accessible to Vertex AI and Gemini, and provide a funded path for assessment and implementation.
read more →

Testing Apps Exposed Online Used to Breach Fortune 500

⚠️ A recent Pentera investigation discovered nearly 2,000 intentionally vulnerable security-testing web applications (DVWA, OWASP Juice Shop, Hackazon, bWAPP) exposed on the public internet, often running from overly privileged cloud accounts on AWS, GCP and Azure. Attackers exploited these instances to deploy crypto miners, install webshells and create persistence mechanisms, then pivot to sensitive cloud resources. Affected vendors including Cloudflare, F5 and Palo Alto Networks were notified and remediated issues. Pentera recommends inventories, isolation of test systems, enforcement of least-privilege IAM, and elimination of default credentials.
read more →

Google Cloud Opens New Bangkok Region to Boost Thai AI

🚀 Google Cloud has launched a new Bangkok (asia-southeast3) region to deliver low-latency, high-performance cloud services while enabling local data residency under Thailand’s PDPA. The region is part of a USD $1 billion investment and is expected to generate THB 1.4 trillion (US$41 billion) in economic value over five years and support roughly 130,000 jobs annually. It offers certified security controls (ISO/IEC, PCI DSS, SOC), default encryption, customer-managed keys, and direct access to Vertex AI, enterprise Gemini, and generative models to accelerate local AI adoption.
read more →

Getting Started with Gemini 3 Flash on Google Cloud

🚀 This post introduces Gemini 3 Flash, Google’s low-latency, cost-efficient model in the Gemini 3 family, optimized for advanced reasoning, multimodal understanding, and agentic workflows. It guides developers through obtaining an API key from Google AI Studio and configuring it for local use or environment-based invocation. The article demonstrates interactive prompt testing in the Playground, explains toggles like Structured outputs and Thinking level, and shows how to export language-specific sample code via the "Get code" feature to run with the Google GenAI SDK.
read more →

Palo Alto Networks Builds Multi-Tenant Unified Data Platform

🚀Palo Alto Networks partnered with Google Cloud to replace a brittle single-tenant data pipeline model with a unified, multi-tenant Unified Data Platform powered by Dataflow, Pub/Sub and BigQuery. The migration consolidated more than 30,000 pipelines into a shared, autoscaling platform that processes billions of events daily. The change delivered roughly 30% compute cost savings, faster onboarding, and reduced operational overhead, enabling engineers to refocus on analytics and threat detection.
read more →

Practical Guidance for Building Securely with SAIF on Cloud

🔐 Tom Curry and Anton Chuvakin from Google Cloud’s Office of the CISO present practical guidance for implementing the Secure AI Framework (SAIF) on Google Cloud. The piece emphasizes three operational principles: treat data as the perimeter, treat prompts like code, and require identity propagation for agentic AI. It maps 15 common AI risks to controls and highlights concrete tools and patterns—IAM, Dataplex, Vertex AI, Model Armor, Gemini, Apigee, and the Agent Development Kit—to operationalize SAIF.
read more →

Agent Factory Recap: Reinforcement Learning on TPUs

🤖 This recap of the Agent Factory holiday special summarizes practical guidance on model fine-tuning, with a focus on reinforcement learning (RL) and Google’s TPU infrastructure. Hosts Shir Meir Lador and Don McCasland speak with Kyle Meggs from the TPU Training Team about when to fine-tune, the distinction between pre‑training, SFT, and RL, and why specialized workloads benefit from hosted solutions like MaxText on TPUs. The post also demonstrates a GRPO demo using Pathways, vLLM, and Tunix components to show RL at scale.
read more →

gRPC as a Native Transport for the Model Context Protocol

🔗 Google Cloud describes work to enable gRPC as a native transport for the Model Context Protocol (MCP), offering an alternative to JSON-RPC transcoding for organizations that already use gRPC. Native gRPC eliminates the need for transcoding gateways and preserves existing tooling, while delivering lower latency, smaller Protobuf-encoded payloads, and full-duplex streaming. The MCP core maintainers agreed to pluggable transports in the SDK, and Google Cloud will contribute a community-backed gRPC transport package to promote consistent, interoperable deployments.
read more →

De-risking Network Migration with VPC Flow Logs & Analyzer

🔍 Hackensack Meridian Health used VPC Flow Logs and Flow Analyzer to obtain precise, end-to-end visibility of Cloud Interconnect traffic before a major Google Cloud network migration. They enabled VLAN-attachment flow logs, aggregated ingress/egress flows (IPs, ports, bytes, timestamps), and organized results into sankey diagrams mapping data center → region → VPC → application. This process revealed critical flows early and shortened incident detection to 3 minutes and resolution to 5 minutes, materially de-risking the cutover.
read more →

FINRA Modernizes Software Delivery Using DORA and DevOps

🔍 FINRA partnered with Google Cloud to adopt the DORA metrics and a data-first DevOps approach to shorten lead times and modernize its software lifecycle. A DORA workshop revealed lengthy User Acceptance Testing (UAT) cycles as a primary bottleneck, enabling a multi-million-dollar business case for a dedicated sandbox to accelerate testing and deployment. The initiative standardized DORA across teams and targets full adoption within the year.
read more →

Gemini CLI: Preconfigured Google Cloud Monitoring Dashboards

🔍 Google Cloud has enhanced Gemini CLI telemetry with pre-configured Google Cloud Monitoring dashboards that provide immediate visibility into adoption, usage patterns, and performance. By exporting data via OpenTelemetry, teams can use out-of-the-box visualizations or analyze raw logs and metrics to build custom views. Setup is simplified through direct GCP exporters and a three-step flow—project ID, authentication and IAM roles, and updating .gemini/settings.json—so telemetry can be live quickly.
read more →

Check Point Adds Google Cloud Network Security Integration

🔒 Check Point now supports Google Cloud Network Security Integration, offering a nondisruptive approach to deploying cloud firewalls that minimizes downtime and avoids performance degradation. The integration enables organizations—particularly in regulated sectors such as financial services, healthcare, and government—to scale hybrid network security while preserving latency and throughput. It simplifies deployment, centralizes policy management, and helps maintain compliance without rearchitecting existing networks.
read more →

Cloud SQL for MySQL: Optimized Writes Boost Throughput

⚡ Cloud SQL for MySQL Enterprise Plus now includes optimized writes, an automated runtime tuning suite that adjusts MySQL configuration and I/O behavior to reduce write latency and increase throughput. Enabled by default on Enterprise Plus instances, the feature implements adaptive purge, adaptive I/O limits, sharded I/O, faster REDO recovery, and adaptive buffer-pool warmup. Google provides a reproducible sysbench benchmark and reports up to 3x write throughput improvements versus the Enterprise edition, with results varying by machine type and workload.
read more →

Google Cloud Joins Auto-ISAC to Strengthen Vehicle Security

🚗 Google Cloud has joined the Automotive Information Sharing and Analysis Center as an Innovator Partner, pledging experts and resources to bolster vehicle and supply-chain cybersecurity. The partnership will bring threat intelligence and incident response expertise — including insights from Mandiant — to help members anticipate, mitigate, and respond to attacks against cloud-connected, software‑defined vehicles and Industry 4.0 environments. Google cites a $10 billion cybersecurity investment over five years as part of its broader commitment.
read more →

Google Cloud: VM Extensions Manager for Compute Engine

🚀 VM Extensions Manager is now available in preview as part of the compute.googleapis.com API, enabling administrators to centrally define policies that install and manage Google-provided extensions across VM fleets. The preview supports zonal project policies and key agents — Cloud Ops Agent (ops-agent), Agent for SAP (sap-extension), and Agent for Compute Workload (workload-extension) — with options to pin versions or use automatic rollouts. Policies are enforced by a progressive rollout engine, and Google will expand global, Organization, and Folder-level policy support in the coming months.
read more →

Persistent Cloud Misconfigurations Still Put Data at Risk

🔒 A Qualys survey and analysis of roughly 44 million public-cloud VMs highlights widespread misconfiguration: 45% of AWS, 63% of GCP and 70% of Azure instances showed issues. Respondents reported breaches and identified misconfigured services as a leading cloud risk. Experts cite neglected logging, monitoring and MFA, rushed M&A integrations and understaffed small firms as common causes. The piece recommends concrete controls — from Infrastructure as Code and continuous scanning to private networking and least-privilege — to reduce exposure.
read more →

Cybercriminals Abuse Google Cloud to Send Phishing Emails

📧 Check Point disclosed a large-scale phishing campaign that abused Google Cloud Application Integration to send authentic-looking messages from noreply-application-integration@google[.]com, enabling attackers to bypass SPF and DMARC protections. The emails mimicked routine enterprise notifications to prompt clicks and redirected victims through Google Cloud storage to a fake CAPTCHA and a counterfeit Microsoft login page. Google has blocked the abuse and is implementing further mitigations.
read more →

Google Data Cloud updates: 2025 database and AI features

📢Google Cloud’s Data Cloud updates through mid‑2025 introduce new self‑service Looker features, expanded Model Context Protocol (MCP) support, and tighter AI-to-data integrations. Highlights include AlloyDB AI time‑series forecasting via AI.FORECAST, GA of Conversational Analytics powered by Gemini, and the MCP Toolbox and ADK to securely connect agents to BigQuery, Spanner, Cloud SQL, and Looker. Dataplex Universal Catalog now previews curated data products for governed, deployable datasets and AI use.
read more →

Equifax’s Security Overhaul: Culture and Cloud as Core

🔒 Since the 2017 breach, Equifax has pursued a comprehensive security transformation, investing nearly $3 billion to rebuild technology and migrate to Google Cloud under NIST-aligned frameworks. The company reports that security is now embedded across processes and incentivized through employee bonuses, with regional CISOs adapting programs to EU rules like DORA and NIS2. Equifax says it neutralizes millions of threats daily and uses a hybrid approach to AI-driven attacks, combining multiple layers of controls rather than relying on a single technology.
read more →