All news with #text-to-speech tag

Thu, October 2, 2025

Google Cloud Releases Generative Media Models on Vertex AI

🎨 Google Cloud announced general availability and feature updates for its generative media models on Vertex AI, including Gemini 2.5 Flash Image, Veo 3, Imagen 4, and Gemini 2.5 TTS. The release emphasizes production readiness and enterprise security, and adds image generation in multiple aspect ratios, batch image processing, vertical 9:16 video with precise duration controls, and studio‑quality multi‑speaker text‑to‑speech across 70+ languages. These enhancements target teams that want faster, more controlled, and more scalable media workflows across image, audio, and video.

Wed, August 27, 2025

Cloudflare Workers AI Adds Leonardo and Deepgram Models

🚀 Cloudflare is expanding Workers AI to include closed-source partner models from Leonardo and Deepgram, bringing optimized image generation and real-time audio capabilities to the edge. The launch includes Leonardo's @cf/leonardo/phoenix-1.0 and @cf/leonardo/lucid-origin, and Deepgram's @cf/deepgram/nova-3 and @cf/deepgram/aura-1. These models run on Cloudflare's low-latency GPU infrastructure and integrate with Workers, R2, Images, and Realtime for end-to-end developer workflows.
