< ciso
brief />
Tag Banner

All news with #amazon bedrock tag

173 articles · page 8 of 9

Amazon Connect launches generative AI for email support

📧 Amazon Connect now provides generative AI-powered email conversation overviews, suggested actions, and draft responses to help agents resolve customer emails faster and more consistently. Administrators enable the capability by adding the Amazon Q in Connect block to contact flows before an email is assigned to an agent. Outputs can be customized with knowledge bases and tailored prompts to align responses with company tone and policies. The feature is available in all regions where Amazon Q in Connect is offered.
read more →

Open-Source MCP Server for Amazon Bedrock AgentCore

🚀 The open-source Model Context Protocol (MCP) Server is now available for Amazon Bedrock AgentCore, providing a standardized interface that enables developers to analyze, transform, and deploy production-ready AI agents within their preferred development environments. The release includes one-click installation and integrates with agentic IDEs like Kiro and AI coding assistants such as Claude Code, Cursor, and the Amazon Q Developer CLI. Developers can use natural language to iteratively build agents, convert agent logic to the AgentCore SDK, and deploy into development accounts. Documentation and installation instructions are published in the MCP Server GitHub repository, with additional implementation guidance and pricing details available in the AgentCore documentation and pricing resources.
read more →

Cohere Embed v4 Multimodal Embeddings on Amazon Bedrock

🚀 Amazon Bedrock now supports Cohere Embed v4, a multimodal embedding model that generates high-quality embeddings for text, images, and complex business documents. The model natively processes tables, charts, diagrams, code snippets, and handwritten notes, reducing the need for extensive preprocessing and data cleanup. It supports over 100 languages and includes industry fine-tuning for finance, healthcare, and manufacturing. Cohere Embed v4 is available for on-demand inference in select AWS Regions; access is requested via the Bedrock console.
read more →

Amazon Bedrock Data Automation Adds Enhanced Transcription

🔊 Amazon Bedrock Data Automation (BDA) now offers enhanced transcription with speaker diarization and channel identification, letting developers separate and process individual speakers or channels in audio files. It also provides a guided, natural language blueprint workflow for extracting custom audio insights. These capabilities simplify reading and analysis of multi-party recordings—customer calls, telehealth visits, webinars, public-safety recordings, and meetings—and support subtitle creation, compliance monitoring, and productivity analysis. BDA is available in seven AWS Regions.
read more →

Secure Network Architectures for Generative AI on AWS

🔐 This post explains how to design defense-in-depth network architectures for generative AI workloads using AWS services. It outlines common external threats — including layer 4 and layer 7 DDoS, web request floods, application-specific exploits, and malicious bots — and maps mitigations to AWS capabilities. The guidance recommends private connectivity via Amazon Bedrock and AWS PrivateLink, edge protections with AWS WAF and AWS Shield, subnet-level controls using AWS Network Firewall, and continuous detection and response with GuardDuty, Inspector, and CloudWatch.
read more →

Anthropic Claude Sonnet 4.5 Now Available in Bedrock

🚀 Anthropic’s Claude Sonnet 4.5 is now available through Amazon Bedrock, providing managed API access to the company’s most capable model. The model leads SWE-bench Verified benchmarks with improved instruction following, stronger code-refactoring judgment, and enhanced production-ready code generation. Bedrock adds automated context editing and a memory tool to extend usable context and boost accuracy for long-running agents across global regions.
read more →

Amazon Bedrock Launches in Middle East (UAE) Region

🚀 Amazon Bedrock is now available in the Middle East (UAE) Region, enabling customers to build, experiment with, and scale generative AI applications using a broad selection of foundation models (FMs) and integrated developer tools. The managed service provides capabilities to deploy and operate agents and production workloads with built-in controls for security and operational management. Customers in the region can begin using Bedrock today and should consult the documentation for supported models, APIs, and recommended practices.
read more →

Amazon Bedrock Now Available in Israel (Tel Aviv) Region

🚀 Beginning today, Amazon Bedrock is available in the Israel (Tel Aviv) region, enabling customers to build and scale generative AI applications with local infrastructure. The managed service connects organizations to a variety of foundation models (FMs) and provides tools to deploy and operate agents, reducing time-to-production. Local availability can lower latency, support regional compliance needs, and help move projects from experimentation to real-world deployment.
read more →

Amazon Bedrock Available in Thailand, Malaysia, and Taipei

🚀 Amazon has launched Amazon Bedrock in the Asia Pacific (Thailand), Asia Pacific (Malaysia), and Asia Pacific (Taipei) regions, enabling local customers to build and scale generative AI applications using a range of foundation models and developer tools. The managed service supports deploying agents and productionizing models to shorten the path from experimentation to real-world deployment. Customers can expect improved latency, regional data residency options, and integration with AWS operational and security services.
read more →

Source-of-Truth Authorization for RAG Knowledge Bases

🔒 This post presents an architecture to enforce strong, source-of-truth authorization for Retrieval-Augmented Generation (RAG) knowledge bases using Amazon S3 Access Grants with Amazon Bedrock. It explains why vector DB metadata filtering is insufficient—permission changes can be delayed and complex identity memberships are hard to represent—and recommends validating permissions at the data source before returning chunks to an LLM. The blog includes a practical Python walkthrough for exchanging identity tokens, retrieving caller grant scopes, filtering returned chunks, and logging withheld items to reduce the risk of sensitive data leaking into LLM prompts.
read more →

Amazon Q Developer CLI Adds Remote MCP Server Support

🔒 Amazon Q Developer CLI now supports remote MCP servers to centralize tool integrations and OAuth-based authentication, enhancing scalability and security in development workflows. Administrators specify HTTP transport, the authentication URL, and optional headers in agent configuration or mcp.json. Upon successful OAuth authentication, the CLI enumerates tools on the MCP server and exposes them to the agent. This capability is available in both the CLI and the Amazon Q Developer IDE plugins.
read more →

Stability AI Image Services Now Available in Amazon Bedrock

🖼️ Amazon Bedrock now includes Stability AI Image Services, a suite of nine specialized image-editing tools available via the Bedrock API. The offering splits into Edit tools (Remove Background, Erase Object, Search and Replace, Search and Recolor, Inpaint) and Control tools (Structure, Sketch, Style Guide, Style Transfer). It is currently supported in US West (Oregon), US East (N. Virginia), and US East (Ohio), and is intended to accelerate professional creative workflows with granular edit control.
read more →

OpenAI Open-Weight Models Now in Eight More AWS Regions

🚀 AWS has expanded availability of OpenAI open weight models on Amazon Bedrock to eight additional regions. The update adds US East (N. Virginia), Asia Pacific (Tokyo), Europe (Stockholm), Asia Pacific (Mumbai), Europe (Ireland), South America (São Paulo), Europe (London), and Europe (Milan) to the previously supported US West (Oregon). This broader regional coverage reduces network latency, helps meet data residency preferences, and makes it easier for customers to deploy AI-powered applications closer to their users. Customers can access the models through the Amazon Bedrock console and supporting documentation to get started.
read more →

DeepSeek-V3.1 Available as Fully Managed in Bedrock

🔍 DeepSeek-V3.1 is now available as a fully managed foundation model in Amazon Bedrock, offering an open-weight option designed for enterprise deployment. The model supports a selectable 'thinking' mode for step-by-step analysis and a faster non-thinking mode for quicker replies, with improved multilingual accuracy and reduced hallucinations. Enhanced tool-calling, transparent reasoning, and strong coding and analytical performance make it well suited for building AI agents, automating workflows, and tackling complex technical tasks. DeepSeek-V3.1 is available in US West (Oregon), Asia Pacific (Tokyo, Mumbai), and Europe (London, Stockholm).
read more →

Amazon Bedrock Adds Four Qwen3 Open-Weight Models Now

🤖 Amazon Web Services added four Qwen3 open-weight foundation models to Amazon Bedrock as fully managed, serverless offerings. The lineup—Qwen3-Coder-480B-A35B-Instruct, Qwen3-Coder-30B-A3B-Instruct, Qwen3-235B-A22B-Instruct-2507, and Qwen3-32B—covers both dense and Mixture-of-Experts (MoE) architectures. The coder variants specialize in agentic coding, function calling, and tool use, while the 235B and 32B models provide general reasoning and efficient dense computation. These models are available now across multiple AWS regions, enabling developers to build advanced AI applications without managing infrastructure.
read more →

AWS Bedrock Adds OpenAI Open‑Weight Models in Eight Regions

🚀 AWS has expanded availability of OpenAI open weight models on AWS Bedrock to eight additional AWS Regions worldwide. The update brings the models to US East (N. Virginia), Asia Pacific (Tokyo, Mumbai), Europe (Stockholm, Ireland, London, Milan) and South America (São Paulo), alongside existing US West (Oregon) support. This broader footprint aims to lower latency, improve model performance and help customers meet data residency requirements. To get started, use the Amazon Bedrock console or consult the documentation.
read more →

Amazon OpenSearch Serverless Adds Disk-Optimized Vectors

🔍 Amazon has added disk-optimized vector storage to OpenSearch Serverless, offering a lower-cost alternative to memory-optimized vectors while maintaining equivalent accuracy and recall. The disk-optimized option may introduce slightly higher latency, so it is best suited for semantic search, recommendation systems, and other AI search scenarios that do not require sub-millisecond responses. As a fully managed service, OpenSearch Serverless continues to automatically scale compute capacity (measured in OCUs) to match workload demands.
read more →

On-demand deployment for custom Meta Llama models on Bedrock

🚀 Amazon Bedrock now offers an on-demand deployment option for customized Meta Llama 3.3 models that have been fine-tuned or distilled in Bedrock; models customized on or after September 15, 2025 are eligible. The feature lets customers process requests in real time and pay only for consumed compute, removing the need for pre-provisioned always-on resources. Bedrock continues to provide a managed platform with built-in security, privacy, and responsible AI capabilities.
read more →

Amazon Bedrock AgentCore Gateway gains PrivateLink, logs

🔒 AWS announced that Amazon Bedrock AgentCore Gateway now supports AWS PrivateLink for private VPC access and adds invocation logging to Amazon CloudWatch, Amazon S3, and Amazon Data Firehose. These updates allow agent traffic to avoid the public internet while sending per-invocation logs to common observability and storage services. The combination improves network isolation, governance, and operational visibility. AgentCore Gateway is currently in preview in US East (N. Virginia), US West (Oregon), Asia Pacific (Sydney), and Europe (Frankfurt).
read more →

TwelveLabs Marengo 2.7 Embeddings Now Synchronous in Bedrock

Amazon Bedrock now supports synchronous inference for TwelveLabs Marengo Embed 2.7, delivering low-latency text and image embeddings directly in API responses. Previously optimized for asynchronous processing of large video, audio, and image files, Marengo 2.7’s new mode enables responsive search and retrieval features—such as instant natural-language video search and image similarity discovery—while retaining advanced video understanding via asynchronous workflows.
read more →