< ciso brief />

All news with #aws tag

1977 articles · page 57 of 99

SageMaker AI Adds Flexible Training Plans for Inference

⚙️ Amazon SageMaker AI's Flexible Training Plans (FTP) now support inference endpoints, allowing customers to reserve guaranteed GPU capacity for planned evaluations and production peaks. You choose instance types, compute requirements, reservation length, and start date, then reference the reservation ARN when creating an endpoint. SageMaker AI automatically provisions and runs the endpoint on the reserved capacity for the plan duration, removing much of the infrastructure scheduling overhead. FTP for inference is initially available in US East (N. Virginia), US West (Oregon), and US East (Ohio).

Amazon Bedrock Reserved Tier for Predictable Performance

🔒 Amazon Bedrock now offers a Reserved service tier that provides prioritized compute and guaranteed input/output tokens-per-minute capacity for inference workloads. Customers can reserve asymmetric input and output capacities to match workload patterns, and excess traffic overflows automatically to the pay-as-you-go Standard tier to keep operations running. The tier targets 99.5% model response uptime and is available today for Anthropic Claude Sonnet 4.5, with 1- or 3-month reservations billed monthly at a fixed price per 1K tokens-per-minute.
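The overflow behavior described above can be pictured with a little arithmetic. The following is a minimal sketch, not Bedrock's actual accounting: `split_demand`, `TierSplit`, and all the numbers are hypothetical, and it assumes each direction (input or output) is capped independently at its reserved tokens-per-minute.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TierSplit:
    reserved_tpm: int  # tokens per minute served from the reservation
    standard_tpm: int  # tokens per minute that overflow to pay-as-you-go


def split_demand(demand_tpm: int, reserved_capacity_tpm: int) -> TierSplit:
    """Split one direction (input or output) of demand across the two tiers."""
    if demand_tpm < 0 or reserved_capacity_tpm < 0:
        raise ValueError("token rates must be non-negative")
    on_reserved = min(demand_tpm, reserved_capacity_tpm)
    return TierSplit(on_reserved, demand_tpm - on_reserved)


# Hypothetical asymmetric reservation: more input capacity than output capacity.
input_split = split_demand(demand_tpm=120_000, reserved_capacity_tpm=100_000)
output_split = split_demand(demand_tpm=8_000, reserved_capacity_tpm=10_000)
print(input_split)
print(output_split)
```

In this example the input side exceeds its reservation, so only the excess would be billed at Standard rates, while the output side stays entirely within reserved capacity.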

AWS Knowledge MCP Server Adds Topic-Based Search for Domains

🔎 The AWS Knowledge MCP Server now supports topic-based search across specialized documentation domains, enabling more precise queries against areas such as Troubleshooting, AWS Amplify, AWS CDK, CDK Constructs, and AWS CloudFormation. This enhancement lets MCP clients and agentic frameworks target domain-specific resources to reduce noise and improve relevance. The capability complements existing API reference and general documentation search features and is available immediately at no additional cost, subject to standard rate limits.

AWS Network Firewall Proxy Now Available in Preview

🔒 AWS has launched Network Firewall Proxy in public preview, providing centralized controls to block data exfiltration and malware injection across application traffic. In explicit proxy mode you can set up filters in just a few clicks to control outbound requests and the responses your applications receive, protect against domain or SNI spoofing, and restrict access to trusted domains or IPs. The service supports TLS inspection and granular HTTP header filtering, and emits detailed logs to Amazon S3 and Amazon CloudWatch. Preview access is free in US East (Ohio).

AWS Issues Behavioral Guidelines for Network Scanning

🔍 AWS published behavioral guidelines for network scanning to help legitimate scanners distinguish themselves from malicious actors when probing AWS IP space. The guidance defines four pillars—observational, identifiable, cooperative, and confidential—and gives practical examples (non‑mutating checks, reverse DNS, meaningful user‑agents, opt‑out mechanisms). Conforming scanners should limit impact, secure collected data, and respect opt‑out requests to reduce abuse reports and improve internet security.

AWS Glue: Zero-ETL Replication for Self-Managed Databases

🔁 AWS Glue now supports zero-ETL for self-managed database sources, enabling no-code replication from Oracle, SQL Server, MySQL, and PostgreSQL hosted on-premises or on EC2 to Amazon Redshift. The feature auto-creates ongoing integrations to simplify setup, reduce operational overhead, and eliminate much of the engineering work previously required to build ingestion pipelines. It is available in multiple AWS Regions and aims to save teams weeks of engineering effort.

OpenSearch Service Introduces Agentic Search for NLP Queries

🔎 Amazon Web Services has introduced Agentic Search for OpenSearch Service, an agent-driven layer that interprets natural-language intent, orchestrates search tools, and generates OpenSearch DSL queries while providing transparent summaries of its decision process. The built-in QueryPlanningTool uses LLMs to plan and emit DSL, removing the need for manual query syntax. Two agent types are available: conversational agents with memory and flow agents optimized for throughput. Administrators can configure agents via APIs or OpenSearch Dashboards, and Agentic Search is supported on OpenSearch Service version 3.3+ across AWS Commercial and GovCloud regions.

AWS Glue Data Quality Adds Preprocessing Queries Support

🛠️ AWS announces general availability of AWS Glue Data Quality preprocessing queries, enabling transformations before running data quality checks through the Glue Data Catalog APIs. The feature lets you create derived columns, filter datasets, perform calculations, and validate column relationships as part of the quality evaluation. This capability removes separate preprocessing steps, streamlines workflows, and tailors recommendations and rules to specific data subsets across commercial AWS Regions.

AWS Glue Data Quality Adds Rule Labeling for Reporting

🔖 AWS has made AWS Glue Data Quality rule labeling generally available, allowing teams to attach custom key-value labels to data quality rules for better organization and targeted reporting. Labels can represent business context, team ownership, compliance tags, or priority and can be authored in DQDL. Queryable in rule outcomes, row-level results, and APIs, labels enable focused reports and streamlined remediation workflows across all commercial AWS Regions where the service is available.

Amazon Quick Suite: Scheduling for Quick Flows Automation

🕒 Amazon Quick Flows now supports scheduled execution, allowing teams to automate repetitive workflows at specified times or custom intervals. You can configure flows to run daily, weekly, monthly, or on custom schedules, and you can schedule any flow you can access, whether you created it or it was shared with you. Scheduling is set via the Quick Flows scheduling icon and is available now in US East (N. Virginia), US West (Oregon), and Europe (Ireland). There are no additional charges beyond standard Quick Flows usage, and common use cases include recurring report generation, summarizing open items in external services, and producing daily meeting briefings.

AWS Service Quotas: Automatic Quota Management Launch

🚀 AWS announced general availability of automatic quota management in Service Quotas. The feature sends configurable notifications (email, SMS, Slack) via the Service Quotas console or API when usage approaches allocated limits and can automatically and safely adjust service quota values in response to observed consumption. This reduces operational overhead from tracking and requesting quota increases across accounts and Regions and helps prevent unexpected interruptions. The capability is available at no additional cost in all AWS commercial regions.

SageMaker AI Inference Adds Bidirectional Streaming

🎙️ Amazon SageMaker AI Inference now supports bidirectional streaming, enabling real-time speech-to-text transcription that returns partial transcripts while audio is still being captured. Using the new Bidirectional Stream API, clients open an HTTP/2 connection to the SageMaker AI runtime, which automatically creates a WebSocket to your model container so audio frames and interim transcripts flow continuously. Any container that implements a WebSocket handler per the SageMaker AI contract works out of the box, allowing real-time models such as Deepgram to run without modification. The feature eliminates weeks or months of custom streaming infrastructure work so teams can focus on model accuracy, latency tuning, and agent behavior.

Amazon SageMaker Adds EAGLE for Faster Inference Throughput

⚡ Amazon SageMaker AI now supports EAGLE (Extrapolation Algorithm for Greater Language-model Efficiency) speculative decoding to boost large language model inference throughput by up to 2.5x. The capability enables models to predict and validate multiple tokens in parallel rather than one at a time, preserving output quality while reducing latency. SageMaker automatically selects between EAGLE 2 and EAGLE 3 depending on model architecture and provides built‑in optimization jobs using curated or customer datasets. Optimized models can be deployed through existing SageMaker inference workflows without infrastructure changes, and the feature is available in select AWS Regions.
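The "predict and validate multiple tokens" idea can be illustrated with a toy draft-and-verify loop. This is a generic greedy speculative decoding sketch, not EAGLE itself and not SageMaker code: the two "models" are hypothetical arithmetic functions, and a real system would verify all proposed positions in one batched forward pass rather than a Python loop.

```python
from typing import Callable, List, Sequence

Model = Callable[[Sequence[int]], int]  # maps a token prefix to the next token


def greedy_decode(target: Model, prompt: List[int], n_new: int) -> List[int]:
    """Baseline: the target model emits one token per step."""
    tokens = list(prompt)
    for _ in range(n_new):
        tokens.append(target(tokens))
    return tokens


def speculative_decode(target: Model, draft: Model, prompt: List[int],
                       n_new: int, k: int = 4) -> List[int]:
    """Draft proposes up to k tokens; target keeps the longest agreeing prefix."""
    tokens = list(prompt)
    goal = len(prompt) + n_new
    while len(tokens) < goal:
        # 1. The cheap draft model proposes a short run of tokens.
        proposal: List[int] = []
        for _ in range(min(k, goal - len(tokens))):
            proposal.append(draft(tokens + proposal))
        # 2. The target checks each position (conceptually in parallel).
        for i, guess in enumerate(proposal):
            expected = target(tokens + proposal[:i])
            if guess != expected:
                # First mismatch: keep the accepted prefix plus the target's token.
                tokens.extend(proposal[:i])
                tokens.append(expected)
                break
        else:
            tokens.extend(proposal)  # every proposed token was accepted
    return tokens


def target_model(prefix: Sequence[int]) -> int:
    # Hypothetical stand-in for a large model.
    return (sum(prefix) * 3 + 1) % 7


def draft_model(prefix: Sequence[int]) -> int:
    # Hypothetical stand-in for a small model that is sometimes wrong.
    guess = target_model(prefix)
    return (guess + 1) % 7 if len(prefix) % 3 == 0 else guess


baseline = greedy_decode(target_model, [1, 2], 10)
speculative = speculative_decode(target_model, draft_model, [1, 2], 10)
print(baseline == speculative)
```

Because every accepted token is checked against the target, the output matches plain greedy decoding; the speedup in practice comes from accepting several draft tokens per target pass.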

AWS IoT Core: Retrieve Thing Registry Data via Rules

🔧 AWS IoT Core now supports dynamic retrieval of thing registry data directly within IoT rules using the new get_registry_data() inline rule function. You can access device attributes, device type, and group membership to filter, enrich, and route messages — for example, routing lifecycle events or enriching gateway messages with sensor thresholds. The feature is available in all AWS regions where AWS IoT Core is present and can be used immediately via the IoT developer guide and API.

Amazon CloudFront Adds Mutual TLS Authentication

🔒 Amazon CloudFront now supports mutual TLS (mTLS), enabling both server and client authentication with X.509 certificates at AWS edge locations. Customers can require trusted client certificates to access distributions, reducing unauthorized access to APIs and applications. Typical uses include secure B2B API integrations and IoT device authentication. mTLS is available at no additional cost and configurable via Console, CLI, SDK, CDK, and CloudFormation.

Amazon EC2 Introduces Interruptible Capacity Reservations

🔁 Amazon EC2 now offers interruptible On‑Demand Capacity Reservations (ODCRs), allowing reservation owners to temporarily make unused reserved capacity available to other internal workloads while retaining the ability to reclaim it. Consumers using interruptible ODCRs receive an interruption notice to allow graceful shutdown or checkpointing. The capability targets flexible, fault‑tolerant tasks such as batch processing, data analysis, and machine learning training, is available at no additional cost, and CloudFormation support is coming soon.

Amazon OpenSearch Service Adds PPL and Natural Language

🔍 Amazon OpenSearch Service now makes Piped Processing Language (PPL) and natural language the default experience in the OpenSearch UI Observability workspace. The release introduces 35+ new commands for deep analysis, faceted exploration, and natural-language querying, and integrates OpenTelemetry ingestion pipelines to simplify onboarding. Users can run enterprise-grade queries, correlate events, and move directly from query to visualization to reduce mean time to detect and resolve issues.

Fluent Bit Vulnerabilities Threaten Cloud and Kubernetes

⚠️ Researchers disclosed five vulnerabilities in Fluent Bit, the open-source telemetry agent, that can be chained to bypass authentication, write or overwrite files, execute code, corrupt logs, and cause denial-of-service conditions. CERT/CC noted many issues require network access, and fixes were released in Fluent Bit 4.1.1 and 4.0.12 with AWS participating in coordinated disclosure. Operators are urged to update immediately and apply mitigations such as avoiding dynamic tags, mounting configs read-only, and running the agent as a non-root user.
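For operators auditing a fleet, a quick version comparison against the fixed releases named above may help. This is a minimal sketch under stated assumptions: `has_fix` and `parse_version` are hypothetical helpers, version strings are assumed to be plain `major.minor.patch`, branches newer than 4.1 are assumed to include the fixes, and branches older than 4.0 are treated as unpatched because the summary names no fix for them.

```python
from typing import Tuple

# Fixed releases named in the summary above, keyed by (major, minor) branch.
FIXED = {(4, 0): (4, 0, 12), (4, 1): (4, 1, 1)}


def parse_version(text: str) -> Tuple[int, int, int]:
    """Parse 'major.minor.patch', tolerating a leading 'v'."""
    parts = text.strip().lstrip("v").split(".")
    if len(parts) != 3:
        raise ValueError(f"expected major.minor.patch, got {text!r}")
    major, minor, patch = (int(p) for p in parts)
    return major, minor, patch


def has_fix(version: str) -> bool:
    """Return True if this Fluent Bit version is at or past a fixed release."""
    parsed = parse_version(version)
    branch = parsed[:2]
    if branch in FIXED:
        return parsed >= FIXED[branch]
    # Assumption: newer branches carry the fixes; older branches do not.
    return branch > max(FIXED)


for candidate in ("4.0.11", "4.0.12", "4.1.0", "4.1.1"):
    print(candidate, has_fix(candidate))
```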

AWS MediaTailor Adds HLS Interstitials for Live Streams

🎥 AWS Elemental MediaTailor now supports HLS Interstitials for live streams, enabling broadcasters and streaming services to insert interstitial ads using the HLS Interstitials specification. MediaTailor generates necessary metadata tags (EXT-X-DATERANGE with X-ASSET-LIST) for compatible players such as HLS.js, Shaka, Bitmovin and Apple devices, removing the need for custom client stitching. The capability integrates with MediaTailor's server-side ad insertion for frame-accurate, buffer-free transitions and preserves server-side beaconing for measurement. It supports pre-roll and mid-roll, can be toggled per playback session via a manifest query parameter, and is available in all Regions where MediaTailor operates on a pay-as-you-go basis.
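To make the metadata concrete, here is a rough sketch of how a client might read an `EXT-X-DATERANGE` tag and pull out its `X-ASSET-LIST` URL. The sample tag, its URL, and the helper names are hypothetical; the parser handles only the simple attribute-list shape shown and is not a full HLS playlist parser, nor a reproduction of MediaTailor's exact output.

```python
import re
from typing import Dict, Optional

PREFIX = "#EXT-X-DATERANGE:"
# NAME=value pairs; values are either quoted strings or bare tokens.
_ATTR = re.compile(r'([A-Z0-9-]+)=("[^"]*"|[^,]*)')


def parse_daterange(line: str) -> Dict[str, str]:
    """Return the attributes of an EXT-X-DATERANGE tag as a dict of strings."""
    if not line.startswith(PREFIX):
        raise ValueError("not an EXT-X-DATERANGE tag")
    attrs: Dict[str, str] = {}
    for name, value in _ATTR.findall(line[len(PREFIX):]):
        attrs[name] = value.strip('"')
    return attrs


def interstitial_asset_list(line: str) -> Optional[str]:
    """Return the X-ASSET-LIST URL if the tag carries one, else None."""
    return parse_daterange(line).get("X-ASSET-LIST")


# Hypothetical tag for a 30-second break; the URL is illustrative only.
sample = (
    '#EXT-X-DATERANGE:ID="ad-break-1",CLASS="com.apple.hls.interstitial",'
    'START-DATE="2025-01-01T00:00:30Z",DURATION=30.0,'
    'X-ASSET-LIST="https://ads.example.com/break-1.json?a=1,b=2"'
)
attrs = parse_daterange(sample)
print(attrs["ID"], attrs["DURATION"])
print(interstitial_asset_list(sample))
```

Quoted values are matched as a whole, so a comma inside the asset-list URL does not split the attribute.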

Amazon Connect Flow Modules: Custom I/O and Versioning

🧩 Amazon Connect flow modules now support custom inputs, outputs and branching logic, plus advanced versioning and alias management. You can define flexible parameters for reusable modules—for example, an authentication module that accepts a phone number and PIN and returns customer name and authentication status with branches like authenticated or not authenticated. Immutable version snapshots and alias mapping let teams update module implementations while flows referencing an alias automatically use the new version, simplifying maintenance and reuse.
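The version-and-alias pattern described above can be modeled in a few lines. This is a conceptual sketch only, not the Amazon Connect API: `ModuleRegistry`, its methods, the alias name, and the module definitions are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ModuleRegistry:
    """Immutable version snapshots plus movable aliases for one module."""
    versions: List[str] = field(default_factory=list)    # index 0 holds version 1
    aliases: Dict[str, int] = field(default_factory=dict)

    def publish(self, definition: str) -> int:
        """Store a new snapshot and return its version number."""
        self.versions.append(definition)
        return len(self.versions)

    def point_alias(self, alias: str, version: int) -> None:
        """Map an alias to an existing version."""
        if not 1 <= version <= len(self.versions):
            raise KeyError(f"unknown version {version}")
        self.aliases[alias] = version

    def resolve(self, alias: str) -> str:
        """Return the definition a flow would get when it references the alias."""
        return self.versions[self.aliases[alias] - 1]


registry = ModuleRegistry()
v1 = registry.publish("authenticate: phone number + PIN")
registry.point_alias("live", v1)
before = registry.resolve("live")

v2 = registry.publish("authenticate: phone number + PIN + lockout")
still_v1 = registry.resolve("live")   # publishing alone does not move the alias
registry.point_alias("live", v2)
after = registry.resolve("live")
print(before, "->", after)
```

Flows that reference the alias pick up the new implementation once the alias moves, while earlier snapshots stay available for rollback.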