< ciso
brief />
Tag Banner

All news with #amazon sagemaker ai tag

78 articles

Nemotron-3-Super-120B and Qwen3.5 Models Added to SageMaker

πŸš€ Amazon SageMaker JumpStart now includes NVIDIA’s Nemotron-3-Super-120B and the Qwen3.5 family (9B and 27B), giving customers turnkey access to foundation models optimized for agentic reasoning, multilingual coding, and advanced instruction following. Nemotron-3-Super-120B employs a hybrid LatentMixture-of-Experts architecture with Mamba-2 and MoE layers to support collaborative agents and high-volume automation such as IT ticket triage and cybersecurity workflows. The Qwen3.5-9B prioritizes efficiency for resource-constrained environments, while Qwen3.5-27B offers deeper contextual and multimodal reasoning for large-scale document processing and complex scenarios. Users can deploy these models directly from the JumpStart catalog or programmatically via the SageMaker Python SDK.
read more β†’

SageMaker HyperPod Adds Gang Scheduling for EKS Clusters

βœ… Amazon SageMaker HyperPod task governance now supports gang scheduling for HyperPod clusters using the EKS orchestrator. Administrators can configure readiness timeouts, node-failure behavior, single-workload admission and retry policies so distributed training jobs are only admitted when all required pods are ready. Pulled workloads are automatically requeued to avoid stalls and wasted compute. This reduces deadlocks, resource contention, and unexpected cost overruns for multi-pod training jobs.
read more β†’

Amazon SageMaker Serverless Workflows for Identity Center

βš™οΈ Amazon SageMaker Unified Studio now supports Serverless Workflows in Identity Center domains, allowing customers to orchestrate data-processing tasks with Apache Airflow (via Managed Workflows for Apache Airflow) without provisioning Airflow infrastructure. Serverless Workflows auto-provision compute during runs and release it afterward, so you pay only for actual run time. Each workflow runs with its own execution role and isolated worker to ensure workflow-level security and prevent cross-workflow interference. The Visual Workflow experience supports around 200 operators and built-in integrations with services such as Amazon S3, Amazon Redshift, Amazon EMR, AWS Glue, and Amazon SageMaker AI.
read more β†’

SageMaker Unified Studio adds notebook import/export

πŸ“ Amazon has added import/export capabilities to SageMaker Unified Studio notebooks to simplify migration from JupyterLab and other platforms. The feature supports .ipynb, .json, and .py formats while preserving cell types, outputs, execution history, and metadata. Exports are available in four package types (.zip with requirements, .ipynb, .py, and native .json). The release also introduces developer productivity features including cell reordering, keyboard shortcuts, cell renaming, and multi-line SQL with tabbed results.
read more β†’

Amazon SageMaker Data Agent Adds Charts, SQL, and MVs

πŸ“Š Amazon SageMaker Data Agent now embeds interactive charting, SQL analytics across Snowflake sources, and materialized view management directly inside SageMaker Unified Studio notebooks. You can ask natural-language prompts like "plot monthly revenue trends by region for 2025" to generate interactive charts that support hover, editing, and refinement without writing code. When analyses span AWS and Snowflake, the agent lets you join Snowflake tables via external connections with AWS Glue Data Catalog data in a single prompt. The agent can also recommend and create materialized views, including refresh schedules, to optimize query performance.
read more β†’

SageMaker Data Agent adds Japan and Australia CRI support

πŸ”’ SageMaker Data Agent now supports cross-region inference profiles for Japan (JP-CRIS) and Australia (AU-CRIS) via Amazon Bedrock. Inference requests originating in Asia Pacific (Tokyo) and Asia Pacific (Sydney) are processed entirely within their respective geographies, helping customers meet data residency and sovereignty requirements. Data Agent continues to provide conversational data exploration, Python and SQL code generation, troubleshooting, and analytics inside SageMaker Unified Studio Notebooks and the Query Editor, with traffic routed exclusively over the AWS Global Network.
read more β†’

SageMaker Unified Studio: CloudWatch Metrics for Glue Jobs

πŸ” Amazon SageMaker Unified Studio now surfaces Amazon CloudWatch metrics for AWS Glue jobs directly alongside job logs in a single, unified interface. Data engineers can correlate DPU utilization, memory consumption, CPU load, and data movement size with log output to diagnose compute bottlenecks and memory pressure faster. The consolidated view reduces mean time to resolution for ETL pipelines and is available in all Regions where SageMaker Unified Studio is generally available. To view metrics, open a Glue job run and select the Metrics tab.
read more β†’

Amazon SageMaker Data Agent in Query Editor for SQL

πŸ”Data Agent in the Amazon SageMaker Unified Studio Query Editor brings natural-language-to-SQL capabilities to your SQL analytics workflow. You can ask questions in plain language and have the agent generate context-aware SQL for Amazon Redshift and Amazon Athena, propose step-by-step plans, and use Fix with AI to diagnose and correct failed queries. It preserves query context across follow-ups and is available in IAM domains where SageMaker Unified Studio is supported.
read more β†’

SageMaker Studio Now Supports Remote Kiro and Cursor IDEs

πŸ”— AWS now enables remote connections from Kiro and Cursor IDEs to Amazon SageMaker Studio. Data scientists, ML engineers, and developers can use their local Kiro/Cursor setups β€” including spec-driven development, conversational coding, and automated feature generation β€” while running workloads on SageMaker's scalable cloud compute. Authentication is supported via the AWS Toolkit extension or SageMaker Studio's web UI, preserving Studio security boundaries and reducing context switching.
read more β†’

Cursor IDE connects remotely to SageMaker Unified Studio

πŸ”— AWS announced remote connection from Cursor IDE to SageMaker Unified Studio via the AWS Toolkit extension. This integration lets data scientists, ML engineers, and developers use their local Cursor setup β€” including AI-powered code completion, natural language editing, and multi-file editing β€” while leveraging SageMaker's scalable compute and data. Authentication is handled securely through AWS IAM via the Toolkit, preserving custom Cursor settings and enterprise-grade security.
read more β†’

AWS Batch Adds Quota Management and Preemption for SageMaker

βš™οΈ AWS Batch now supports quota management with job preemption for SageMaker Training, enabling prioritized GPU allocation and automatic preemption of lower-priority workloads. You can create up to 20 quota shares per job queue and choose resource-sharing strategies, with both cross-share and in-share preemption modes to restore or reallocate borrowed capacity. Capacity utilization is visible at queue, quota share, and job levels, and you can update job priorities after submission and set preemption retry limits. The feature integrates with the SageMaker Python SDK via the aws_batch module and is available in all AWS Regions where AWS Batch is offered; AWS provides an example notebook and user-guide documentation for implementation guidance.
read more β†’

Amazon SageMaker AI Adds Serverless Customization for Models

πŸš€ Amazon SageMaker AI now offers serverless model customization and reinforcement fine-tuning for 12 additional open‑weight models, enabling SFT, DPO, and advanced RFT techniques such as RLVR and RLAIF without infrastructure management. You can fine‑tune and evaluate these models on a pay‑per‑use basis across multiple regions. This simplifies alignment for complex, domain‑specific tasks and improves accuracy on verifiable tasks like code generation and structured extraction. No cluster setup, capacity planning, or distributed training expertise is required.
read more β†’

SageMaker HyperPod Adds Continuous Provisioning for Slurm

πŸš€ Amazon SageMaker HyperPod now supports continuous provisioning for clusters using the Slurm orchestrator, allowing training jobs to start immediately on available instances while remaining capacity is provisioned in the background. Priority-based provisioning brings up the Slurm controller first, then login and worker nodes in parallel, with asynchronous retries for failed launches. The feature reduces time-to-training, improves utilization, and removes the need for manual scaling interventions.
read more β†’

Amazon SageMaker Unified Studio Adds Custom Filters

πŸ”Ž Amazon SageMaker Unified Studio now supports custom metadata search filters, enabling teams to narrow catalog results using organization-specific attributes like business region, data classification, or study name. Filters accept string fields with a contains operator and numeric fields (Integer, Long) with equals, greater than, and less than operators. Users can also filter by asset name, description, and date range, combine multiple filters, and retain selections across browser sessions; the feature is available in all AWS Regions where Unified Studio is supported.
read more β†’

Amazon SageMaker Unified Studio: Aggregated Lineage

πŸ” Amazon SageMaker Unified Studio now offers an aggregated lineage view that shows all jobs contributing to a dataset across multiple levels of the lineage graph. The aggregated view is the default for IdC-based domains, while the previous event-timestamp snapshot can be restored via a "display in event timestamp order" toggle. A new QueryGraph API returns lineage node graphs with metadata and augmented business context; the capability is available in all SageMaker Unified Studio regions, with documentation and API references provided.
read more β†’

Amazon SageMaker Unified Studio Aggregated Lineage View

πŸ” Amazon SageMaker Unified Studio now provides an aggregated view of data lineage that consolidates all jobs contributing to a dataset. The aggregated view displays multi-level transformations and dependencies to help you identify upstream sources and downstream consumers across the full lineage graph. It is the default for IdC-based domains, with an option to revert to the previous event-timestamp-ordered view, and the new QueryGraph API exposes node graphs with metadata and augmented business context.
read more β†’

Amazon SageMaker Training Plan Extension Now Available

πŸ” Amazon SageMaker Training Plans can now be extended to cover AI training runs that take longer than originally scheduled. Extensions are available in 1-day increments up to 14 days, or 7-day increments up to 182 days, and can be purchased through the SageMaker console or via API. Once an extension is purchased the reserved GPU capacity (clusters up to 64 instances) continues to run without interruption, and SageMaker automatically provisions infrastructure so workloads keep running without reconfiguration. The feature helps teams maintain cost-efficient training schedules and reduce operational disruptions.
read more β†’

Amazon SageMaker Unified Studio Adds Light Mode Option

πŸ”† AWS has added light mode support to Amazon SageMaker Unified Studio for IAM-based domains, allowing users to choose between dark and light visual themes. The addition improves readability in bright environments and offers a familiar look for customers who prefer lighter interfaces. In Studio, select Profile > customize appearance to switch modes. The setting persists across browsers and devices and complements the existing dark mode to give users full control over their development environment's appearance.
read more β†’

Amazon SageMaker HyperPod Adds RIG Observability for Training

πŸ” Amazon SageMaker HyperPod now provides integrated observability for Restricted Instance Groups (RIG), giving teams training foundation models with Nova Forge a unified view of compute resources and training workloads. A pre-configured Amazon Managed Grafana dashboard, backed by Amazon Managed Service for Prometheus, aggregates metrics from four exporters to show GPU utilization, NVLink bandwidth, CPU pressure, FSx for Lustre usage, network fabric, Kubernetes state, and curated logs including epoch progress, step-level logs, pipeline errors, and Python tracebacks. Observability is automatically enabled for new RIG clusters and can be turned on for existing clusters via the HyperPod console; it is available in all Regions where SageMaker HyperPod RIG is supported.
read more β†’

SageMaker Unified Studio Syncs Catalog Metadata to Partners

πŸ” Amazon SageMaker Unified Studio now synchronizes catalog metadata and context with Atlan, Collibra, and Alation, aligning projects, assets, descriptions, glossary terms, and hierarchies across platforms. Collibra supports bidirectional synchronization and can manage SageMaker Unified Studio data access requests, while Atlan and Alation ingest metadata from SageMaker with additional enhancements planned. The Collibra integration is provided as an open-source solution on GitHub, and setup is performed by establishing connections from each partner to SageMaker Unified Studio.
read more β†’