Tag Banner

All news with #efa tag

Fri, September 12, 2025

AWS Adds Five EFA Metrics to Improve Network Observability

🔍 AWS has introduced five new Elastic Fabric Adapter (EFA) metrics to improve network observability for AI/ML and HPC workloads. The counters track retransmitted packets and bytes, retransmit timeouts, impaired remote connections, and unresponsive remote receivers at the per-EFA device level. Available on Nitro v4+ instances with EFA installer 1.43.0+, metrics are exposed via sysfs and can be exported to Prometheus and tools like Grafana for monitoring and alerting.

read more →