All news with #apache parquet tag
Wed, November 12, 2025
Amazon CloudWatch Adds Network Load Balancer Access Logs
🔍 Amazon CloudWatch Logs now ingests Network Load Balancer (NLB) access logs as vended logs, enabling direct analysis within CloudWatch. You can run CloudWatch Logs Insights queries, create metric filters, and use Live Tail for real‑time traffic review to accelerate troubleshooting. NLB access logs are configurable from the NLB integrations tab, AWS CLI, or SDKs, and can also be delivered to Amazon Data Firehose or S3 with optional Apache Parquet conversion. Delivery to CloudWatch and Firehose is billed as vended logs; S3 delivery is free while Parquet conversion carries a per‑GB charge.
Thu, September 25, 2025
R2 SQL Deep Dive: Serverless Queries over R2 Data Platform
⚡ R2 SQL is Cloudflare’s serverless query engine that runs SQL directly against Iceberg tables stored in R2, eliminating the need for Spark or Trino clusters. The Query Planner uses R2 Data Catalog metadata and multi-level stats to prune manifests, files, and Parquet row groups so only necessary bytes are read. Execution is distributed across Cloudflare’s network using Workers and query workers running Apache DataFusion, with results serialized via Apache Arrow. An ordered, streaming planning pipeline enables early termination for ORDER BY ... LIMIT queries; R2 SQL is currently available in open beta.