Widespread internet outage reported as Amazon Web Services works on issue

AWS Outage: Timeline and Impact

A major outage at Amazon Web Services (AWS), the world’s largest cloud provider, disrupted internet services worldwide starting early on October 20, 2025. The incident, centered in the US-EAST-1 region (Northern Virginia), affected dozens of AWS services and cascaded to thousands of customer websites and apps, including high-profile platforms like Venmo, Snapchat, Reddit, Ring, Alexa, Fortnite, and even airline systems. By late October 20, AWS reported significant recovery progress, but some services like Reddit and dating app Hinge remained partially impacted into October 21, with full restoration ongoing.

Key Timeline

  • Onset (October 20, ~12:11 AM PDT / 3:11 AM EDT): Increased error rates and latencies reported across multiple AWS services in US-EAST-1, beginning with DNS resolution failures for DynamoDB API endpoints. This quickly spread to dependent services, causing elevated errors for EC2 instance launches (e.g., “Insufficient Capacity” issues), Lambda function polling delays, and backlogs in event processing for CloudTrail and EventBridge.
  • Peak Impact (Early Morning EDT): By 1:26 AM PDT, the outage hit 78 services, including core infrastructure like Amazon EC2, RDS, ECS, SQS, and global features such as IAM updates and DynamoDB Global Tables. Support case creation was also disrupted. Thousands of sites went down, with Downdetector reporting spikes in complaints for affected apps.
  • Mitigation Efforts (2:00–7:00 AM PDT): AWS identified the DNS issue as the root cause and applied fixes, including network connectivity restorations across availability zones (AZs). By 3:35 AM PDT, most operations were succeeding with some throttling; 33 services were fully resolved by mid-morning, including DynamoDB, SQS, S3, and IAM. Recommendations included retrying requests, using multi-AZ configurations for EC2, and flushing DNS caches.
  • Latest Status (as of October 20, 7:29 AM PDT / ~10:29 AM EDT): Early recovery signs observed, but elevated EC2 launch errors persisted in some AZs, with rate limiting in place to aid stabilization. Global services tied to US-EAST-1 recovered, and backlogs were processing normally. AWS classified the severity as “Degraded” and committed to updates every 30–45 minutes. No new incidents reported by October 21 morning.

Affected Services and Ripple Effects

The outage’s breadth stemmed from US-EAST-1’s role as a backbone for global AWS operations—many services default to or rely on it. Key impacts:

  • AWS Services: 78 total, with hits to compute (EC2, Lambda), storage (DynamoDB, S3), databases (RDS, DocumentDB), messaging (SQS, SNS), and more. Gaming (GameLift), AI/ML (SageMaker, Bedrock), and security (GuardDuty, Secrets Manager) were among the hardest hit.
  • Consumer Apps and Sites:
  • Payments/Finance: Venmo remained down for transfers and notifications into October 21.
  • Social/Media: Snapchat, Reddit, and Wordle experienced login and loading failures; Fortnite servers lagged.
  • Smart Home/Devices: Ring doorbells and Alexa skills failed to respond.
  • Other: Airlines like Delta and United reported check-in disruptions; streaming and e-commerce sites saw intermittent outages.
  • Global Reach: While US-EAST-1 was primary, latency spiked worldwide due to interconnected services.

Cause and AWS Response

Preliminary analysis points to a DNS resolution failure in DynamoDB endpoints, triggering cascading failures in API calls and resource provisioning. AWS engineers engaged immediately, applying mitigations like AZ-specific fixes and request retries. The company emphasized no evidence of malicious activity and urged customers to monitor the AWS Health Dashboard for personalized alerts.

Reactions and Broader Implications

  • Users and Businesses: Frustration peaked on social media and forums, with complaints about lost productivity—e.g., developers unable to deploy code, e-commerce sites missing sales. Some users joked about “going back to carrier pigeons,” while enterprises highlighted the risks of over-reliance on single-region cloud setups.
  • Experts: Analysts noted this as a reminder of AWS’s ~33% market dominance, underscoring the need for multi-cloud strategies. One expert called it “a perfect storm of dependency,” potentially costing millions in downtime.
  • AWS Statement: In updates, the company apologized for the disruption and confirmed “full mitigation” efforts, with systems “back online” for most users by midday October 20. They promised a post-incident review.

As of October 21, 2025, the outage appears largely resolved, but AWS continues monitoring for residuals. Customers are advised to check the official status page for region-specific details. This event echoes past AWS disruptions (e.g., 2021 US-EAST-1 outage), fueling discussions on cloud resilience.

WhatsApp and Telegram Button Code
WhatsApp Group Join Now
Telegram Group Join Now
Instagram Group Join Now

Leave a Reply