
A significant Amazon Web Services (AWS) outage on Monday morning caused a ripple effect across the internet, leaving millions of users unable to access a wide array of popular applications and websites, including educational apps, major British banks, airlines, and gaming services. Thousands of outage reports were logged in the U.S. and the U.K.
The outage also affected Amazon's own platforms. Among the services reportedly impacted is Slack: the popular cloud-based team communication platform, owned by Salesforce since 2021, is also an AWS customer, and some users told TechNadu they were unable to open huddles.
The disruption stemmed from an operational issue within Amazon's cloud computing division, which provides essential backend infrastructure for a large portion of the internet.
The issues began around 3 a.m. ET. More than 6,000 outage reports from affected U.S. customers were quickly logged on monitoring sites like DownDetector, with another 1,600 and counting from U.K. users, according to the Daily Mail.
Amazon acknowledged the problem on its AWS Health Dashboard, confirming an "operational issue" was impacting multiple services. The potential source of the failure is the US-EAST-1 region in Northern Virginia, one of the most critical data center hubs in the world.
“We have identified a potential root cause for error rates for the DynamoDB APIs in the US-EAST-1 Region. Based on our investigation, the issue appears to be related to DNS resolution of the DynamoDB API endpoint in US-EAST-1,” says the announcement.
“Global services or features that rely on US-EAST-1 endpoints such as IAM updates and DynamoDB Global tables may also be experiencing issues.”
This single point of failure led to a cascading effect, demonstrating the high level of dependency that global services have on a concentrated set of cloud infrastructure.
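To illustrate the kind of failure AWS described, the short Python sketch below checks whether a regional DynamoDB endpoint name resolves in DNS and, if not, looks for another region whose endpoint does. It is a minimal illustration under stated assumptions, not a description of how AWS or any affected service handles the problem: the helper names (endpoint_for, resolves, first_resolvable_region) are hypothetical, and trying a second region is an assumed mitigation that would not help global features tied to US-EAST-1.

```python
import socket
from typing import Callable, Iterable, Optional


def endpoint_for(region: str) -> str:
    """Build the regional DynamoDB hostname, e.g. dynamodb.us-east-1.amazonaws.com."""
    return f"dynamodb.{region}.amazonaws.com"


def resolves(hostname: str, resolver: Callable = socket.getaddrinfo) -> bool:
    """Return True if the hostname resolves to at least one address."""
    try:
        return len(resolver(hostname, 443)) > 0
    except OSError:
        # socket.gaierror (a DNS lookup failure) is a subclass of OSError.
        return False


def first_resolvable_region(
    regions: Iterable[str], resolver: Callable = socket.getaddrinfo
) -> Optional[str]:
    """Return the first region whose endpoint resolves, or None if none do.

    Hypothetical helper: the fallback order is chosen by the caller.
    """
    for region in regions:
        if resolves(endpoint_for(region), resolver):
            return region
    return None
```

Calling first_resolvable_region(["us-east-1", "us-west-2"]) performs real DNS lookups. The resolver parameter exists so a lookup failure can be simulated without network access.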
While Amazon's engineers worked to mitigate the issue, cybersecurity experts began to analyze the situation. Although a coordinated cyberattack could not be entirely ruled out pending a full post-incident report, the initial assessment pointed toward an internal technical error.
Experts noted that such outages often result from configuration mistakes that cascade through the system. The incident serves as a stark reminder of the internet's fragility and the risks associated with a heavy reliance on a few major cloud providers.
With AWS controlling a substantial portion of the global cloud market, any disruption in its services can have far-reaching consequences for businesses and users worldwide.
In June, a major Google Cloud and Cloudflare outage impacted Google, YouTube, AWS, and other leading tech services.