In a startling incident that has raised fresh concerns about the reliability of AI infrastructure, Amazon Web Services (AWS) — the world’s largest cloud provider — recently experienced a widespread service outage that lasted approximately 13 hours. According to industry reports, the prolonged outage was triggered by an internal AWS system that utilized AI-powered automation to manage operations — a system that ironically became part of the problem rather than the solution. The incident affected multiple AWS services, causing disruptions for businesses, developers, and end users that rely on cloud computing for everything from hosting websites to running enterprise software. Given AWS’s dominant position in global cloud infrastructure, even a single outage of this magnitude reverberated across digital ecosystems. What Went Wrong At the heart of the disruption was an AI-powered automation tool that AWS used to optimise internal processes. Instead of improving operational efficiency, a misconfiguration or error in the AI system caused it to inadvertently trigger a cascade of failures in AWS’s infrastructure. Rather than immediately switching to manual control, the automated system continued to make adjustments that compounded the disruption. Engineers eventually had to intervene directly to halt the faulty automation and restore services — a process…  ​Read MoreBusiness Archives – Trak.in – Indian Business of Tech, Mobile & Startups