New set of flexible data center components will support the next generation of generative AI innovation and provide 12% more compute power, while improving availability and efficiency
At AWS re:Invent, Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), today announced new data center components designed to support the next generation of artificial intelligence (AI) innovation and customers’ evolving needs. These capabilities combine innovations in power, cooling, and hardware design to create a more energy efficient data center that will underpin further customer innovation. These new capabilities will be implemented globally in AWS’s new data centers, and many components are already deployed in its existing data centers. To learn more, visit: https://www.youtube.com/watch?v=NUIrNGwUu14.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20241201928268/en/
“AWS continues to relentlessly innovate its infrastructure to build the most performant, resilient, secure, and sustainable cloud for customers worldwide,” said Prasad Kalyanaraman, vice president of Infrastructure Services at AWS. “These data center capabilities represent an important step forward with increased energy efficiency and flexible support for emerging workloads. But what is even more exciting is that they are designed to be modular, so that we are able to retrofit our existing infrastructure for liquid cooling and energy efficiency to power generative AI applications and lower our carbon footprint.”
AWS has been building large-scale data centers for 18 years and GPU-based servers for AI workloads for 13 years. Today, AWS’s data centers support millions of active customers worldwide, including hundreds of thousands of customers using AWS AI and machine learning services, and tens of thousands of global customers using Amazon Bedrock to build their generative AI applications. As use of generative AI continues to grow and GPU capacity demands increase, AWS data centers are adapting to support increasingly higher power densities. Key improvements include:
1. Simplified electrical and mechanical design for high availability
AWS continuously focuses on offering customers the most reliable infrastructure. Simplified electrical and mechanical designs are more reliable and easier to maintain, ensuring that customers enjoy the benefits of high reliability that AWS has offered from the beginning.
AWS’s latest data center design improvements include simplified electrical distribution and mechanical systems, which enable infrastructure availability of 99.9999%. The simplified systems also reduce the potential number of racks that can be impacted by electrical issues by 89%.
In a data center, electricity goes through multiple conversion and distribution systems before reaching the IT equipment. Each step naturally introduces inefficiency, energy loss, and potential failure points. As one new design example, AWS simplified the electrical distribution and in doing so, reduced the number of potential failure points by 20%. Other examples of simplifications include bringing backup power closer to the rack and reducing the number of fans that are used to exhaust hot air. AWS is using the natural pressure differential to exhaust hot air, which improves the amount of electricity available for servers. All of these changes help reduce overall energy consumption while minimizing the risk of failures.
2. Innovations in cooling, rack design, and control systems
AWS has built a number of new and enhanced capabilities to offer customers the most performant, highly available, and energy efficient infrastructure possible. New data center innovations include:
- Liquid cooling: Newer AI servers benefit from liquid cooling to more efficiently cool high density compute chips. AWS has developed novel mechanical cooling solutions providing configurable liquid-to-chip cooling in both its new and existing data centers. Some AWS technologies utilize network and storage infrastructure that does not require liquid cooling, so updated cooling systems will seamlessly integrate air and liquid cooling capabilities for the most powerful AI chipsets, like AWS Trainium2 and rack-scale AI supercomputing solutions like NVIDIA GB200 NVL72, as well as AWS’s network switches and storage servers. This flexible, multimodal cooling design allows AWS to provide maximum performance and efficiency at the lowest cost, whether running traditional workloads or AI models. The unique liquid cooling rack design was developed in collaboration with leading chip manufacturers to accelerate time to market for AI workloads.
- Support for high-density AI workloads: AWS is maximizing how power is used by optimizing how it positions racks in a data center. This was achieved through software, powered by data and generative AI, that predicts the most efficient way to land servers. AWS will now reduce the amount of stranded power – energy that is available but unused or underutilized – and make more efficient use of the energy available.
This design will support the next generation of hardware and high-density racks required for AI workloads, but is flexible enough to accommodate a wide range of other hardware types. AWS infrastructure offers the broadest and deepest compute platform with more than 750 Amazon Elastic Cloud Compute (Amazon EC2) instances, giving customers choice of the latest processor, storage, networking, operating system, and purchase model for any workload. In addition to the flexible multimodal cooling design, AWS has developed engineering innovations in its power delivery systems, which enable AWS to support a 6x increase in rack power density over the next two years, and another 3x increase in the future. This is delivered in part by a new power shelf, which efficiently delivers data center power throughout the rack, reducing electrical conversion losses.
Taken together, these innovations enable AWS to deliver 12% more compute power per site for customer workloads. These changes will reduce the overall number of data centers needed to deliver the same amount of compute capacity.
- Updated control systems: The rollout of an Amazon-owned control system across AWS’s electrical and mechanical devices provides the ability to standardize monitoring, alarming, and operational sequences. For example, AWS’s internally built telemetry tools use AWS technologies to provide real-time diagnostics and troubleshooting services, both of which enable AWS to maintain optimal operating conditions on behalf of customers. In addition, AWS has increased the redundancy in its controls systems, while reducing complexity. These benefits result in AWS designing for infrastructure availability of 99.9999%.
3. Increased energy efficiency and sustainability, including 46% reduction in mechanical energy consumption and 35% reduction in embodied carbon in the concrete used
For many years, AWS has been a pioneer in improving energy efficiency and sustainability across its infrastructure. Research estimates AWS’s infrastructure is currently up to 4.1 times more efficient than on-premises infrastructure, and when workloads are optimized on AWS, the associated carbon footprint can be reduced by up to 99%. In 2023, Amazon achieved its goal to match all of the electricity consumed by its operations with 100% renewable energy – seven years ahead of its 2030 goal.
AWS continuously reevaluates how its data centers operate and determines ways to help its infrastructure use energy more efficiently through ongoing innovation. The new components include the following upgrades for energy efficiency and sustainability:
- A more efficient cooling system that is expected to reduce mechanical energy consumption by up to 46% compared to its previous design during peak cooling conditions, without increasing water usage on a per-megawatt basis. Design changes include a new single-sided cooling system, reduction in cooling equipment, and introduction of liquid cooling capabilities.
- Reduction of embodied carbonin the concreteof the data center building shell by up to 35%, compared to industry average. AWS is adopting specifications for lower-carbon steel and concrete, and optimizing the structural design to use less steel overall.
- Backup generators will be able to run on renewable diesel, a biodegradable and non-toxic fuel that can reduce greenhouse gas emissions by up to 90% over the fuel’s lifecycle when compared to fossil diesel. AWS has already started transitioning to renewable diesel to power backup generators at existing data centers in Europe and in America.
“Data centers must evolve to meet AI’s transformative demands,” said Ian Buck, vice president of hyperscale and HPC at NVIDIA. “By enabling advanced liquid cooling solutions, AI infrastructure can be efficiently cooled while minimizing energy use. Our work with AWS on their liquid cooling rack design will allow customers to run demanding AI workloads with exceptional performance and efficiency.”
“As Anthropic develops our leading foundation models, having access to secure, performant, and energy-efficient infrastructure is crucial to our success,” said James Bradbury, distinguished engineer, Compute, at Anthropic. “AWS’s commitment to building cutting-edge data centers is one of the key reasons we’ve chosen them as our primary cloud provider and training partner. Their design improvements represent a significant step forward in providing secure, scalable, and efficient infrastructure to power AI models and drive innovation in this field.”
“AWS’s continuous infrastructure advancements allow us to concentrate on innovating new services that help our customers make more informed financial decisions rather than the undifferentiated heavy lifting of running data centers,” said Alex Lintner, CEO of Technology, Software Solutions and Innovation, Experian. “Security and regulatory compliance are fundamental for us, and working with AWS helps to fortify our position as a trusted technology provider, supporting businesses across a variety of industries. With their energy-efficient and AI-optimized facilities, we can dedicate more resources to the rapid delivery of new data-driven products rather than data center management.”
“Our customers depend on us to deliver reliable and highly secure generative AI innovation in a trusted, private environment,” said Jeff Reihl, chief technology officer, LexisNexis Legal & Professional. “Our longstanding partnership with AWS and their leading infrastructure enable us to deliver this, and also meet our goal of increasing our positive environmental impact while striving to reduce our environmental footprint.”
The new data center components are built to scale across all of AWS’s infrastructure worldwide, including its 34 Regions, 108 Availability Zones, and other infrastructure offerings like AWS Local Zones. Construction on new AWS data centers with the full set of components is expected to begin in early 2025 in the United States.
To learn more about AWS Infrastructure, visit https://aws.amazon.com/about-aws/global-infrastructure/.
About Amazon Web Services
Since 2006, Amazon Web Services has been the world’s most comprehensive and broadly adopted cloud. AWS has been continually expanding its services to support virtually any workload, and it now has more than 240 fully featured services for compute, storage, databases, networking, analytics, machine learning and artificial intelligence (AI), Internet of Things (IoT), mobile, security, hybrid, media, and application development, deployment, and management from 108 Availability Zones within 34 geographic regions, with announced plans for 18 more Availability Zones and six more AWS Regions in Mexico, New Zealand, the Kingdom of Saudi Arabia, Taiwan, Thailand, and the AWS European Sovereign Cloud. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—trust AWS to power their infrastructure, become more agile, and lower costs. To learn more about AWS, visit aws.amazon.com.
About Amazon
Amazon is guided by four principles: customer obsession rather than competitor focus, passion for invention, commitment to operational excellence, and long-term thinking. Amazon strives to be Earth’s Most Customer-Centric Company, Earth’s Best Employer, and Earth’s Safest Place to Work. Customer reviews, 1-Click shopping, personalized recommendations, Prime, Fulfillment by Amazon, AWS, Kindle Direct Publishing, Kindle, Career Choice, Fire tablets, Fire TV, Amazon Echo, Alexa, Just Walk Out technology, Amazon Studios, and The Climate Pledge are some of the things pioneered by Amazon. For more information, visit amazon.com/about and follow @AmazonNews.
View source version on businesswire.com: https://www.businesswire.com/news/home/20241201928268/en/