There’s greater cost of deploying AI and ML models in production

At the end of last year, we witnessed a story that rattled the tech scene and roused heated discussion around AI and AI ethics. Namely, Timnit Gebru, former Co-lead of Google’s ethical AI team, co-authored a paper that led to her leaving Google. One of the core risks outlined in the paper was the AI carbon footprint of the development and deployment of large language models.

The paper titled “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?” brings to the front some inconvenient truths related to a core line of Google’s research and raised questions about AI that might have caused Google concern, writes MIT Technology Reviews. In its focus, the paper lays out the risks of ever-larger language models, such as BERT, its variants, GPT-2/3, others most recently Switch-C – AI models trained on staggering amounts of text data, and China’s ‘Wu Dao’ AI model, which is 10X bigger than GPT-3.

One of the four main risks of large language models the paper outlines is a critical risk of deploying AI models into production that, it seems, is rarely given a second thought in the ambitious AI initiatives – environmental and financial costs.

There’s greater cost of deploying AI and ML models in production - the AI carbon footprint

The carbon footprint of training AI and ML models

Gebru’s paper highlighted the bitter truth about the industry: Today everyone is talking about deploying AI models into production, but no one or not enough people are talking about the cost of compute power and maintaining the ML models in production. Training AI models consumes a lot of computer processing power, and hence a lot of electricity. Furthermore, as the amount of data is exponentially growing, so is the AI carbon footprint expanding and putting a huge burden on the environment and impacting climate change.

No wonder the AI industry is often compared to the oil and gas industry, bearing a striking similarity to its high value and lucrativeness and its environmental impact.

In the paper, Gebru and her colleagues refer to another paper by Emma Strubell on the carbon emissions and financial costs of large language models, which revealed that the energy consumption and AI carbon footprint of the large language models have been exploding since 2017, as models have been fed more and more data. While another study referred to in Anthropocene Magazine found that computing power used in deep learning grew 300,000-fold between 2012 and 2018. If this pace of growth continues, AI could have a severe climate impact.

After performing a life cycle assessment for training several common large AI models, Strubell and her colleagues from the University of Massachusetts, found that the process can emit more than 626,000 pounds of carbon dioxide equivalent—nearly five times the lifetime emissions of the average American car (and that includes the manufacture of the car itself).

More specifically, Strubell’s paper examined the model training process for natural-language processing (NLP) – the Transformer, ELMo, BERT, and GPT-2 – models that have reached many groundbreaking performance milestones in the last years. First, the research included training each model on a single GPU for up to a day to measure its power draw. Then the researchers used the number of training hours listed in the model’s original papers to calculate the total energy consumed over the complete training process.

It’s worth noting that the research included training each model once, while in practice, models are usually trained many times during research and development. Strubell and her colleagues found that the compute and environmental cost of training grew proportionally to model size. The results showed the most costly model was BERT, with a carbon footprint of roughly 1,400 pounds of carbon dioxide equivalent, which is close to a round-trip trans-America flight for one person. GPT-3, the latest generation deep learning models designed to produce human-like language, requires an amount of energy equivalent to the yearly consumption of 126 Danish homes and creates a carbon footprint equivalent to travelling 700,000 kilometres by car for a single training session, describes Anthropocene Magazine.

Although experts had had some assumptions about the AI’s environmental impact, no one imagined that the actual problem is of such flabbergasting magnitude.

The cost of deploying AI models in production

AI’s dual role in the climate change

On the other hand, AI can be deployed to reduce the effects of climate change, such as creating smart grid design, developing low-emission infrastructure, and modelling climate change predictions.

Doubtless, AI’s destiny is to have a dual role in climate change, but this doesn’t mean that actions can’t be taken to mitigate or reduce the carbon footprint of AI. As the issue of carbon emissions and electricity costs from deploying AI models gains more traction, it attracts the attention of important stakeholders. For example, thousands took to the streets in September 2019, joining the global climate strike to shed light on big tech’s collaboration with fossil fuel companies. Another instance is the Tech Workers Coalition that united employees of Amazon, Google, Microsoft, Facebook, Twitter on a march to demand from their employers a promise to reduce emissions to zero by 2030, to cease contracts with fossil fuel companies, stop funding climate change deniers, and to stop the exploitation of climate refugees and frontline communities, reports Nature.

Quantifying carbon emissions

The challenge in taking proactive action to reduce AI’s carbon footprint and climate impact is how to quantify energy consumption and carbon emission, as well as to make this information transparent. Part of the problem is the lack of standard of measurement and the vagueness of the cost of building complex AI and ML systems in the social perception, Nature emphasises.

To narrow the quantification challenge, researchers have worked on a study to quantify the carbon emissions of machine learning. The researchers found out that the emissions obtained during training a neural network model were related to the location of the training server and the energy grid it uses, the length of the training procedure, and the hardware on which the training takes place. To help other researchers and practitioners, they created an emissions calculator to estimate the energy use and the associated environmental impact of training ML models.

Apart from the calculator, one of the researchers Alexandra Luccioni, suggested several actions to reduce AI carbon footprint. “Using renewable energy grids for training neural networks is the single biggest change that can be made. It can make emissions vary by a factor of 40, between a fully renewable grid and a full coal grid,” states Luccioni.

However, to make AI less polluting, it needs to become more of a mainstream conversation, including researchers being transparent about how much carbon dioxide was produced by their research, and reusing models instead of training them from scratch and by using more efficient GPUs.

Another team of Stanford University researchers also tried to track and get accurate measures of AI carbon emissions. First, they measured the power consumption of a particular AI model, untangling each training session and taking into account the power for shared overhead functions, such as data storage and cooling. Then they translated power consumption into carbon emissions, which depend on the mix of renewable and fossil fuels that produced the electricity.

The Stanford study demonstrated that the location of an AI training session could make a big difference in its carbon emissions. For example, the researchers estimated that running a session in Estonia, which largely relies on shale oil, will produce 30 times the volume of carbon as the same session would in Quebec, which relies primarily on hydroelectricity, affirm Stanford University.

However, to fully grasp AI’s carbon impact, it is not enough to scrutinise the compute costs incurred by training large models. A huge hurdle for quantifying carbon emissions is tech companies’ resistance to sharing data about their energy mix and their undisputed dominance in building data centres across the world.

A possible solution would be to offer tax incentives to cloud providers to open data centres in places with hydro or solar energy, not in areas where the grid is mainly coal-fuelled. This would make a huge impact to reduce the carbon footprint, states Alexandra Luccioni.

Turning Red AI into Green AI

The greater demand for AI innovation and performance in recent years comes with even greater computer power. The trend of using massive compute power to achieve better results was named “red AI” – a term introduced in 2019 by Roy Schwartz in a paper he co-authored.

Red AI presents the practice where, for a linear gain in performance, an exponentially larger model is required, which can increase the amount of training data or the number of experiments, thus escalating computational costs, and therefore carbon emissions. What is even worse, Schwartz concluded that the vast majority of papers he analysed prioritised accuracy over efficiency, falling under the “red AI” category.

Three factors categorise AI research as red, Schwartz’s study found:

the cost of executing the model on a single example;
the size of the training dataset, which controls the number of times the model is executed; and
the number of hyperparameter experiments, which controls how many times the model is trained.

Opposed to this, Schwartz and colleagues advocate for ‘green AI’, which they defined as “AI research that yields novel results without increasing computational cost, and ideally reducing it”, opposite to red AI, Nature affirms.

As to concrete actions that researchers can take to more efficient and sustainable AI, the Copenhagen Centre on Energy Efficiency states that environmental sustainability should be considered one of the principles towards responsible development and application of AI.

Gabriela Prata Dias, Head of the Centre, and Xiao Wang, programme officer, suggest several steps to apply cleaner and greener AI practices.

The definition of green AI needs to be actionable for all the relevant stakeholders in the industry, rather than be an abstract concept to most technical experts.
Environmental standards should be developed to ensure the mitigation of environmental impacts, and green AI certifications could be introduced to facilitate the industry process for promoting green AI development.
A practical industry framework and guidelines should be created that support green procurement of AI technologies that would support organisations and companies using and deploying AI technologies in looking for environmentally friendly AI practices.
Governments should consider the long-term impacts in setting up regulatory frameworks and legislation that would legally address transparency and sustainability in AI development.

Building data centres is also a great solution for minimising AI carbon footprint, where the waste heat is re-deployed back into the grid. But the players dominating the market should also strongly consider the energy mix of the location where they select for their data centres. In addition, there’s the need for the proper checks and balances, including regulations, to retain some form of local agency over these infrastructures, reflects Roel Dobbe for Nature.

Looking on a more micro level, Deepika Sandeep, AI scientist heading the AI and ML programme at Bharat Light & Power (BLP), a clean energy generation company, advises making judicious use of deep learning. Since training is the phase that consumes a lot of computational power and hence leads to a higher carbon footprint, Sandeep says they’ve minimised their training cycles. When the model is in production, they do re-training every three to six months. And in the end, she points out that not all problems require deep neural networks and deep learning architectures. Instead, choosing less compute-intensive AI might go a long way for our environment.

Cookie	Duration	Description
__cfduid	1 month	The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-advertisement	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertisement".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
bp_user-registered	13 years 8 months 8 days	This cookie is used to set which users can access the private pages of the website. It is a functional cookie.
bp_user-role	13 years 8 months 8 days	This is a functional cookie. It is used to set restriction to the user on acessing certain pages like back office, account page etc.
bp_ut_session	13 years 8 months 8 days	This is a functional cookie. This cookie is used to set restriction to the user on acessing certain pages like back office, account page etc.

Cookie	Duration	Description
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form.

Cookie	Duration	Description
IDE	1 year 24 days	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
test_cookie	15 minutes	This cookie is set by doubleclick.net. The purpose of the cookie is to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Duration	Description
_gat_gtag_UA_62786802_1	1 minute	No description
CONSENT	16 years 9 months 21 days 15 hours 5 minutes	No description
ihc_workflow_restrictions_0	1 month	No description
ihcMedia	1 hour	No description

There’s greater cost of deploying AI and ML models in production – the AI carbon footprint