Upgrade to Pro — share decks privately, control downloads, hide ads and more …

apidays Singapore 2025 - The Quest for the Gree...

apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehret (Resilio)

The Quest for the Greenest LLM - Insights from the Frontlines
Jean Philippe Ehret, Founder BetterBytes (Now Resilio) - CTO at Emplify

apidays Singapore 2025
Where APIs Meet AI: Building Tomorrow's Intelligent Ecosystems
April 15 & 16, 2025

------

Check out our conferences at https://www.apidays.global/

Do you want to sponsor or talk at one of our conferences?
https://apidays.typeform.com/to/ILJeAaV8

Learn more on APIscene, the global media made by the community for the community:
https://www.apiscene.io

Explore the API ecosystem with the API Landscape:
https://apilandscape.apiscene.io/

Avatar for apidays

apidays

July 04, 2025
Tweet

More Decks by apidays

Other Decks in Programming

Transcript

  1. Kia Ora, I’m Jean -Philippe Founder of BetterBytes by Resilio

    CTO of Emplify 25 years in IT. Still writing code. MSc (Industrial Science – Renewable Energy) by
  2. Even in Middle Earth, We Burn Coal New Zealand: Stunning

    landscapes, rich in renewables. Yet we’re importing coal.
  3. New Zealand’s data centers used to take 1.5% of our

    electricity. Soon, they’ll take 8% . And they’re eating our renewables. AI’s Quiet Pull on Clean Power
  4. When Tech Companies Dig Deeper - Literally Microsoft is helping

    fund geothermal energy capacity in New Zealand to power new data centres.
  5. What Models? The Association Of Data Scientist (Thanks Chris Simon)

    Small -> 3B -> MMLU: 61 (similar to LLAMA 3B) Larger -> 120B -> MMLU: 85 (similar to GPT4o) MMLU Massive Multitask Language Understanding
  6. Training vs Inference: Two Sides of the Emissions Coin Direct

    emissions: 1.14g (larger) vs 0.16g (small). For a 400 output tokens conversation
  7. Training vs Inference: Two Sides of the Emissions Coin Direct

    emissions: 1.14g (bigger) vs 0.16g (small). Amortized: 37g vs 70g For a 400 output tokens conversation 37 70
  8. The Greenest Prompt? None at All. Eco-design your software: Delete

    non -essentials Allow a budget for it. It is worth it!
  9. Water scarcity 120B 3B : 314,000 m3 : 3,400 m3

    60% Cooling 30% Hydropower For a 400 output tokens conversation
  10. Water Scarcity Freshwater is scarce. Climate location matters. All water

    on, in, and above the Earth Liquid fresh water Fresh-water lakes and rivers
  11. Water Scarcity Freshwater is scarce. Climate location matters. All water

    on, in, and above the Earth Liquid fresh water Fresh-water lakes and rivers
  12. Water Scarcity Freshwater is scarce. Climate location matters. All water

    on, in, and above the Earth Liquid fresh water Fresh-water lakes and rivers
  13. Raw Materials in the Red Zone 120B 3B : 750

    kg Sb eq : 10 kg Sb eq 61% Hardware manufacturing
  14. Raw Materials in the Red Zone 120B 3B : 750

    kg Sb eq : 10 kg Sb eq 61% Hardware manufacturing For a 400 output tokens conversation
  15. 4 Actions suggested by the Study • Use small/specialized models

    when possible • Headline generation, sentiment analysis, spam detection
  16. 4 Actions suggested by the Study • Use small/specialized models

    when possible • Select Models trained in appropriate regions. • Which ones? -> Stay tuned
  17. 4 Actions suggested by the Study • Use small/specialized models

    when possible • Select Models trained in appropriate regions. • Run open source models locally when it makes sense
  18. 4 Actions suggested by the Study • Use small/specialized models

    when possible • Select Models trained in appropriate regions. • Run open source models locally when it makes sense • Apply Software eco-design principles for inferences
  19. AI Can Accelerate Both Good and Harm • Jevon’s Paradox

    and rebound effect • Frugal AI and AI for good Efficiency ≠ footprint reduction.
  20. AI Can Accelerate Both Good and Harm • Jevon’s Paradox

    and rebound effect • Frugal AI and AI for good Efficiency ≠ footprint reduction.