Upgrade to Pro — share decks privately, control downloads, hide ads and more …

What the history of the web can teach us about ...

What the history of the web can teach us about the future of AI

Recent advancements in Generative AI are exciting, and will surely have a significant, yet uncertain impact on the future. Are we still going to need developers going forward, or will they be replaced by AI? Is Big Tech monopolizing the technology? And will we become entirely dependent on API providers, sacrificing the spirit of open-source software and data privacy? I believe there is a lot we can learn from another groundbreaking technology: the web. In this talk, I'll show you what the history of the web can teach us about the future of artificial intelligence, and what this means for developers, models, open source and regulation.

Ines Montani

January 25, 2025
Tweet

Resources

Story time: How I started coding

https://ines.io/blog/how-i-started-coding/

My story on how I got into programming and working on AI and Natural Language Processing, starting with web development, front-end and design.

Python Developers Survey 2023 Results

https://lp.jetbrains.com/python-developers-survey-2023/

The seventh annual official Python Developers Survey, conducted as a collaborative effort between the Python Software Foundation and JetBrains.

Let Them Write Code

https://speakerdeck.com/inesmontani/let-them-write-code-keynote-pycon-india-2019

Talk about the development philosophy and mindset that motivates the design of our tools and the problem with reinventing the road vs. the wheel.

How S&P Global is making markets more transparent with NLP, spaCy and Prodigy

https://explosion.ai/blog/sp-global-commodities

A case study on S&P Global’s efficient information extraction pipelines for real-time commodities trading insights in a high-security environment using human-in-the-loop distillation.

A practical guide to human-in-the-loop distillation

https://explosion.ai/blog/human-in-the-loop-distillation

This blog post presents practical solutions for using the latest state-of-the-art models in real-world applications and distilling their knowledge into smaller and faster components that you can run and maintain in-house.

The AI Revolution Will Not Be Monopolized

https://speakerdeck.com/inesmontani/the-ai-revolution-will-not-be-monopolized-how-open-source-beats-economies-of-scale-even-for-llms

Are we heading further into a black box era with larger and larger models, obscured behind APIs controlled by big tech monopolies? I don’t think so, and in this talk, I’ll show you why.

More Decks by Ines Montani

Other Decks in Technology

Transcript

  1. displaCy ines.io/blog/how-i-started-coding I NLP … and still love it today!

    My Blog 2006 - 2009 I CSS I started with the web…
  2. Prodigy Modern scriptable annotation tool for machine learning developers prodigy.ai

    900+ COMPANIES 10k+ USERS Alex Smith Developer Kim Miller Analyst GPT-4 API
  3. 0 % 20 % 40 % 60 % Data Analysis

    Web Dev Machine Learning DevOps Parsing / Scraping Python Developers Survey 2023 Python usage
  4. 0 % 20 % 40 % 60 % Data Analysis

    Web Dev Machine Learning DevOps Parsing / Scraping Python Developers Survey 2023 Python usage Web Development Machine Learning Data Analysis Research Education DevOps Data Engineering 0 % 25 % Python Developers Survey 2023 What do you use Python for the most?
  5. 0 % 20 % 40 % 60 % Data Analysis

    Web Dev Machine Learning DevOps Parsing / Scraping Python Developers Survey 2023 Python usage Web Development Machine Learning Data Analysis Research Education DevOps Data Engineering 0 % 25 % Python Developers Survey 2023 What do you use Python for the most? Python wins for AI because it’s a general- purpose language
  6. Big Tech is reinventing the wheel and the road spacy.fyi/ltwc

    Reinvent the wheel, but don’t reinvent the road!
  7. Big Tech is reinventing the wheel and the road Google

    most-used search engine with market share of over 80% spacy.fyi/ltwc Reinvent the wheel, but don’t reinvent the road!
  8. Big Tech is reinventing the wheel and the road Google

    most-used search engine with market share of over 80% Chrome / Chromium most popular browser and engine powering Microsoft Edge, Opera, Arc and more spacy.fyi/ltwc Reinvent the wheel, but don’t reinvent the road!
  9. Big Tech is reinventing the wheel and the road Google

    most-used search engine with market share of over 80% Chrome / Chromium most popular browser and engine powering Microsoft Edge, Opera, Arc and more Gemini (formerly Bard) generative AI model and chat bot integrated into search spacy.fyi/ltwc Reinvent the wheel, but don’t reinvent the road!
  10. Big Tech is reinventing the wheel and the road Accelerated

    Mobile Pages open-source framework for faster mobile browsing, encouraged for best search results Google most-used search engine with market share of over 80% Chrome / Chromium most popular browser and engine powering Microsoft Edge, Opera, Arc and more Gemini (formerly Bard) generative AI model and chat bot integrated into search spacy.fyi/ltwc Reinvent the wheel, but don’t reinvent the road!
  11. Big Tech is reinventing the wheel and the road Accelerated

    Mobile Pages open-source framework for faster mobile browsing, encouraged for best search results Google most-used search engine with market share of over 80% Chrome / Chromium most popular browser and engine powering Microsoft Edge, Opera, Arc and more Gemini (formerly Bard) generative AI model and chat bot integrated into search Google Ads search advertising generating $200b+ revenue per year spacy.fyi/ltwc Reinvent the wheel, but don’t reinvent the road!
  12. Are we still going to need developers? Will Big Tech

    monopolize AI? What about open- source software?
  13. Are we still going to need developers? Will Big Tech

    monopolize AI? What about open- source software? Will AI developers be replaced by AI?
  14. Are we still going to need developers? Will we become

    dependent on API providers? Will Big Tech monopolize AI? What about open- source software? Will AI developers be replaced by AI?
  15. Are we still going to need developers? Will we become

    dependent on API providers? Will Big Tech monopolize AI? How do we keep our data private? What about open- source software? Will AI developers be replaced by AI?
  16. Are we still going to need developers? What can we

    learn from the past? Will we become dependent on API providers? Will Big Tech monopolize AI? How do we keep our data private? What about open- source software? Will AI developers be replaced by AI?
  17. ceiling floor local store sets up website without requiring help

    from a web developer 🛍 +10% customers and revenue
  18. ceiling floor local store sets up website without requiring help

    from a web developer 🛍 +10% customers and revenue todo list app adds automated translation feature using an API 🤖 +5% international user growth
  19. ceiling floor streaming service makes web player 1ms faster 📺

    +3% time spent in app local store sets up website without requiring help from a web developer 🛍 +10% customers and revenue todo list app adds automated translation feature using an API 🤖 +5% international user growth
  20. ceiling floor hotel booking platform improves recommendation system 💸 +0.5%

    click-through rate, +$1m revenue streaming service makes web player 1ms faster 📺 +3% time spent in app local store sets up website without requiring help from a web developer 🛍 +10% customers and revenue todo list app adds automated translation feature using an API 🤖 +5% international user growth
  21. ceiling floor hotel booking platform improves recommendation system 💸 +0.5%

    click-through rate, +$1m revenue streaming service makes web player 1ms faster 📺 +3% time spent in app local store sets up website without requiring help from a web developer 🛍 +10% customers and revenue todo list app adds automated translation feature using an API 🤖 +5% international user growth high adoption
  22. ceiling floor hotel booking platform improves recommendation system 💸 +0.5%

    click-through rate, +$1m revenue streaming service makes web player 1ms faster 📺 +3% time spent in app local store sets up website without requiring help from a web developer 🛍 +10% customers and revenue todo list app adds automated translation feature using an API 🤖 +5% international user growth high value high adoption
  23. real-time commodities trading insights by extracting structured attributes high-security environment

    explosion.ai/blog/sp-global-commodities S&P Global CASE STUDY 99% F-SCORE 6mb MODEL SIZE 16k+ WORDS / S
  24. real-time commodities trading insights by extracting structured attributes high-security environment

    used LLM during annotation explosion.ai/blog/sp-global-commodities S&P Global CASE STUDY 99% F-SCORE 6mb MODEL SIZE 16k+ WORDS / S
  25. real-time commodities trading insights by extracting structured attributes high-security environment

    used LLM during annotation 10× data development speedup with humans and model in the loop explosion.ai/blog/sp-global-commodities S&P Global CASE STUDY 99% F-SCORE 6mb MODEL SIZE 16k+ WORDS / S
  26. real-time commodities trading insights by extracting structured attributes high-security environment

    used LLM during annotation 10× data development speedup with humans and model in the loop 8+ market pipelines in production explosion.ai/blog/sp-global-commodities S&P Global CASE STUDY 99% F-SCORE 6mb MODEL SIZE 16k+ WORDS / S
  27. real-time commodities trading insights by extracting structured attributes high-security environment

    used LLM during annotation 10× data development speedup with humans and model in the loop 8+ market pipelines in production explosion.ai/blog/sp-global-commodities S&P Global CASE STUDY 99% F-SCORE 6mb MODEL SIZE 16k+ WORDS / S
  28. static pages custom models dynamic pages static pages pretrained models

    Development workflows over time compile static data at build time
  29. static pages custom models dynamic pages static pages custom models

    pretrained models Development workflows over time compile static data at build time
  30. static pages custom models dynamic pages static pages custom models

    distill models into smaller, faster and private components pretrained models Development workflows over time compile static data at build time
  31. AI products are more than just a model spacy.fyi/ai-revolution How

    open-source beats economies of scale, even for LLMs
  32. AI products are more than just a model machine-facing models

    GPT-4 human-facing systems ChatGPT spacy.fyi/ai-revolution How open-source beats economies of scale, even for LLMs
  33. AI products are more than just a model machine-facing models

    GPT-4 human-facing systems ChatGPT most important di ff erentiation is product, not just technology spacy.fyi/ai-revolution How open-source beats economies of scale, even for LLMs
  34. AI products are more than just a model machine-facing models

    GPT-4 human-facing systems ChatGPT UI / UX marketing customization most important di ff erentiation is product, not just technology spacy.fyi/ai-revolution How open-source beats economies of scale, even for LLMs
  35. AI products are more than just a model machine-facing models

    GPT-4 human-facing systems ChatGPT UI / UX marketing customization most important di ff erentiation is product, not just technology swappable components based on research, impacts are quantifiable spacy.fyi/ai-revolution How open-source beats economies of scale, even for LLMs
  36. AI products are more than just a model machine-facing models

    GPT-4 human-facing systems ChatGPT UI / UX marketing customization speed accuracy latency cost most important di ff erentiation is product, not just technology swappable components based on research, impacts are quantifiable spacy.fyi/ai-revolution How open-source beats economies of scale, even for LLMs
  37. AI products are more than just a model machine-facing models

    GPT-4 human-facing systems ChatGPT UI / UX marketing customization speed accuracy latency cost But what about the data? most important di ff erentiation is product, not just technology swappable components based on research, impacts are quantifiable spacy.fyi/ai-revolution How open-source beats economies of scale, even for LLMs
  38. AI products are more than just a model machine-facing models

    GPT-4 human-facing systems ChatGPT UI / UX marketing customization speed accuracy latency cost But what about the data? User data is an advantage for product, not the foundation for machine-facing tasks. most important di ff erentiation is product, not just technology swappable components based on research, impacts are quantifiable spacy.fyi/ai-revolution How open-source beats economies of scale, even for LLMs
  39. AI products are more than just a model machine-facing models

    GPT-4 human-facing systems ChatGPT UI / UX marketing customization speed accuracy latency cost But what about the data? User data is an advantage for product, not the foundation for machine-facing tasks. You don’t need specific data to gain general knowledge. most important di ff erentiation is product, not just technology swappable components based on research, impacts are quantifiable spacy.fyi/ai-revolution How open-source beats economies of scale, even for LLMs
  40. 🔮 Developers Don’t confuse raising the floor with raising the

    ceiling. 👩💻 ⚠ 🎁 high-value use cases are worth development e ort
  41. Models Take back control in the development process. 🔮 Developers

    Don’t confuse raising the floor with raising the ceiling. 👩💻 ⚠ 🎁 high-value use cases are worth development e ort
  42. compile smaller, faster and private models at development time Models

    Take back control in the development process. 🔮 Developers Don’t confuse raising the floor with raising the ceiling. 👩💻 ⚠ 🎁 high-value use cases are worth development e ort
  43. compile smaller, faster and private models at development time Interoperability

    is the opposite of monopoly. Open Source Models Take back control in the development process. 🔮 Developers Don’t confuse raising the floor with raising the ceiling. 👩💻 ⚠ 🎁 high-value use cases are worth development e ort
  44. compile smaller, faster and private models at development time LLMs

    can be one part of a product or process, and swapped Interoperability is the opposite of monopoly. Open Source Models Take back control in the development process. 🔮 Developers Don’t confuse raising the floor with raising the ceiling. 👩💻 ⚠ 🎁 high-value use cases are worth development e ort
  45. compile smaller, faster and private models at development time LLMs

    can be one part of a product or process, and swapped Interoperability is the opposite of monopoly. Open Source Regulation should focus on products and actions, not components. Regulation Models Take back control in the development process. 🔮 Developers Don’t confuse raising the floor with raising the ceiling. 👩💻 ⚠ 🎁 high-value use cases are worth development e ort