Upgrade to Pro — share decks privately, control downloads, hide ads and more …

pyconIT.pdf

Avatar for Luca Corbucci Luca Corbucci
June 01, 2025
11

 pyconIT.pdf

Avatar for Luca Corbucci

Luca Corbucci

June 01, 2025
Tweet

Transcript

  1. 👋 Luca Corbucci 👨💻 Ph.D. Student @ University of Pisa

    🎤 Podcaster @ PointerPodcast 🦸 Co-Organizer @ Superhero Valley and Pisa.dev 🌏 lucacorbucci.me
  2. 👋 Luca Corbucci 👨💻 Ph.D. Student @ University of Pisa

    🎤 Podcaster @ PointerPodcast 🦸 Co-Organizer @ Superhero Valley and Pisa.dev 🌏 lucacorbucci.me
  3. HOW CLASSIC ML WORKS TRAIN A MODEL MAKE PREDICTIONS COLLECT

    TRAINING DATA WHAT’S THE COLOR OF THE CAT?
  4. PRO AND CONS ✅ EASIER TO TRAIN & DEBUG ✅

    MORE MATURE ECOSYSTEM ✅ GLOBAL VIEW ON THE DATASET
  5. PRO AND CONS ✅ EASIER TO TRAIN & DEBUG ✅

    MORE MATURE ECOSYSTEM ✅ GLOBAL VIEW ON THE DATASET ⚠ PRIVACY AND TRUST RISKS
  6. PRO AND CONS ✅ EASIER TO TRAIN & DEBUG ✅

    MORE MATURE ECOSYSTEM ✅ GLOBAL VIEW ON THE DATASET ⚠ GDPR AND SIMIlAR LAWS CAN LIMIT DATA SHARING ⚠ PRIVACY AND TRUST RISKS
  7. PRO AND CONS ✅ EASIER TO TRAIN & DEBUG ✅

    MORE MATURE ECOSYSTEM ✅ GLOBAL VIEW ON THE DATASET ⚠ SCALABILITY LIMITS ⚠ GDPR AND SIMIlAR LAWS CAN LIMIT DATA SHARING ⚠ PRIVACY AND TRUST RISKS
  8. FEDERATED LEARNING THE IDEA WHY NOW? MORE COMPUTE AT THE

    EDGE RISE IN PRIVACY CONCERNS BETTER TOOLS TO IMPLEMENT FL * * * AGGREGATE THE SHARED MODELS * * MOVE THE MODELS, NOT THE DATA.
  9. TRAIN A MODEL ON MY DATA TRAIN A MODEL ON

    MY DATA TRAIN A MODEL ON MY DATA TRAIN A MODEL ON MY DATA TRAIN A MODEL ON MY DATA TRAIN A MODEL ON MY DATA
  10. TRAIN A MODEL ON MY DATA TRAIN A MODEL ON

    MY DATA TRAIN A MODEL ON MY DATA TRAIN A MODEL ON MY DATA TRAIN A MODEL ON MY DATA TRAIN A MODEL ON MY DATA
  11. THE MODELS ARE AGGREGATED INTO A SINGLE MODEL USUALLY, THIS

    IS JUST AN AVERAGE OF THE WEIGHTS OF THE MODELS
  12. ARCHITECTURE SERVER SERVER APP SUPERLINK CLIENT SUPERNODE CLIENT APP CLIENT

    SUPERNODE CLIENT APP CLIENT SUPERNODE CLIENT APP
  13. RECAP Centralized Learning Federated Learning Data location All data is

    collected in one central server Data stays on the local device Privacy Higher risk Raw data never leaves device Regulations Difficult with sensitive data Ideal for sensitive or regulated environments Entry barrier Low (lots of tools, guides) Lowering
  14. CONCLUSIONS AS INTERESTS IN DATA PRIVACY GROWS, FL IS BECOMING

    INCREASINGLY IMPORTANT * * EVEN LLM CAN BE TRAINED THIS WAY!
  15. CONCLUSIONS AS INTERESTS IN DATA PRIVACY GROWS, FL IS BECOMING

    INCREASINGLY IMPORTANT * THE FUTURE OF AI IS FEDERATED! * * EVEN LLM CAN BE TRAINED THIS WAY!
  16. Thank you! Questions? 👨💻 Ph.D. Student @ University of Pisa

    🎤 Podcaster @ PointerPodcast 🦸 Co-Organizer @ Superhero Valley and Pisa.dev 🌏 lucacorbucci.me