exAspArk
November 20, 2025

Building Cloud Data Infrastructure with Ruby @ SF Ruby 2025

Transcript

  1. I ❤ Ruby

    Type of Software: Programming Language
    Web / API: Ruby
    Mobile: Ruby
    Desktop: Ruby
    Game: Ruby
    Database: Ruby
    AR / VR: Ruby
    Blockchain / Web3: Ruby
    Infrastructure / DevOps: Ruby
    ML / Data Science: Ruby
    Embedded / IoT: Ruby
  2. Choosing the Right Tool

    Type of Software: Programming Language
    Web / API: Ruby
    Mobile: Kotlin / Swift
    Desktop: TypeScript
    Game: C++
    Database: Rust
    AR / VR: C#
    Blockchain / Web3: Solidity
    Infrastructure / DevOps: Go
    ML / Data Science: Python
    Embedded / IoT: C
  3. Jose Valim at SF Ruby?

    Type of Software: Programming Language
    Web / API: Ruby
    Mobile: Kotlin / Swift
    Desktop: TypeScript
    Game: C++
    Database: Rust
    AR / VR: C#
    Blockchain / Web3: Solidity
    Infrastructure / DevOps: Elixir
    ML / Data Science: Elixir
    Embedded / IoT: Elixir

    Type of Software: Programming Language
    Web / API: Ruby
    Mobile: Kotlin / Swift
    Desktop: TypeScript
    Game: C++
    Database: Rust
    AR / VR: C#
    Blockchain / Web3: Solidity
    Infrastructure / DevOps:
    ML / Data Science:
    Embedded / IoT:
  4. Data Engineering 101

    Postgres: logical replication
    Notion: API
    Dialpad: webhooks
    Immutable CSV on steroids
    JSON metadata
    SQLite for analytics
  5. Data Engineering 101

    Postgres: logical replication
    Notion: API
    Dialpad: webhooks
    Immutable CSV on steroids
    JSON metadata
    SQLite for analytics
    Kafka
    JVM
    Table compaction
    Snapshotting
    Iceberg v3, Delta Lake, Hudi, etc.
    Data catalog
    Data transformation, ETL, ELT, reverse ETL, WTF 🤯
    Data streaming
    Orchestration
    Vectors
    Data quality
  6. SPA stacks are trivial in comparison

    Postgres: logical replication
    Notion: API
    Dialpad: webhooks
    Immutable CSV on steroids
    JSON metadata
    SQLite for analytics
    Kafka
    JVM
    Table compaction
    Snapshotting
    Iceberg v3, Delta Lake, Hudi, etc.
    Data catalog
    Data transformation, ETL, ELT, reverse ETL, WTF 🤯
    Data streaming
    Orchestration
    Vectors
    Data quality
  7. 🔥 Hot Take #2

    Data engineering stacks are 10x more complex and costly than software engineering
  8. Infrastructure Automation

    • Ansible (YAML), Chef (Ruby)
    • Terraform (HCL), Pulumi (JavaScript)
    • Docker (Dockerfile), Kubernetes (YAML)
  9. 🔥 Hot Take #4

    When a monolith isn’t an option, go off-Rails and use plain Ruby
  10. 🔥 Hot Take #5

    Gradual static typing is a useful tool, but its current state in Ruby is a mess
  11. 🔥 Hot Take #5

    Gradual static typing is a useful tool, but its current state in Ruby is a mess

    RBS (aka C++ *.h files for Ruby)
    Steep
    Sorbet
    RBI
    Static type checking
    Runtime type checking
    Tapioca
    TypeProf
  12. Best Practices

    • Limit dependencies
    • Reduce mutable global state
    • Make scripts idempotent
    • Mask sensitive data before logging
    • Validate and sanitize user inputs
    • DRY without needless abstractions
  13. Advanced Techniques

    • Timeout handling
    • Feature flags
    • Rate limits
    • Retry with exponential backoff
    • Circuit breaker
    • Chaos engineering
  14. Ruby: More Than Web

    Ruby was designed for scripting and has applications beyond just web development
  15. [email protected] | @exaspark 👋

    • Ruby, AI, data engineering
    • Startups, founders, pivots
    • Swimming, pickleball, martial arts
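
The "Retry with exponential backoff" item on the Advanced Techniques slide could look roughly like this in plain Ruby. This is a minimal sketch and not code from the talk: the names `with_retries` and `RetryExhausted`, the default values, and the injectable `sleeper` parameter are all assumptions made for illustration.

```ruby
# Hypothetical error raised once every attempt has failed (not from the talk).
class RetryExhausted < StandardError; end

# Hypothetical helper: runs the block and retries it when it raises one of
# the errors in `retry_on`, waiting longer after each failed attempt.
def with_retries(max_attempts: 5, base_delay: 0.5, max_delay: 30.0,
                 retry_on: [StandardError],
                 sleeper: ->(seconds) { sleep(seconds) })
  attempt = 0
  begin
    attempt += 1
    yield attempt
  rescue *retry_on => e
    if attempt >= max_attempts
      raise RetryExhausted, "gave up after #{attempt} attempts: #{e.message}"
    end

    # The delay doubles after each failure (base, 2x base, 4x base, ...)
    # and is capped at max_delay.
    delay = [base_delay * (2**(attempt - 1)), max_delay].min
    sleeper.call(delay)
    retry
  end
end
```

A call might wrap a flaky network request, for example `with_retries(max_attempts: 3, retry_on: [IOError]) { fetch_page }`, where `fetch_page` stands in for whatever the script does. The `sleeper` argument exists only so the waiting can be swapped out in tests. Real deployments often add random jitter to the delay so that many clients do not retry at the same instant; that is left out here to keep the sketch deterministic.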