Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Big Data In Action with Infinispan

Big Data In Action with Infinispan

Dealing with real-time, in-memory, streaming data is a unique challenge and with the advent of the smartphone and IoT (trillions of internet connected devices), we are witnessing an exponential growth in data at scale.

Building data layers that can satisfy these requirements can be challenging, but with the help of Infinispan, an in-memory data grid from Red Hat, you can take advantage of state of the art distributed data processing capabilities to tackle these challenges. From classic or full-text queries, to Spark/Hadoop integrations via distributed Java Streams, these wide ranging data processing capabilities make Infinispan the perfect choice for the Big Data era.

In this session, we will identify critical patterns and principles that will help you achieve greater scale and response speed. On top of that, you will witness how Infinispan follows these patterns and principles to tackle a big data situation via a live coding demonstration.

Galder Zamarreño

April 27, 2017
Tweet

More Decks by Galder Zamarreño

Other Decks in Programming

Transcript

  1. Moi • @ • Infinispan developer & co-founder • Lead

    client/server architecture • Functional programming @galderz #infinispan
  2. What is a imdg? • Distributed in-memory data • Server

    "mesh" • Peer-to-peer (P2P) • No master/slaves • No single bottleneck • No single point failure • Commodity hardware
  3. infinispan is a imdg Custom Applications Mobile Applications Web Apps

    & Websites JBoss Middleware Fuse "memory" across machines into a unified data store Read-through, write-through, write-behind • NoSQL • Extreme Performance • Linear Scalability • Fault Tolerant • Event processing • Configurable ACID Txn Infinispan Databases and/or file system Analytical Framework
  4. Infinispan Use Cases Distributed Cache cache frequent data transient short-lived

    storage NoSQL Database key/value store ACID transactions
  5. Infinispan Use Cases Distributed Cache cache frequent data transient short-lived

    storage NoSQL Database key/value store ACID transactions Event Broker listen to data changes continuous query
  6. Infinispan Use Cases Distributed Cache cache frequent data transient short-lived

    storage NoSQL Database key/value store ACID transactions Event Broker listen to data changes continuous query Data Analytics map/ reduce via java stream spark/ hadoop integration
  7. use examples • Web, Ecommerce • HTTP session • Shopping

    carts • Database/legacy offload: • Product catalog • Caching • Telecommunications • Cellular billings • Call routing, session info, • SMS content/notification • Travel • Aggregated flight pricing • Availability flights • Financial • Per-user portfolio data and risk analysis • Aggregated ticker stream • Defence • Sensor network data process and threat detection
  8. Infinispan Use Cases Distributed Cache cache frequent data transient short-lived

    storage NoSQL Database key/value store ACID transactions Event Broker listen to data changes continuous query Data Analytics map/ reduce via java stream spark/ hadoop integration
  9. Real-Time Demo Continuous Query Verticle Http App Verticle Data Grid

    Replication Sock JS Bridge Real Time Laptop Http Websockets JavaFX Injector Verticle
  10. java 8 stream List<Integer> numbers = Arrays.asList( 4, 74, 20,

    97, 118, 50, 97, 34, 48); numbers.stream() .filter(i -> i > 70) // ^ Returns Stream<Integer> .map(n -> new String(Character.toChars(n))) // ^ Returns Stream<String> .reduce("", String::concat); Returns "Java"
  11. What is the time of the day when there is

    the biggest ratio of delayed trains?
  12. Analytics Demo Data Grid Replication Delay Calculator Server Task Delay

    Calculator Server Task Delay Calculator Server Task Analytics Verticle Injector Verticle Analytics Jupyter Laptop HTTP
  13. Build Infinispan based infrastructure to store, search and process near

    real-time data and calculate analytics real-time data challenge
  14. Build Infinispan based infrastructure to store, search and process near

    real-time data and calculate analytics data growth problem real-time data challenge
  15. Build Infinispan based infrastructure to store, search and process near

    real-time data and calculate analytics real-time data challenge data growth problem continuous query for real-time
  16. Build Infinispan based infrastructure to store, search and process near

    real-time data and calculate analytics real-time data challenge data growth problem continuous query for real-time analysis with java streams
  17. credits Approve by Aha-Soft from the Noun Project engineer by

    Wilson Joseph from the Noun Project transformation by Felipe Perucho from the Noun Project analytics by Roman Kovbasyuk from the Noun Project Database sharing by YuguDesign from the Noun Project Server by designify.me from the Noun Project