
Practical Elixir Streams — To Infinity and Beyond

Evadne Wu
October 08, 2020


By using Elixir Streams, you can make your application more responsive and consume fewer resources. Within this session, I shall explain how Streams work, and provide a few practical uses of Streams, with a live demonstration.




  1. Finite Enumerables: Lists, Maps, Keywords, etc. are finite Enumerables, with contents generated ahead of time. All of them can be enumerated in the same manner, in linear or constant time depending on the structure, because they all implement the Enumerable protocol.
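That uniformity can be seen directly; the same Enum call works on all three structures (a small sketch, not taken from the deck):

```elixir
# Each structure implements the Enumerable protocol, so the same
# Enum.map/2 call works on all of them.
list_result = Enum.map([1, 2, 3], &(&1 * 2))
keyword_result = Enum.map([a: 1, b: 2], fn {k, v} -> {k, v * 2} end)
map_result = Enum.map(%{a: 1, b: 2}, fn {k, v} -> {k, v * 2} end)

IO.inspect(list_result)     # [2, 4, 6]
IO.inspect(keyword_result)  # [a: 2, b: 4]
IO.inspect(map_result)      # note: enumeration order of maps is unspecified
```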
  2. Lazy Enumerables: Any data type which has an implementation of the Enumerable protocol can be enumerated. This attribute can in turn be used to create lazily evaluated Enumerables.
  3. Lazy Enumerables: Lazy Enumerables can act as placeholders for actual content, which minimises blocking. Computation is spread throughout the lifetime of the enumerable and can be halted if the results are no longer required.
  4. Using Lazy Enumerables: Enum.to_list/1 returns a List; Stream.run/1 consumes the Stream. NB: Calling Enum functions on Streams forces evaluation.
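A small illustration of that forcing behaviour (my sketch, not from the deck): building a Stream performs no work, while Enum.to_list/1 or Stream.run/1 performs it all.

```elixir
# Building the stream prints nothing: it is only a recipe.
stream =
  Stream.map(1..3, fn i ->
    IO.puts("computing #{i}")
    i * 2
  end)

# Each of these forces evaluation of the whole stream:
Enum.to_list(stream)  # prints three lines, returns [2, 4, 6]
Stream.run(stream)    # prints three lines again, returns :ok
```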
  5. Anything is Enumerable: "Since enumerables can have different shapes (structs, anonymous functions, and so on), the functions in this module may return any of those shapes and this may change at any time."
  6. Enum.reduce/3: The core concept which drives all enumeration. Can be used to implement all other Enum functions. Can be replaced with optimisation paths. Calls Enumerable.reduce/3.
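To illustrate that reduce/3 can express the other Enum functions, here is map/2 rebuilt on top of Enum.reduce/3 (a hypothetical helper, not part of the deck):

```elixir
# map expressed with reduce: prepend each transformed element,
# then reverse once at the end (prepending is O(1) on lists).
my_map = fn enum, fun ->
  enum
  |> Enum.reduce([], fn x, acc -> [fun.(x) | acc] end)
  |> Enum.reverse()
end

my_map.(1..4, &(&1 * &1))  # [1, 4, 9, 16]
```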
  7. Enumerable Protocol: Defines a series of functions to be implemented by any data type that wishes to become enumerable. Includes reduce/3, count/1, member?/2 and slice/1, with reduce/3 providing the core functionality.
  8. Enumerable.reduce/3: The core building block of enumeration, which is implemented for all enumerable types. Continuable, i.e. can be suspended.
  9. Enumerable.reduce/3:

     @type acc :: {:cont | :halt | :suspend, term}
     @type reducer :: (acc -> result)
     @spec reduce(t, acc, reducer) :: result
  10. Infinite Enumerables: Any data type that implements Enumerable, and always returns new items in its reduce/3 implementation, is infinite. The Stream module provides convenience functions to construct such Enumerables!
  11. Infinite Enumerables:

      Stream.unfold({0, 0}, fn
        {0, 0} -> {1, {1, 0}}
        {1, 0} -> {1, {1, 1}}
        {c, p} -> {c + p, {c + p, c}}
      end)
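This unfold yields the Fibonacci sequence; since the stream is infinite, a consumer takes only as many elements as it needs:

```elixir
# An infinite Fibonacci stream: each step emits the current value
# and carries {current, previous} forward as the accumulator.
fib =
  Stream.unfold({0, 0}, fn
    {0, 0} -> {1, {1, 0}}
    {1, 0} -> {1, {1, 1}}
    {c, p} -> {c + p, {c + p, c}}
  end)

Enum.take(fib, 8)  # [1, 1, 2, 3, 5, 8, 13, 21]
```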
  12. Stream Module = Convenience:

      iex> Stream.map([1, 2, 3], & &1)
      #Stream<[
        enum: [1, 2, 3],
        funs: [#Function<48.68317796/1 in Stream.map/2>]
      ]>
  13. Incremental Peeling: Abuse of Enumerable.reduce/3 allows incremental consumption of any Enumerable, including Streams! Trick: start the enumeration in a suspended state, accumulate into nil, but keep returning items. Note: Streams might be non-reentrant.
  14. Incremental Peeling:

      reduce_fun = fn item, _ -> {:suspend, item} end
      acc = {:suspend, nil}
      result = Enumerable.reduce(enum, acc, reduce_fun)
      {:suspended, nil, next_fun} = result
      {:suspended, _, next_fun} = next_fun.({:cont, nil})
      …
      {:done, nil} = next_fun.({:cont, nil})
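Filled out with a concrete list (my example, not from the deck), the same pattern peels exactly one element per resumption:

```elixir
enum = [10, 20, 30]
reduce_fun = fn item, _acc -> {:suspend, item} end

# The initial :suspend makes reduce/3 stop before the first element.
{:suspended, nil, next_fun} = Enumerable.reduce(enum, {:suspend, nil}, reduce_fun)

# Each resumption with {:cont, nil} yields exactly one element.
{:suspended, 10, next_fun} = next_fun.({:cont, nil})
{:suspended, 20, next_fun} = next_fun.({:cont, nil})
{:suspended, 30, next_fun} = next_fun.({:cont, nil})

# Once the enumerable is exhausted, resumption reports completion.
{:done, nil} = next_fun.({:cont, nil})
```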
  15. Note: Streams are pull-based. No computation is done until new values are pulled from them. Collectables are push-based. They can be used to collect output (e.g. File.stream!/1).
  16. Stream.transform/3:

      iex> enum = 1001..9999
      iex> n = 3
      iex> stream = Stream.transform(enum, 0, fn i, acc ->
      ...>   if acc < n, do: {[i], acc + 1}, else: {:halt, acc}
      ...> end)
      iex> Enum.to_list(stream)
      [1001, 1002, 1003]
  17. Stream.resource/3: Creates the initial accumulator in start_fun. Calls next_fun repeatedly until completion. Calls after_fun at the end of enumeration. Can be used for single-pass (streaming) file generation.
  18. Stream.resource/3:

      Stream.resource(
        fn -> File.open!("sample") end,
        fn file ->
          case IO.read(file, :line) do
            data when is_binary(data) -> {[data], file}
            _ -> {:halt, file}
          end
        end,
        fn file -> File.close(file) end
      )
  19. Stream.into/2: Used to redirect the output of a stream into a Collectable. Useful when the Collectable represents the outside world (e.g. a File handle).
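A sketch of that redirection (the file paths are hypothetical): every line pulled from the input stream is pushed into the output Collectable, so the whole file never sits in memory at once.

```elixir
# Hypothetical paths; the input file is created here for demonstration.
input = Path.join(System.tmp_dir!(), "stream_into_input.txt")
output = Path.join(System.tmp_dir!(), "stream_into_output.txt")
File.write!(input, "hello\nworld\n")

# Pull lines from the input, transform them, and push each one
# into the output file as it is produced.
input
|> File.stream!()
|> Stream.map(&String.upcase/1)
|> Stream.into(File.stream!(output))
|> Stream.run()

File.read!(output)  # "HELLO\nWORLD\n"
```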
  20. Task.async_stream/3: Emits a Stream which runs the given function once for each element in the Enumerable. With options: max_concurrency, ordered. Executed with Stream.run/1.
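For example (a minimal sketch): squaring numbers on up to two concurrent tasks, with results kept in input order. Task.async_stream/3 wraps each result in an {:ok, value} tuple, and the stream can also be forced with Enum functions rather than Stream.run/1.

```elixir
# Runs fn on each element in up to 2 concurrent tasks;
# ordered: true preserves the input order of results.
1..5
|> Task.async_stream(fn i -> i * i end, max_concurrency: 2, ordered: true)
|> Enum.map(fn {:ok, result} -> result end)
# [1, 4, 9, 16, 25]
```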
  21. Plug.Conn.chunk/2: Sends the response to the client incrementally. Requires a connection which has been configured with send_chunked/2. Leaves enumeration to the programmer.
  22. Plug.Conn.chunk/2:

      Enum.reduce_while(enum, conn, fn chunk, conn ->
        case Plug.Conn.chunk(conn, chunk) do
          {:ok, conn} -> {:cont, conn}
          {:error, :closed} -> {:halt, conn}
        end
      end)
  23. StreamData: Provides value generators in the form of Streams. Can be used for data generation. Very helpful for property-based testing.
  24. StreamData:

      use ExUnitProperties

      property "bin1 <> bin2 always starts with bin1" do
        check all bin1 <- binary(),
                  bin2 <- binary() do
          assert String.starts_with?(bin1 <> bin2, bin1)
        end
      end
  25. Packmatic: Generates a Zip64 archive stream. Comes with Plug integration to enable fast download. Added refinements based on prior community work.
  26. Packmatic:

      entries = [
        [source: {:file, "/tmp/foo.pdf"}, path: "foo/bar.pdf"],
        [source: {:url, "https://example.com/baz.pdf"}, path: "baz.pdf"]
      ]

      Packmatic.build_stream(entries)
      |> Packmatic.Conn.send_chunked(conn, "download.zip")
  27. Why Use Streams? Minimise latency: reduce time-to-first-byte and minimise jank risk (system saturation). Reduce resource usage: eliminate peaks and avoid wasted work. Reduce complexity: leverage composition for shorter and more succinct programs.