Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Digesting MRI by Studying Alternative Ruby I...

Christian Bruckmayer
September 23, 2019
96

Digesting MRI by Studying Alternative Ruby Implementations

Pointers, managing memory and static typing - writing C code is hard! However, most programming languages, including Matz's Ruby Interpreter (MRI), are implemented in a low level programming language. So you think without knowing these concepts, you can not contribute to Ruby? Wrong! Although MRI is implemented in C, fortunately there are Ruby's in Java, Go and even Ruby itself.

If you ever wanted to learn about Ruby internals without being a C expert, this talk is for you. Join me on my journey of re-implementing hash maps in JRuby, breaking bundler and actually learn to write (some) C code.

Christian Bruckmayer

September 23, 2019
Tweet

Transcript

  1. array = [1,2,3,4,5,6,7,8,9] # => [1, 2, 3, 4, 5,

    6, 7, 8, 9] puts array.minmax # => [1, 9] puts [array.min, array.max] # => [1, 9]
  2. require 'benchmark-driver' Benchmark.driver do |x| x.prelude <<~RUBY arr = (1..1000).map

    { rand } RUBY x.report %{ arr.minmax } x.report %{ [arr.min, arr.max] } end
  3. Warming up -------------------------------------- arr.minmax 36.970k i/s - 40.436k times in

    1.093742s (27.05μs/i) [arr.min, arr.max] 67.366k i/s - 72.435k times in 1.075249s (14.84μs/i) Calculating ------------------------------------- arr.minmax 36.784k i/s - 110.910k times in 3.015196s (27.19μs/i) [arr.min, arr.max] 67.324k i/s - 202.097k times in 3.001869s (14.85μs/i) Comparison: [arr.min, arr.max] : 67323.7 i/s arr.minmax : 36783.7 i/s - 1.83x slower
  4. def delete_prefix(prefix) %x{ if (!prefix.$$is_string) { #{prefix = Opal.coerce_to(prefix, String,

    :to_str)} } if (self.slice(0, prefix.length) === prefix) { return self.$$cast(self.slice(prefix.length)); } else { return self; } } end Opal
  5. "foobar".delete_prefix(:foo) # TypeError (no implicit conversion of Symbol into String)

    :foo.to_s # "foo" :foo.to_str # NoMethodError (undefined method `to_str' for :foo:Symbol)
  6. vs prefix = :Hello.to_s “Hello ID”.delete_prefix(prefix) class Prefix def to_str

    “Hello" end end “Hello ID”. delete_prefix(Prefix.new) Explicit Implicit
  7. class Path def to_s "/chris" end end puts "home" +

    Path.new # Traceback (most recent call last): # `+': no implicit conversion of Path into String (TypeError)
  8. /* Find an entry with KEY in table TAB. Return

    non-zero if we found it. Set up *RESULT to the found table entry key. */ int st_get_key(st_table *tab, st_data_t key, st_data_t *result) { st_index_t bin; st_hash_t hash = do_hash(key, tab); rb_p(key); .... }
  9. make: *** [encdb.h] Error 1 make: *** Waiting for unfinished

    jobs.... ../ruby/tool/transform_mjit_header.rb:5: [BUG] Segmentation fault at 0x0000000000000093 ruby 2.7.0dev (2019-09-03T19:17:53Z implement-ary-minmax ebebc4b80d) [x86_64-darwin18] -- Crash Report log information -------------------------------------------- See Crash Report log file under the one of following: * ~/Library/Logs/DiagnosticReports * /Library/Logs/DiagnosticReports for more details. Don't forget to include the above Crash Report log file in bug reports.
  10. class Hash def initialize @bins = Array.new(7) { [] }

    end private attr_reader :bins end Separate Chaining
  11. class Hash def initialize @bins = Array.new(7) { [] }

    end private attr_reader :bins end Separate Chaining
  12. def []=(key, value) if find(key) find(key).value = value else bin(key)

    << Entry.new(key, value) end end class Entry < Struct.new(:key, :value) end Separate Chaining
  13. If a particular storage location is referenced […], then it

    is likely that nearby memory locations will be referenced in the near future. Wikipedia
  14. class Hash def initialize @bins = Array.new(7) { [] }

    end private attr_reader :bins end Separate Chaining
  15. def bin(key) index = index(key) entry = bins[index] while entry

    != nil && entry.key != key index += 1 entry = bins[index] end index end Open Addressing
  16. def []=(key, value) if find(key) find(key).value = value else index

    = bin(key) bins[index] = key bins[index + 1] = value end end Open Addressing