Automated Type Contracts Generation

Automated  Type Contracts Generation — Valentin Fondaratov Hiroshima, September 2017
A tale about better code analysis @valich @fondarat

Why Types Matter An IDE Perspective

An IDE Perspective — • Where does the method go?
(aka Resolution)

(aka Resolution) • Bug prediction (aka NameError)

(aka Resolution) • Bug prediction (aka NameError) • IDE goodness (aka Speed)

(aka Resolution) • Bug prediction (aka NameError) • IDE goodness (aka Speed) • Rename refactoring (aka Safety),

Why Types Matter A non-IDE Perspective

RuboCop  is an industry standard solution — © http://batsov.com/articles/2014/09/05/rubocop-logo/ Inspecting
2184 files ....................................................................................................................... ..............................................................................................................W........ ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................................................................................................... ....................................W.................................................................................. ....................................................................................................................... ........................ Offenses: actionpack/lib/action_dispatch/system_test_case.rb:109:7: C: Use 2 (not 11) spaces for indentation. SystemTesting::Browser.new(using, screen_size) ^^^^^^^^^^^ actionpack/lib/action_dispatch/system_test_case.rb:112:16: W: end at 112, 15 is not aligned with driver = if at 108, 6. end ^^^ guides/rails_guides/markdown/renderer.rb:104:18: W: end at 104, 17 is not aligned with path = case at 97, 10. end ^^^ guides/rails_guides/markdown/renderer.rb:106:1: C: Extra blank line detected. 2184 files inspected, 4 offenses detected

Missed errors — RuboCop does its job quite well suggesting
following Ruby Code Style.    The real error on line 5 is missed, though.  (downcase is not a method of Hash) Inspecting 1 file C Offenses: rubocop_fails.rb:1:1: C: Missing frozen string literal comment. x = "123" ^ rubocop_fails.rb:1:5: C: Prefer single-quoted strings when you don't need string interpolation or special symbols. x = "123" ^^^^^ rubocop_fails.rb:4:5: C: Space inside { missing. x = {:a => '1', :b => '2', :c => '3'} ^ rubocop_fails.rb:4:6: C: Use the new Ruby 1.9 hash syntax. x = {:a => '1', :b => '2', :c => '3'} ^^^^^ rubocop_fails.rb:4:17: C: Use the new Ruby 1.9 hash syntax. x = {:a => '1', :b => '2', :c => '3'} ^^^^^ rubocop_fails.rb:4:28: C: Use the new Ruby 1.9 hash syntax. x = {:a => '1', :b => '2', :c => '3'} ^^^^^ rubocop_fails.rb:4:37: C: Space inside } missing. x = {:a => '1', :b => '2', :c => '3'} ^ 1 file inspected, 7 offenses detected

Static analysis can detect more — RubyMine, for example, can
detect such errors. Notice that this unresolved warning   is not screaming red. We’ll see why.

Ruby DSLs  are hard  to analyse — Ruby core features,
readability and elegance, have a price. Inability  to properly verify the programs   is one of the compromises.  What type does @photo variable have?

“Beware of bugs in the above code;  I have only
proved it correct, not tried it” Donald E. Knuth

“Beware of bugs in the above code;  I have only
proved it correct, not tried it” Donald E. Knuth “Program testing can be used   to show the presence of bugs,   but never to show their absence!” Edsger W. Dijkstra

Coverage is a lie — Any composition of 2+ branching
methods requires cross product of branches tests. Without static analysis and proper inspections we are limited by • Running tests (uncaught bugs we call “regressions”), • Looking through the code with debugger and verifying manually (after each change??), • Testing with users in production :)

There is so much more  we can have by running 
the tests Than just checking the answers

Analysing RSpec::Matchers —

//demo/gif —

How it works — The process behind the magic:

How it works — The process behind the magic: 1.
Attach to Ruby VM to collect the types for all calls,

Attach to Ruby VM to collect the types for all calls, 2. Transform raw call data into type contracts,

Attach to Ruby VM to collect the types for all calls, 2. Transform raw call data into type contracts, 3. Collect and share the data.

Retrieving the data with TracePoint API — TracePoint is a
class allowing   to hook several Ruby VM events  like method calls and returns   and get any data through Binding

Unspecified arguments — One can’t distinguish default parameter values from
the passed ones.  If a method is defined dynamically, there is no way to derive which types  will be passed.    What type does foo() return? (Int) -> Int (String) -> String foo() = ?

Optional parameters — YARV compiles code into the bytecode. Note
that instructions for filling in   the default values are present,   independent on usages. When optional parameters are passed,  VM just skips these instructions.

Optional parameters — rb_control_frame_t const VALUE *pc; const rb_iseq_t *iseq;
PC

Resources — http://patshaughnessy.net/ruby-under-a-microscope    https://silverhammermba.github.io/emberb/c/    https://github.com/ruby/ruby

Type Tuples — str.split(pattern=nil, [limit]) -> anArray

Type Tuples — str.split(pattern=nil, [limit]) -> anArray Method calls Type
tuples

Too much data — str.split(pattern=nil, [limit]) -> anArray

Too much data — str.split(pattern=nil, [limit]) -> anArray https://boardgamegeek.com/image/1955740/game-goose http://www.megahowto.com/wp-content/uploads/2010/11/how-to-make-board-games.jpg

Too much data — str.split(pattern=nil, [limit]) -> anArray split(<String>, nil)
? ! https://boardgamegeek.com/image/1955740/game-goose http://www.megahowto.com/wp-content/uploads/2010/11/how-to-make-board-games.jpg

Too much data — str.split(pattern=nil, [limit]) -> anArray ? ?
split(<String>, <String>) https://boardgamegeek.com/image/1955740/game-goose http://www.megahowto.com/wp-content/uploads/2010/11/how-to-make-board-games.jpg

Too much data —

A worse example —

Template automatons —

Equality masks —

Equality masks — Param0 Param1 Param2 Equals to Param0 Equals
to Param1 {} {1} {11}

Equality masks —

Merge —

Merge — + Merge

Merge — + Merge Quack inference?

//demo/ —

Q. How do I collect the data?

Q. How do I collect the data? A. Run tests.

Q. How do I collect more data?

Q. How do I collect more data? A. Run more tests.

Q. How do I collect more data? A. Run more tests. Q. How do I collect so much data that the type contracts obtain exhaustiveness, i.e. become true?

Q. How do I collect more data? A. Run more tests. Q. How do I collect so much data that the type contracts obtain exhaustiveness, i.e. become true? A. Cooperate with others to run even MORE TESTS.

Community effort — Project1 Project2 … ProjectN

Community effort — Project1 Project2 … ProjectN Contracts1 Contracts2 ContractsN
Spec Spec Spec

Community effort — Project1 Project2 … ProjectN Contracts1 Contracts2 ContractsN
Spec Spec Spec “Devise Annotated”, ‘4.2’ M E R G E

Community effort — Contract  Diffs Rails 5 4.2 4.1 capybara 
+selenium 2.12.0 2.11.0 “Ruby weekly”.tar.gz Rubyists Cloud Storage

A single team may not have  100% test coverage. A
community is likely to have.

Tooling —

Tooling — • IDE goodness

Tooling — • IDE goodness • Code verification

Tooling — • IDE goodness • Code verification • Guided
Optimization

Contribute — This might be a viable alternative to explicit 
type annotations which is contradictory.    We have a chance to make Ruby   much more “static” for analysis  while preserving its  power and beauty. 

jetbrains.com Thank you for your attention — jetbrains.com/ruby/ github.com/valich github.com/jetbrains/ruby-type-inference

Automated Type Contracts Generation

Automated Type Contracts Generation

Other Decks in Programming

Featured

Transcript