The Shape of U -- Befriending Tensors

The Shape of U: Befriending Tensors
Nishant Sinha
Founder, Chief Scientist, OffNote Labs
nishant@offnote.co

A New Style of Programming

Programming with Examples and Knobs: Software 2.0
[Diagram labels: X, Y', W, Y]

Key Ingredients of Software 2.0 Design
• Example (X, Y) Generator: Annotator
• Model: Inductive Biases, Performance
• The Diff Engine: Loss Function
• Knob Tweaker: Optimizers
• Tensors: numpy, tensorflow, pytorch, mxnet, ...

As you dive into 2.0 ...
Weird shape modifiers that hijack your code:
• reshape/view, permute/transpose
• expand
• dot, matmult, batch_matmult
• reduce_*
• select API

Talk Roadmap
• Getting to know Tensors
  ◦ Dimensions and Shapes, Memory and Mental Models
  ◦ Flaws in Tensor APIs
• Befriending Tensors
  ◦ Tsalib: named shapes via Python types, library-agnostic
  ◦ Tsanley: A dynamic checker for named shapes

Part 1: Understanding Tensors (flaws)

Tensor Representations: Physical Layer
Image source: https://stackoverflow.com/questions/32034237/how-does-numpys-transpose-method-permute-the-axes-of-an-array
T: (i_1, ..., i_n) -> v
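The mapping T: (i_1, ..., i_n) -> v can be made concrete with a small sketch. It assumes a contiguous row-major layout with strides counted in elements; the helper names `row_major_strides` and `offset` are illustrative and do not come from any tensor library.

```python
def row_major_strides(shape):
    """Strides, in elements, for a contiguous row-major layout (assumed here)."""
    strides, step = [], 1
    for size in reversed(shape):
        strides.append(step)
        step *= size
    return tuple(reversed(strides))


def offset(index, strides):
    """Position in the flat buffer of the element at a multi-dimensional index."""
    return sum(i * s for i, s in zip(index, strides))


shape = (2, 3, 4)
strides = row_major_strides(shape)          # (12, 4, 1)
buffer = list(range(24))                    # the flat physical storage

# T(1, 2, 3) is a single lookup in the flat buffer: 1*12 + 2*4 + 3*1 = 23
value = buffer[offset((1, 2, 3), strides)]

# A transpose only permutes shape and strides; the buffer is untouched.
perm = (2, 0, 1)
t_shape = tuple(shape[p] for p in perm)     # (4, 2, 3)
t_strides = tuple(strides[p] for p in perm) # (1, 12, 4)

# Index (3, 1, 2) in the transposed view reaches the same stored element.
same_value = buffer[offset((3, 1, 2), t_strides)]
print(value, same_value)
```

Nothing in this layout records what each axis means, so the meaning of an axis has to be tracked by the developer.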

Why are Tensor Programs hard to write?
Tensor Libraries (numpy, tensorflow, pytorch, …):
A. Expose developers to physical memory model
  a. Unable to enforce semantic view of data
B. Implicit / ad hoc broadcast semantics
  ◦ Esoteric bugs!
C. Hard to write tensor transformations
  ◦ Tensor shapes are latent across the program
T: (i_1, ..., i_n) -> v

A. Exposure to Low-level Memory Model
API tied to physical, indexed layout. No semantic notion of ‘axis’.

B. Ad hoc Broadcast Semantics
Rule 1: Make the number of dimensions the same by padding the shorter shape with 1s on the left.
Rule 2: On a mismatch in the same dimension, stretch the ‘1’ to the larger size.
Example shape: (1, 32)
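The two rules can be written out as a short shape calculator. This is a sketch of the rules as stated on the slide, not a library implementation; `broadcast_shape` is an illustrative name.

```python
def broadcast_shape(a, b):
    """Result shape of broadcasting two shapes, following the two rules."""
    # Rule 1: pad the shorter shape with 1s on the left.
    n = max(len(a), len(b))
    a = (1,) * (n - len(a)) + tuple(a)
    b = (1,) * (n - len(b)) + tuple(b)

    out = []
    for x, y in zip(a, b):
        # Rule 2: on a mismatch, a size-1 dimension stretches to the other size.
        if x == y or y == 1:
            out.append(x)
        elif x == 1:
            out.append(y)
        else:
            raise ValueError(f"cannot broadcast {a} with {b}")
    return tuple(out)


# (32,) is padded to (1, 32), then the 1 stretches to 64.
print(broadcast_shape((32,), (64, 32)))   # (64, 32)

# The esoteric case: a vector and a column silently produce a matrix.
print(broadcast_shape((32,), (32, 1)))    # (32, 32)
```

The second call is the kind of result that rarely raises an error and therefore tends to surface later as a wrong loss value or an unexpected memory spike.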

C. Hard to read / write Tensor manipulating code
• Shapes are latent
• No standard way to track
• Ad hoc comments

Cryptic Shape Transformations

Tensor Representations: 10 ft -> 1e10 ft
T: (i_1, ..., i_n) -> v

Part 2: Befriending Tensors (name them)

Proposals for Naming Tensor Dimensions
• No consensus on named APIs
• Require deep changes to tensor libraries
• Long overdue, no action
  ◦ Recently, Pytorch added named dimensions support.
  ◦ Mesh-tensorflow, Tensor-networks
  ◦ xarray
Alexander Rush

Meanwhile: Seq2Seq RNN Decoder

tsalib: A Tensor Shape Annotation Library
• Goals
  ◦ Enable tracking tensor shapes in programs, as first-class citizens
  ◦ A language for tensor shapes
    ▪ Write crisp, intuitive shape transformations: no cryptic code!
    ▪ Semantic (named) shape assertions
    ▪ Allow abstraction
  ◦ Integrated with Python: immediately usable
    ▪ Work with arbitrary tensor backends: avoid deep integration
• Insight: Use Python 3.x (optional) type annotations
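The insight of using optional type annotations to carry named shapes can be illustrated without any tensor backend. The sketch below is not tsalib's API: `Dim`, `Tensor`, and `check_shapes` are hypothetical stand-ins, and the dimension sizes are made up for the example.

```python
import functools
import inspect
from dataclasses import dataclass


@dataclass(frozen=True)
class Dim:
    """Hypothetical named dimension: a readable name plus a concrete size."""
    name: str
    size: int


@dataclass
class Tensor:
    """Stand-in for a numpy/pytorch tensor; it only carries a shape."""
    shape: tuple


# Declare the dimensions once and reuse them in every annotation.
B, T, D = Dim("Batch", 10), Dim("SeqLength", 50), Dim("EmbedDim", 300)


def check_shapes(fn):
    """Compare each argument's shape with the tuple of Dims in its annotation."""
    sig = inspect.signature(fn)

    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        bound = sig.bind(*args, **kwargs)
        for arg_name, value in bound.arguments.items():
            expected = fn.__annotations__.get(arg_name)
            if not isinstance(expected, tuple):
                continue  # this argument carries no shape annotation
            want = tuple(dim.size for dim in expected)
            if tuple(value.shape) != want:
                names = ", ".join(dim.name for dim in expected)
                raise AssertionError(
                    f"{arg_name}: expected ({names}) = {want}, got {value.shape}"
                )
        return fn(*args, **kwargs)

    return wrapper


@check_shapes
def mean_pool(x: (B, T, D)) -> (B, D):
    # Pooling over the sequence axis drops T and keeps (B, D).
    return Tensor((x.shape[0], x.shape[2]))


pooled = mean_pool(Tensor((10, 50, 300)))   # passes the check
print(pooled.shape)
```

Because the annotation is ordinary Python, the signature `x: (B, T, D)` documents the shape for readers and is also available to a checker at runtime, with no change to the tensor library itself.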

Resnet.forward (original)

Resnet.forward
Full model: https://github.com/ofnote/tsalib/blob/master/models/resnet.py

tsalib: quick examples

Named Dimensions --> Grammar of Named Shapes, Shape Transformation API

From Transformer Attention Module (old vs new)
https://github.com/huggingface/transformers/blob/master/transformers/modeling_gpt2.py

Rewriting BERT with warp
• Enhances code readability
• Reduced BERT attention_layer fn by ~25 lines (200 -> 175)

tsalib: Lowering Named Shape Transformations
Under the hood:
• Sympy: symbolic expressions
• Fast lookup, Substitution

Warp: Big-step
'btd ->> b,d//2,t*2,1'
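To see what the big-step expression asks for, each side can be resolved to concrete sizes. The sketch below is not how tsalib evaluates shapes (the deck mentions Sympy for that); `resolve_shape` is a hypothetical helper, the sizes are made up, and `eval` is used only because the input is a fixed literal.

```python
import math


def resolve_shape(expr, sizes):
    """Evaluate a named shape expression such as 'b,d//2,t*2,1' to a tuple of ints."""
    # Compact form 'btd' is read as one single-letter dimension per character.
    terms = list(expr) if expr.isalpha() else expr.split(",")
    # eval sees only the dimension names; acceptable for a sketch, not for untrusted input.
    return tuple(int(eval(term.strip(), {"__builtins__": {}}, dict(sizes))) for term in terms)


sizes = {"b": 10, "t": 50, "d": 300}          # hypothetical dimension sizes

src = resolve_shape("btd", sizes)             # (10, 50, 300)
dst = resolve_shape("b,d//2,t*2,1", sizes)    # (10, 150, 100, 1)
print(src, "->>", dst)

# A valid big step must preserve the total number of elements.
assert math.prod(src) == math.prod(dst)
```

The element-count check is the minimum sanity condition; it does not say which sequence of view, permute, or expand operations gets from one shape to the other.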

Big -> Small Step Transformations
'b*t,n,h ->> bnth'
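One way to read this example is as a view step followed by a permute step. The sketch below derives those two small steps under strong assumptions: single-letter dimension names, a source that only uses products, and a target that only reorders. `small_steps` is an illustrative name and is not claimed to be tsalib's algorithm.

```python
def small_steps(src, dst):
    """Split 'src ->> dst' into the dims after a view and the axes for a permute."""
    # View step: expand every product such as 'b*t' into its separate factors.
    view_dims = [name for term in src.split(",") for name in term.split("*")]
    # Permute step: find where each target dimension sits in the viewed tensor.
    perm = [view_dims.index(name) for name in dst]
    return view_dims, perm


view_dims, perm = small_steps("b*t,n,h", "bnth")
print(view_dims)   # ['b', 't', 'n', 'h']
print(perm)        # [0, 2, 1, 3]
```

With a real backend, the first result corresponds to a reshape/view to (b, t, n, h) and the second to a permute/transpose with axes (0, 2, 1, 3).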

Tensors Considered Harmful
http://nlp.seas.harvard.edu/NamedTensor

Two Issues
• Manually write named shape annotations
• Manually write named shape assertions

Tsanley: dynamic shape checking, annotation

Tsanley: dynamic shape checking
• Interplay of AST parsing, Python tracing (trace, inspect)
  ◦ Filter a subset of functions to track
• Piggyback runtime shape checks on trace callbacks
  ◦ Track last executed statement in a function
• Access concrete shapes from runtime frame
  ◦ Match against named shape annotations
  ◦ Log shapes for post-code annotation
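The mechanism described above can be sketched with the standard library alone. This is a toy illustration, not Tsanley's implementation: the `Tensor` class, the string annotations, the `SIZES` table, and the traced `forward` function are all made up for the example. Annotations on local variables are not stored at runtime, which is why the sketch reads them from the AST.

```python
import ast
import sys

SOURCE = '''
def forward(x):
    h: "t,b" = x.transpose()
    y: "b" = h.sum_last()      # wrong on purpose: y actually has shape (t,)
    return y
'''

SIZES = {"b": 2, "t": 3}       # hypothetical dimension sizes


class Tensor:
    """Stand-in for a real tensor; it only carries a shape."""
    def __init__(self, *shape):
        self.shape = tuple(shape)

    def transpose(self):
        return Tensor(*reversed(self.shape))

    def sum_last(self):
        return Tensor(*self.shape[:-1])


# AST pass: map line number -> (variable name, named shape annotation).
annotations = {}
for node in ast.walk(ast.parse(SOURCE)):
    if (isinstance(node, ast.AnnAssign) and isinstance(node.target, ast.Name)
            and isinstance(node.annotation, ast.Constant)):
        annotations[node.lineno] = (node.target.id, node.annotation.value)

log = []


def make_local_tracer():
    last_line = [None]          # last statement executed in this frame

    def local_tracer(frame, event, arg):
        if event in ("line", "return"):
            prev = last_line[0]
            # The previous statement has finished, so its variable can be checked.
            if prev in annotations and prev != frame.f_lineno:
                name, ann = annotations[prev]
                if name in frame.f_locals:
                    expected = tuple(SIZES[d] for d in ann.split(","))
                    actual = frame.f_locals[name].shape
                    log.append((name, ann, actual, actual == expected))
            last_line[0] = frame.f_lineno
        return local_tracer

    return local_tracer


def global_tracer(frame, event, arg):
    # Filter: only trace the functions we care about.
    if event == "call" and frame.f_code.co_name == "forward":
        return make_local_tracer()
    return None


namespace = {}
exec(compile(SOURCE, "<sketch>", "exec"), namespace)

sys.settrace(global_tracer)
try:
    namespace["forward"](Tensor(2, 3))
finally:
    sys.settrace(None)

for name, ann, actual, ok in log:
    print(f"{name}: annotated ({ann}), got {actual}, {'ok' if ok else 'MISMATCH'}")
```

The same log of concrete shapes could be used in the other direction, to suggest annotations for variables that have none.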

Open Source Research @ OffNote Labs
Bridging the divide between real and imaginary research

Summary: Befriending Tensors
• Disconnect between the memory model of tensor libraries and the developer’s semantic model
• Naming dimensions helps bridge the disconnect
  ◦ A language for named shapes and transformations
• Naming has multiple benefits
  ◦ improves code readability, shape assertions
  ◦ semantic shape transformations
Shapes for arbitrary data?

Questions?
The Shape of U: Befriending Tensors

reshape_from_matrix (BERT)
