Monorepos - Why and How

Monorepos Why and How presented at Elixir Berlin Meetup on
2017/07/13 @ Blacklane

Me • Roman Heinrich (@mindreframer on all social accounts) •
Ruby / Devops / React / Elixir / Go dev

Disclaimer • This talk does not have any special relationship
to Elixir • But it will (hopefully) broaden your horizon a bit and might help in future decisions

What • What is a monorepo and how do you
eat it?

Mono != Monolith • Most startups start with a monolithic
implementation • By deﬁnition also a “monorepo” • The most pragmatic decision on start and great for quick shipping

What • Some stay with monolithic apps for the life
of the product / company • This is not that bad, when approached carefully and with good modularisation in mind

What • Multi-repo discussion arises when SOA / microservices enter
the ﬁeld • Usually it looks reasonable to stick with 1 service -> 1 repo approach • This is familiar and we know how to deal with basic requirements for simple projects

What • But things start be become quite tricky down
the road • It becomes hard / impossible to test applications end-2- end • Running the “application” locally is very hard, when you have to work against a pinned versions of other services

What • The number of repositories explodes and it feels
like herding cats • Impossible to tell whether something is used or not • So we just keep adding them, because we are afraid to break something… • Onboarding of new colleagues takes a lot of time • Many other non-trivial problems…

What • Dependencies are implicit or are encoded in the
CI - pipelines • Sometimes you might have a diagram of the overall architecture that is updated occasionally and is just a MAP, but not the real thing (might be incorrect without noticing it)

Magic Trick • Explicit dependency graph (even cross-languages) • Programmatically
accessible and when used properly very-very helpful • Content dependent hashing, not related to GIT SHAs or similar things

Dependency Graph • This makes your build predictable and very
fast • It allows you to statically analyse dependencies and do awesome things with it

Alternative • Single big repository + special tooling

BIG 3 • Google (https://bazel.build/) • Facebook (https://buckbuild.com/) • Twitter
(http://www.pantsbuild.org/)

BAZEL • In Java • Stripe • Dropbox • Pinterest
• Improbable • Google

BUCK • also Java • https://buckbuild.com/concept/what_makes_buck_so_fast.html

PANTS • Mostly Python • V2 will be in Rust!
• http://www.pantsbuild.org/powered_by.html • https://github.com/pantsbuild/pants/tree/master/src/rust/engine

Origin • The idea for a huge monorepo originated from
Google and was popularised by Ex-Googlers, when they joined other companies

GOOGLE • https://arxiv.org/pdf/1702.01715.pdf - Software Engineering at Google (2017/01) •
Most of Google’s code is stored in a single uniﬁed source- code repository, and is accessible to all software engineers at Google • Exception: some high-value or security-critical pieces of code for which read access is locked down more tightly

GOOGLE • As of January 2015, this 86 terabyte repository
contained a billion ﬁles, including over 9 million source code ﬁles containing a total of 2 billion lines of source code, with a history of 35 million commits and a change rate of 40 thousand commits per work day

Example BUILD ﬁle • http://www.pantsbuild.org/build_ﬁles.html

PLZ build • https://please.build/quickstart.html • New contender • Also from
ex-Googlers • By thoughtmachine.net • Vault OS - An Operating System for Banks

PLZ build • A single Go binary

PLZ build

WHY USE PLEASE INSTEAD OF BAZEL, BUCK OR PANTS? •
All four of these systems are quite closely related in the scheme of things, being inspired by (or in Bazel's case, a direct open sourcing of) Google's Blaze. • Several of us had worked at Google and used Blaze extensively there; we were excited about it being open sourced as Bazel but by then we were already using Please internally. It's a great system but we have slightly diﬀerent goals, speciﬁcally we're aiming Please at being lighter weight and pushing the boundaries of what can be done within the BUILD language. Since Please is written in Go there's no runtime dependency on the JVM.

PLZ Example • https://github.com/mindreframer/please-go-example

Demo • https://github.com/mindreframer/please-go-example

Links • https://github.com/gophersgang/go-kitchensink/blob/ master/docs/monorepo.md

Monorepo - Pros • Very scalable approach people / code
base wise • You probably won't ﬁnd any hard blockers here, since it was proven to work for companies like Facebook / Twitter / Google • It is easier to contribute across teams because you work on the same physical codebase • Predictable fast builds, high team morale

Monorepo - Cons • Requires an all-in decision • Requires
quite some tooling to become productive • Some operations for daily work (GIT / DVCS) might be slower than with smaller codebases • Not much experience in real life if you did not work for the couple bigger companies using it

Fin Thank you for listening! Q & A

Monorepos - Why and How

Monorepos - Why and How

Roman Heinrich

More Decks by Roman Heinrich

Other Decks in Programming

Featured

Transcript