Service Discovery For Machines And Humans - OOP conference 2017

SERVICE DISCOVERY FOR MACHINES AND HUMANS
OLIVER WEHRENS @OWEHRENS
E-POST DEVELOPMENT GMBH

SERVICE DISCOVERY FOR MACHINES AND HUMANS - BACKGROUND
▸ Chief Architect, E-POST Development GmbH
▸ Building E-POST System with > 80 Services
▸ Building Applications for DHL
▸ ~ 100 developers

WHAT DID WE LEARN RUNNING A SYSTEM WITH > 30 (SOMETIMES MICRO) SERVICES

ONCE THERE WAS A MONOLITH.
▸ One (big) codebase
▸ Few teams working on it
▸ Some communication overhead
[Figure: THE MONOLITH]

THEN YOU TRY TO SCALE YOUR ORGANIZATION…
[Figure: THE MONOLITH]

THEN YOU TRY TO SCALE YOUR ORGANIZATION…
[Figure: THE MONOLITH ?????]

YOU DECIDE TO GO FOR MICROSERVICE ARCHITECTURE.
▸ Only one team works on a service's codebase
▸ Independent deployment of business functionality
▸ Less blocking communication
[Figure legend: = Microservice]

EVERYTHING IS AWESOME !

EVERYTHING IS AWESOME ?

MICROSERVICE ARCHITECTURE

MICROSERVICE ARCHITECTURE - WHERE ARE ALL MY SERVICES ?

MICROSERVICE ARCHITECTURE - WHO WRITES ALL THIS ?

SERVICE DISCOVERY FOR MACHINES AND HUMANS

MACHINES

MANY SERVICES - MOVING PROBLEMS TO A DIFFERENT LEVEL
▸ How do services find each other?
▸ How do they find all instances?
▸ Fail over?
▸ Resilience ?

TALKING TO OTHER SERVICES
▸ HA-Proxy with config in tools like Puppet
▸ Configure your service to talk to a static list of local HA-Proxy services
▸ Done.

PROBLEMS
▸ Static list of services in e.g. Hiera
▸ Changes in service configuration are painful
▸ Changes often need a Puppet run & restart
▸ Machines are always only updated; we never create a new one and throw away the old one (only sometimes when sizing hosts)
▸ All hosts are health checking each other all the time

GOALS
▸ We had use cases for client-side load balancing (in code), not HA-Proxy
▸ Immutability of services; updating services would also make manual work obsolete
▸ Get rid of service restarts when sizing
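
Client-side load balancing "in code" can be illustrated with a minimal sketch. It assumes a fixed list of instance addresses and a caller-supplied request function; the class name, the addresses and `fake_request` are made up for illustration and are not the E-POST implementation.

```python
from itertools import cycle


class RoundRobinClient:
    """Client-side load balancer: rotates over instances, skipping failures."""

    def __init__(self, instances):
        self._instances = list(instances)
        if not self._instances:
            raise ValueError("at least one instance is required")
        self._ring = cycle(self._instances)

    def call(self, request):
        """Try each instance at most once, starting at the next one in the ring."""
        last_error = None
        for _ in range(len(self._instances)):
            instance = next(self._ring)
            try:
                return request(instance)
            except ConnectionError as error:
                last_error = error  # fail over to the next instance
        raise ConnectionError("all instances failed") from last_error


def fake_request(instance):
    # Hypothetical transport for the demo: the instance on 10.0.0.1 is down.
    if instance.startswith("10.0.0.1:"):
        raise ConnectionError(instance)
    return f"ok from {instance}"


if __name__ == "__main__":
    client = RoundRobinClient(["10.0.0.1:8080", "10.0.0.2:8080"])
    print(client.call(fake_request))
```

The point of the sketch is that fail over happens inside the calling service, with no proxy restart involved.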

CATALOG OF AVAILABLE SERVICES

HOW WOULD IT WORK?
▸ Service announces itself to a directory (self-registration or third-party registration)
▸ Modify your own service vs. install just another piece of software
▸ Directory checks at intervals whether the service is still there
▸ Either dead or alive
▸ Client asks directory for a specific service and gets an answer
▸ Resilience is still important
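
The announce / check / ask cycle above can be sketched as a tiny in-memory directory. This is an illustration under assumptions, not a real product: the class name `Directory`, the TTL-based liveness rule and the addresses are invented here, and a repeated `register` call plays the role of the heartbeat.

```python
import time
from dataclasses import dataclass, field


@dataclass
class Directory:
    """In-memory service directory with self-registration and TTL expiry."""

    ttl_seconds: float = 30.0
    # service name -> {address: time of last heartbeat}
    _entries: dict = field(default_factory=dict)

    def register(self, service, address, now=None):
        """Announce an instance; calling it again acts as a heartbeat."""
        now = time.monotonic() if now is None else now
        self._entries.setdefault(service, {})[address] = now

    def lookup(self, service, now=None):
        """Return the addresses whose last heartbeat is within the TTL."""
        now = time.monotonic() if now is None else now
        instances = self._entries.get(service, {})
        return sorted(
            address
            for address, seen in instances.items()
            if now - seen <= self.ttl_seconds
        )


if __name__ == "__main__":
    directory = Directory(ttl_seconds=10)
    directory.register("billing", "10.0.0.1:8080")
    print(directory.lookup("billing"))
```

An instance that stops sending heartbeats simply drops out of `lookup` once its TTL has passed, which is the "either dead or alive" decision in its simplest form.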

SERVICE DISCOVERY FOR MACHINES
▸ Several Solutions
▸ Roll your own - You
▸ etcd - CoreOS
▸ ZooKeeper - Apache
▸ Eureka - Netflix
▸ Consul - HashiCorp

CONSUL BY HASHICORP - SERVER
▸ Cluster of at least 3
▸ Needs to be up all the time - more complexity
▸ Can connect data centers - for multiple data centers or very strict network/firewall rules
▸ Can also act as DNS (.consul)
▸ Restricting read/write access with ACLs
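
For the DNS interface, Consul answers service lookups under names of the form `[tag.]<service>.service[.<datacenter>].<domain>`, with `consul` as the default domain. A small helper (the function name and the example values are hypothetical) shows how such names are composed:

```python
def consul_dns_name(service, tag=None, datacenter=None, domain="consul"):
    """Build the DNS name under which Consul answers a service lookup."""
    parts = []
    if tag:
        parts.append(tag)
    parts += [service, "service"]
    if datacenter:
        parts.append(datacenter)
    parts.append(domain)
    return ".".join(parts)


if __name__ == "__main__":
    print(consul_dns_name("billing"))
    print(consul_dns_name("billing", tag="v2", datacenter="dc1"))
```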

CONSUL BY HASHICORP - CLIENT
▸ Installed locally on each client
▸ Checks service health on a given URL
▸ Propagates status with gossip protocol (https://en.wikipedia.org/wiki/Gossip_protocol)
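
As a rough sketch of registering a service with the local agent, the following builds (but does not send) a request for the agent's `/v1/agent/service/register` endpoint with an HTTP health check. The service name, port, `/health` path and the 10s interval are assumptions chosen for the example, not values from the talk.

```python
import json
import urllib.request


def registration_request(name, port, health_path="/health",
                         agent="http://127.0.0.1:8500"):
    """Build a PUT request that registers a service with an HTTP check."""
    payload = {
        "Name": name,
        "Port": port,
        "Check": {
            # The agent polls this URL; a 2xx answer counts as healthy.
            "HTTP": f"http://127.0.0.1:{port}{health_path}",
            "Interval": "10s",
        },
    }
    return urllib.request.Request(
        url=f"{agent}/v1/agent/service/register",
        data=json.dumps(payload).encode("utf-8"),
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


if __name__ == "__main__":
    request = registration_request("billing", 8080)
    print(request.get_method(), request.full_url)
    # urllib.request.urlopen(request) would send it to a running agent.
```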

USAGE OF CONSUL AT E-POST
▸ Client rolled out on all hosts automatically
▸ Registration
▸ HA-Proxy Template
▸ Lightweight wrapper around Netflix Ribbon with Consul integration
▸ DNS implementation
▸ Keys under which services will be found are defined by teams
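
The HA-Proxy template idea can be sketched as rendering a backend section from whatever instances the directory currently reports. This is a stand-in written for illustration, not the template used at E-POST; the function name and addresses are hypothetical.

```python
def render_backend(service, instances):
    """Render an HAProxy backend section for the given (host, port) pairs."""
    lines = [f"backend {service}", "    balance roundrobin"]
    for index, (host, port) in enumerate(instances, start=1):
        lines.append(f"    server {service}-{index} {host}:{port} check")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_backend("billing", [("10.0.0.1", 8080), ("10.0.0.2", 8080)]))
```

Regenerating this section whenever the instance list changes replaces the static list that previously lived in configuration management.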

CONSUL AT E-POST (II)
▸ Took about 120 days to get infrastructure right
▸ Zones
▸ Templates
▸ Networking
▸ ACLs
▸ Eliminating the legacy base takes very long when > 90 developers are involved

SERVICE DISCOVERY FOR MACHINES - CONCLUSION
▸ If you have > 30 services it is worth investigating
▸ No static distribution of service addresses
▸ Easier to automate, less effort in the mid/long run
▸ Use existing software, don’t write your own

MICROSERVICE ARCHITECTURE - WHERE ARE ALL MY SERVICES ?

MICROSERVICE ARCHITECTURE - WHO WRITES ALL THIS ?

HUMANS

WE HAVE A LOT OF SERVICES AND MACHINES CAN FIND THEM … HOW ABOUT HUMANS?
Architecture Board - 2015

MICROSERVICES MEAN …
▸ More Services.
▸ More Teams.
▸ More Communication.
▸ More Documentation.
▸ More of everything.

PROBLEMS TO SOLVE
▸ What is available on the platform ?
▸ What does the whole platform look like ?
▸ Who is responsible for a service ?
▸ How to get more information about a service ?
▸ Which Software versions and licenses do we use ?

STANDARDS OR DIE.

… OR HAVE METADATA (IN ONE PLACE).

WIKI - WHERE INFORMATION KILLS ITSELF

WIKI (RANT)
▸ Created once
▸ Rarely updated
▸ Nothing can be found
▸ Developers just don’t like updating wikis
▸ If updated, nobody knows whether it is up to date
▸ Use the source, Luke.

… OR COLLECT METADATA.

MANUAL AUTOMATED BUILD TIME (SHOULD) RUNTIME (ACTUAL)

MANUAL - THINGS THAT DON'T CHANGE OFTEN

AUTOMATED - EVERYTHING ELSE (AS MUCH AS YOU CAN)

BUILD TIME
▸ Everything available at Code Level & CI System
▸ VCS information
▸ License & Dependency information
▸ Build chain information
▸ Code Stats (Age, Committer, Language)

RUNTIME
▸ VM Level
▸ Network connections
▸ Setup Level
▸ Sizing

SITUATION
▸ ~ 250 Source code repositories
▸ 40 Services
▸ 12 Teams

STATUS QUO SERVICE REGISTRY BACK THEN … Klaus

OUR REQUIREMENTS
▸ Every VCS root needs documentation
▸ Description
▸ Type
▸ Team name
▸ Owner
▸ VCS & CI Information

IN THE BEGINNING (Q4/2014) - THE GOOD
▸ Started with Wiki
▸ Description in YAML file in the Source Code
▸ Executed during CI Run
▸ Automated Code Dependencies via Maven, Gradle, SBT
▸ Formatted to HTML and uploaded to Wiki
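
The "formatted to HTML" step of such a CI job could look roughly like this. It assumes the YAML description has already been parsed into a flat dictionary; the function name and the field values are made up, and the actual upload to the wiki is left out.

```python
from html import escape


def to_html(metadata):
    """Render a flat metadata mapping as an HTML table, escaping all values."""
    rows = "".join(
        f"<tr><th>{escape(str(key))}</th><td>{escape(str(value))}</td></tr>"
        for key, value in metadata.items()
    )
    return f"<table>{rows}</table>"


if __name__ == "__main__":
    print(to_html({"name": "Billing", "team": "Payments", "type": "service"}))
```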

IN THE BEGINNING (Q4/2015) - THE BAD
▸ Search was limited
▸ Data could not be queried for additional benefit
▸ We had a couple of other places where we distributed information about services, what they do, how they get deployed etc.
▸ No immediate benefit, no problem when outdated

WE NEED SOMETHING BETTER.

OUR REQUIREMENTS
▸ General: Team name, Owner, a short name, description, type
▸ Runtime: memory needs, cpu, machine type
▸ Service: what do I provide, which port, protocol, private/public
▸ Dependencies to other services
▸ Software Dependencies and Licenses
▸ Query DSL
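
One way to picture these requirements is as a record per service. The sketch below is only a reading of the bullet list: all class and field names are hypothetical and do not claim to match any existing metadata format.

```python
from dataclasses import dataclass, field


@dataclass
class ProvidedInterface:
    name: str
    port: int
    protocol: str
    public: bool = False  # private by default


@dataclass
class ServiceMetadata:
    # General
    short_name: str
    description: str
    type: str
    team: str
    owner: str
    # Runtime
    memory_mb: int = 0
    cpu_cores: int = 0
    machine_type: str = ""
    # Service, dependencies, licenses
    provides: list = field(default_factory=list)
    depends_on: list = field(default_factory=list)
    licenses: dict = field(default_factory=dict)  # library -> license

    def missing_general_fields(self):
        """Names of the mandatory general fields that were left empty."""
        required = ("short_name", "description", "type", "team", "owner")
        return [name for name in required if not getattr(self, name).strip()]


if __name__ == "__main__":
    billing = ServiceMetadata(
        short_name="billing", description="Creates invoices", type="service",
        team="Payments", owner="",
        provides=[ProvidedInterface("billing-api", 8443, "https")],
    )
    print(billing.missing_general_fields())
```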

WHAT’S OUT THERE? System-Z Services Directory NOT OPEN SOURCE

PIVIO HTTP://PIVIO.IO

PIVIO
▸ A (very simple) system to describe service meta data
[Figure: CLIENT, SERVER (WRAPPER), WEB, DB (ELASTICSEARCH); data exchanged as YAML and JSON]

WHAT DOES IT LOOK LIKE ?

PIVIO YAML
▸ One pivio.yaml file in root vcs directory
▸ Format at: http://pivio.io/docs/#section-dataformat
▸ Extendable to your own needs

OVERVIEW

DETAIL

DETAIL (II)

ELASTICSEARCH-BASED QUERY
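
To give an idea of what such a query looks like, the following builds a request body in the Elasticsearch query DSL that matches documents on two fields. The field names `owner` and `type` are assumed from the requirements list and may not match the real index mapping.

```python
import json


def services_by_owner_and_type(owner, service_type):
    """Build a bool query: both match clauses must hold for a document."""
    return {
        "query": {
            "bool": {
                "must": [
                    {"match": {"owner": owner}},
                    {"match": {"type": service_type}},
                ]
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(services_by_owner_and_type("Payments", "service"), indent=2))
```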

OUR REQUIREMENTS
▸ General: Team name, Owner, a short name, description, type
▸ Runtime: memory needs, cpu, machine type
▸ Service: what do I provide, which port, protocol, private/public
▸ Dependencies to other services
▸ Software Dependencies and Licenses
▸ Query DSL ✅

DATA QUALITY
▸ … is key!
▸ Organisational changes are sometimes not reflected
▸ Owners don’t benefit from quality
▸ Make use of the data that is relevant to the creator!
▸ You will have dirty data.

USE CASES FOR E-POST DEVELOPMENT
▸ Machine sizing for OpenNebula
▸ Service names for Consul
▸ Generating Documentation
▸ General information about all services
▸ Visualize dependencies of teams and bounded contexts
▸ Impact analysis of changing APIs
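
Impact analysis of a changing API boils down to walking the dependency graph backwards. A minimal sketch, assuming the metadata has been reduced to a mapping from each service to the services it depends on (all names invented):

```python
from collections import deque


def impacted_services(depends_on, changed):
    """All services that directly or transitively depend on `changed`."""
    # Invert the graph: dependency -> services that use it.
    dependents = {}
    for service, dependencies in depends_on.items():
        for dependency in dependencies:
            dependents.setdefault(dependency, set()).add(service)

    seen = set()
    queue = deque([changed])
    while queue:
        current = queue.popleft()
        for dependent in dependents.get(current, ()):
            if dependent not in seen and dependent != changed:
                seen.add(dependent)
                queue.append(dependent)
    return sorted(seen)


if __name__ == "__main__":
    graph = {"web": ["billing", "auth"], "billing": ["auth"], "auth": []}
    print(impacted_services(graph, "auth"))
```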

SOFTWARE VERSION DEPENDENCY CHECK (60 LINES OF JS)
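
The check shown on the slide was written in JavaScript and is not reproduced here. As an assumption about what a check of this kind might do, the Python sketch below lists libraries that are in use in more than one version across services (input shape and names are hypothetical):

```python
from collections import defaultdict


def divergent_versions(services):
    """Map library -> {version: [services]} for libraries used in >1 version."""
    usage = defaultdict(lambda: defaultdict(list))
    for service, dependencies in services.items():
        for library, version in dependencies.items():
            usage[library][version].append(service)
    return {
        library: {version: sorted(users) for version, users in versions.items()}
        for library, versions in usage.items()
        if len(versions) > 1
    }


if __name__ == "__main__":
    print(divergent_versions({
        "billing": {"jackson": "2.8.1", "guava": "19.0"},
        "auth": {"jackson": "2.7.0", "guava": "19.0"},
    }))
```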

SERVICE DEPENDENCY CHECK (284 LINES OF JS)
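
Again the original is JavaScript and not shown. One plausible core of a service dependency check, sketched in Python under the assumption that each service declares what it provides and what it depends on, is to report dependencies that nobody provides:

```python
def unresolved_dependencies(provides, depends_on):
    """Map service -> declared dependencies that no service provides."""
    provided = {name for names in provides.values() for name in names}
    report = {}
    for service, dependencies in depends_on.items():
        missing = sorted(set(dependencies) - provided)
        if missing:
            report[service] = missing
    return report


if __name__ == "__main__":
    print(unresolved_dependencies(
        provides={"billing": ["billing-api"], "auth": ["auth-api"]},
        depends_on={"web": ["billing-api", "mail-api"], "billing": ["auth-api"]},
    ))
```

Dirty metadata shows up quickly in a report like this, which gives owners a direct reason to keep their descriptions current.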

SERVICE DISCOVERY FOR HUMANS - CONCLUSION
▸ Big Picture with microservices is hard
▸ Metadata helps to understand the system
▸ Needs to be easily editable (e.g. in the IDE)
▸ Needs to be useful to the creator
▸ Metadata will be dirty
▸ Link build time and runtime information
▸ Build tools on top of it, have a Query Language!

IN A SYSTEM WITH > 30 SERVICES INVEST IN SERVICE DISCOVERY FOR MACHINES AND HUMANS.

THANKS. QUESTIONS? HTTP://PIVIO.IO @OWEHRENS HTTP://SPEAKERDECK.COM/OWEHRENS

GRAPHICAL RECORDING OF THE TALK BONUS FEATURE
