From Rails to microservices with Go: our experience with Gemnasium Enterprise

JP Boily
April 06, 2016

We gave this talk at the Web à Québec in front of over 100 people in April 2016.

http://webaquebec.org/#!de-rails-aux-micro-services-go-notre-experience-avec-gemnasium-enterprise


Transcript

  1. FROM RAILS TO MICROSERVICES WITH GO. Our experience with Gemnasium Enterprise. Jean-Philippe Boily @jipiboily | jipiboily.com | http://metrics.watch. Philippe Lafoucrière @plafoucriere | https://gemnasium.com. [JP] Explain that the presentation will be about microservices, and especially about the migration from a monolithic application to distributed microservices.
  2. JEAN-PHILIPPE “JP” BOILY • Founder of Metrics Watch (alerts for Google Analytics) • Consultant for SaaS conversion funnel implementation & improvement • Working remotely for US-based SaaS startups, the last one being Rainforest QA (YC backed) [JP] • MW: flexible & near-real-time alerts for GA • Now consultant, code or code+marketing (funnel & conversion improvement) • Tell more about the experience at Rainforest QA • employee #2 • first remote • Engineering Lead • helped build the distributed team, the product and more
  3. PHILIPPE “LAFOU” LAFOUCRIÈRE • Founder of Tech-Angels / Gemnasium • CS Engineer from Université de Technologie de Compiègne, France [PL]
  4. LET’S TALK ABOUT MICROSERVICES! • Why? • Genesis • Criteria for architecture • Anatomy of a microservice • Conclusion [PL] Quick poll about the attendees’ experience with microservices.
  5. DEFINITIONS. Monolithic application: self-contained and independent from other computing applications. Microservices: an approach to developing a single application as a suite of small services, each running in its own process and communicating with lightweight mechanisms. [JP]
  6. RAINFOREST QA. Easier to onboard people. Easier to reason about. Small steps, no long-term goal to rewrite. [JP] Rainforest is a company that does continuous QA: full QA in ~20 minutes. Onboarding was getting harder and harder. The experience started in between: not microservices, but having some parts of the app as external services, without a full-blown microservice architecture. New applications were built on the side instead of extending the existing one (e.g. social accounts). Many microservices in production now, using Ruby, Go, Elixir, Crystal and maybe even more…?
  7. GEMNASIUM. Performance issues. Migration to Rails 4 (5?). Code boundaries. [PL] Gemnasium is a service to monitor dependencies and reduce technical debt. [TODO]
  8. FIRST PROJECT: the badge server, because… • Performance issues • A lot of traffic • Easy to extract [JP] It was the smallest part to extract with a big impact on performance, and a good candidate for experimenting with microservices (very few interactions with the rest of the code base).
  9. CHOOSING TOOLS. We tried a lot of hosted PaaS. We needed something solid and well maintained. Choice: OpenShift. [PL] Tested: Deis, Flynn, Docker Compose + Swarm; not tested: Mesos. OpenShift is based on Docker and Kubernetes, and was chosen because it fit all our needs and requirements.
  10. CRITERIA FOR ARCHITECTURE [PL] A new architecture is not only a new set of tools, it’s revisiting the whole production line: procedures, libraries, tools, infrastructure, deployments.
  11. DEPLOYMENT [JP] From semi-automated deploys with Capistrano to continuous deployment. How do we roll back in case of failure? * A critical piece of the new procedures * It wasn’t possible with docker-compose + Swarm, at least not without maintaining a lot of scripts. Reduce downtime as much as possible during deploys * OpenShift (Kubernetes) has a smart way to load balance traffic across containers, with absolutely zero downtime.
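
That zero-downtime routing works because the platform only sends traffic to containers that report themselves ready. As a minimal sketch in plain Go (the /healthz path and the readiness flag are our own illustrative choices, not something specified in the talk):

```go
// Minimal sketch of a readiness endpoint for zero-downtime rolling deploys,
// assuming the platform (OpenShift/Kubernetes here) only routes traffic to
// containers whose probe succeeds. The /healthz path is our own choice.
package main

import (
	"log"
	"net/http"
	"sync/atomic"
)

var ready atomic.Bool // flipped to true once the service has finished booting

func healthzHandler(w http.ResponseWriter, r *http.Request) {
	if !ready.Load() {
		http.Error(w, "not ready", http.StatusServiceUnavailable)
		return
	}
	w.Write([]byte("ok"))
}

func main() {
	http.HandleFunc("/healthz", healthzHandler)

	// ... connect to the database, warm caches, etc., then:
	ready.Store(true)

	log.Fatal(http.ListenAndServe(":8080", nil))
}
```
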
  12. SCALING [PL] * Horizontal or vertical? * Vertical only: not robust enough, because nodes can fail * Automatic? * OpenShift has great load balancing features, and can automatically scale the app (with OpenShift metrics)
  13. LOGGING [JP] * Must be centralized * Must store logs for a long period * Must not lose logs between redeploys
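
Since containers are replaced on every redeploy, logs cannot stay on the container filesystem. A common way to meet these requirements (assumed here for illustration; the deck does not show code) is to write structured lines to stdout and let the platform ship them to the central aggregator:

```go
// Sketch of structured logging to stdout, assuming the platform collects
// container output and forwards it to a central aggregator (Graylog in this
// deck). The field names and service name are illustrative.
package main

import (
	"encoding/json"
	"os"
	"time"
)

type logEntry struct {
	Time    time.Time `json:"time"`
	Level   string    `json:"level"`
	Service string    `json:"service"`
	Message string    `json:"message"`
}

func logInfo(service, msg string) {
	json.NewEncoder(os.Stdout).Encode(logEntry{
		Time:    time.Now().UTC(),
		Level:   "info",
		Service: service,
		Message: msg,
	})
}

func main() {
	logInfo("badge-server", "starting up")
}
```
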
  14. BACKUPS [PL] * Not only back up, but also restore data in case of emergency * Must rewrite procedures accordingly * Ensure data is correctly backed up * OpenShift/k8s uses “Persistent Volumes” on GlusterFS (a distributed filesystem)
  15. MONITORING [JP] * Detect service failure * Restart * Metrics from the infrastructure (CPU, RAM, I/O, …) * Integration with our existing monitoring systems (OMD + New Relic) * OpenShift has an internal metrics system to monitor services and take action if necessary
  16. DOCUMENTATION [PL] * Complete * Up to date * Clear * Not a requirement at first, but became one after experimenting * OpenShift has GREAT documentation, very complete (written by Red Hat employees)
  17. NO FRAMEWORK! Just plain old Go. [PL] Lots of frameworks are released every week. We prefer to keep it simple and stay with vanilla Go; no need for a framework. We like to separate the infrastructure configuration from our business logic code. Every service is independent, and can start without a registry service (but still needs a DB, etc.)
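
As a rough idea of what “just plain old Go” means in practice, a small service can be written entirely with the standard library, keeping the handler (business logic) apart from the server wiring (infrastructure). The /projects endpoint and Project type below are purely illustrative:

```go
// "Plain old Go": the standard library is enough for a small HTTP service.
// The handler (business logic) stays separate from the server wiring
// (infrastructure); the /projects endpoint is purely illustrative.
package main

import (
	"encoding/json"
	"log"
	"net/http"
)

type Project struct {
	Name   string `json:"name"`
	Status string `json:"status"`
}

// listProjects is business logic: it knows nothing about ports or deployment.
func listProjects(w http.ResponseWriter, r *http.Request) {
	projects := []Project{{Name: "demo", Status: "up-to-date"}}
	w.Header().Set("Content-Type", "application/json")
	json.NewEncoder(w).Encode(projects)
}

func main() {
	mux := http.NewServeMux()
	mux.HandleFunc("/projects", listProjects)
	log.Fatal(http.ListenAndServe(":8080", mux))
}
```
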
  18. HOW TO CONSUME IT? HTTP? Queueing? Protobufs? [JP] Mostly HTTP and queueing in production. Protobufs are a good option, but it’s too early for us to use them.
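
For the HTTP case, consuming another service is again just the standard library; this hedged sketch uses a hypothetical projects-service URL and adds a timeout so a slow dependency cannot hang the caller:

```go
// Sketch of one service consuming another over HTTP. The URL and payload
// shape are hypothetical; the timeout keeps a slow dependency from hanging us.
package main

import (
	"encoding/json"
	"fmt"
	"log"
	"net/http"
	"time"
)

type Project struct {
	Name   string `json:"name"`
	Status string `json:"status"`
}

func main() {
	client := &http.Client{Timeout: 5 * time.Second}

	resp, err := client.Get("http://projects-service:8080/projects")
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()

	var projects []Project
	if err := json.NewDecoder(resp.Body).Decode(&projects); err != nil {
		log.Fatal(err)
	}
	fmt.Println("fetched", len(projects), "projects")
}
```
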
  19. UPGRADING WITHOUT DOWNTIME. Heard of zero-downtime database migrations? Same idea: a defined & documented protocol that supports versions. [JP] I/O must be defined by a common protocol. We try to avoid multiple versions of the protocol: multiple versions also means more code to maintain, sometimes just to avoid a small downtime. Not always worth the extra work.
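
One simple way to make such a protocol explicit (an illustration, not necessarily what Gemnasium does) is to have clients state the protocol version they speak and have the server reject versions it no longer supports, so old and new instances can briefly overlap during a deploy:

```go
// Sketch of a versioned HTTP protocol: the client states the version it
// speaks and the server rejects versions it no longer supports. The header
// name and supported set are illustrative, not Gemnasium's actual protocol.
package main

import (
	"log"
	"net/http"
)

var supportedVersions = map[string]bool{"2": true} // only v2 is still maintained

func versioned(next http.HandlerFunc) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		v := r.Header.Get("X-Protocol-Version")
		if !supportedVersions[v] {
			http.Error(w, "unsupported protocol version: "+v, http.StatusBadRequest)
			return
		}
		next(w, r)
	}
}

func main() {
	http.HandleFunc("/projects", versioned(func(w http.ResponseWriter, r *http.Request) {
		w.Write([]byte(`[]`))
	}))
	log.Fatal(http.ListenAndServe(":8080", nil))
}
```
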
  20. CONFIGURATION. Environment variables. Command-line arguments. Mounts outside of Docker for secrets. [PL] We follow the 12-factor methodology for our services. All configuration is passed using env vars. The network info (addresses, ports) is passed automatically by OpenShift to each container. No need for a central registry. Every service is a command-line tool and a web server (by default). Some options are passed in the CMD of the container. Sensitive information, like DB credentials, is passed using mounted secret volumes. Credentials don’t appear in the container inspect info. Secret volumes are shared between services, like the SMTP config. Each service has dedicated PG credentials.
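
A hedged sketch of the three channels listed on the slide: env vars for ordinary settings, command-line flags for options set in the container's CMD, and a file mounted from a secret volume for credentials. The variable names and the /etc/secrets path are illustrative, not the actual Gemnasium setup:

```go
// Sketch of the three configuration channels from the slide: env vars,
// command-line flags, and a mounted secret volume. Names and paths are
// illustrative.
package main

import (
	"flag"
	"log"
	"os"
	"strings"
)

func main() {
	// 1. Environment variables (12-factor style).
	listenAddr := os.Getenv("LISTEN_ADDR")
	if listenAddr == "" {
		listenAddr = ":8080"
	}

	// 2. Command-line arguments, e.g. set in the container's CMD.
	verbose := flag.Bool("verbose", false, "enable verbose logging")
	flag.Parse()

	// 3. Secrets mounted as files, so credentials never show up in the
	//    environment or in `docker inspect`.
	raw, err := os.ReadFile("/etc/secrets/db_password")
	if err != nil {
		log.Fatalf("reading db password: %v", err)
	}
	dbPassword := strings.TrimSpace(string(raw))

	log.Printf("listening on %s (verbose=%v, password length=%d)",
		listenAddr, *verbose, len(dbPassword))
}
```
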
  21. TESTS. Test each microservice like any project. Have high-level end-to-end testing. [JP] Each service has unit tests, like any other project. E2E tests are more complex to achieve. Testing an API with Go can sometimes be painful without a framework.
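
For the unit-test part, the standard net/http/httptest package is usually enough without any framework; a minimal sketch (the handler is inlined here just to keep the example self-contained):

```go
// Sketch of a handler unit test with the standard net/http/httptest package.
package main

import (
	"net/http"
	"net/http/httptest"
	"testing"
)

func TestListProjects(t *testing.T) {
	// Stand-in for the handler under test (defined elsewhere in the package
	// in a real service; inlined here to keep the sketch self-contained).
	listProjects := func(w http.ResponseWriter, r *http.Request) {
		w.Header().Set("Content-Type", "application/json")
		w.Write([]byte(`[{"name":"demo","status":"up-to-date"}]`))
	}

	req := httptest.NewRequest(http.MethodGet, "/projects", nil)
	rec := httptest.NewRecorder()

	listProjects(rec, req)

	if rec.Code != http.StatusOK {
		t.Fatalf("expected 200, got %d", rec.Code)
	}
	if ct := rec.Header().Get("Content-Type"); ct != "application/json" {
		t.Fatalf("expected JSON response, got %q", ct)
	}
}
```
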
  22. HOW WILL IT SCALE? For us, nothing specific. Just keep in mind that many instances can co-exist. [JP] Nothing specific in the code, because we don’t write files to the filesystem. All services can run several times behind a load balancer.
  23. EMBRACE FAILURE. Things will break one day. Retry (Hystrix, from Netflix). Double check. [PL] Even if k8s is smart enough not to send traffic to an unhealthy pod, things can quickly become unstable and unavailable. We need to anticipate failure in every workflow: - Idempotent tasks (i.e. do not insert twice if a task runs twice, or do not send the same email twice, for example) - Check writes (DB, NSQ, etc.) - Retry - Hot retry (inside the running instance, with exponential backoff) - Cold retry (the instance can be killed, and the backoff with it, so retry every task that should already be finished) Advice: keep it simple at the beginning!
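
The “hot retry” idea (retrying inside the running instance with exponential backoff) fits in a few lines of plain Go. This is a generic sketch, not Gemnasium's code, and the Hystrix-style circuit breaking mentioned on the slide is a separate, heavier pattern not shown here:

```go
// Sketch of a "hot retry" with exponential backoff: retry inside the running
// instance, then give up and let a "cold retry" (re-enqueueing the task)
// take over. The operation must be idempotent for this to be safe.
package main

import (
	"errors"
	"fmt"
	"time"
)

func retry(attempts int, initial time.Duration, op func() error) error {
	delay := initial
	for i := 0; i < attempts; i++ {
		if err := op(); err == nil {
			return nil
		}
		if i < attempts-1 {
			time.Sleep(delay)
			delay *= 2 // exponential backoff
		}
	}
	return errors.New("all attempts failed")
}

func main() {
	calls := 0
	err := retry(5, 100*time.Millisecond, func() error {
		calls++
		if calls < 3 {
			return errors.New("temporary failure") // e.g. a flaky HTTP call
		}
		return nil
	})
	fmt.Println("calls:", calls, "err:", err)
}
```
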
  24. MONITORING & ALERTING. Metrics, metrics, metrics. Librato. Graylog. [JP] More and more metrics will be needed when debugging, or just monitoring. Examples: number of signups, number of failures during project sync, number of notifications (by channel), number of API calls to a provider like Google Analytics. Librato is a good and simple (SaaS) tool for custom metrics, with alerts and dashboards. [PL] We’re using Graylog because we need a tool inside our network (and logs shouldn’t be elsewhere). Graylog is an open-source and free log aggregator with lots of features (dashboards, alerts, search, graphs, etc.). We use Graylog for alerts based on the number of occurrences, and Airbrake for the first error.
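
The deck does not show the Librato or Graylog integration itself; as a generic illustration of exposing such counters from a Go service, the standard expvar package can publish them over HTTP for any collector to pick up (the metric names here are made up):

```go
// Generic sketch of exposing simple counters (signups, sync failures, ...)
// with the standard expvar package; an illustration only, not the deck's
// actual Librato/Graylog integration.
package main

import (
	"expvar"
	"log"
	"net/http"
)

var (
	signups      = expvar.NewInt("signups_total")
	syncFailures = expvar.NewInt("project_sync_failures_total")
)

func signupHandler(w http.ResponseWriter, r *http.Request) {
	// ... create the account, then count it:
	signups.Add(1)
	w.WriteHeader(http.StatusCreated)
}

func main() {
	http.HandleFunc("/signup", signupHandler)
	// Importing expvar registers /debug/vars on the default mux,
	// exposing all counters as JSON.
	log.Fatal(http.ListenAndServe(":8080", nil))
}
```
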
  25. PROS & CONS • Better security • Easier evolution • Targeted scaling • Easier onboarding & maintenance • Harder deployment • Added failure management • Longer to develop • Added latency [PL] Pros [JP] Cons - Harder deployment: which service to deploy first? The order of deployment is important.
  26. START WITH A MONOLITH [JP] Easier to start with, and then face problems as they come.
  27. THANKS! QUESTIONS? Hire JP: http://jipiboily.com. Metrics Watch: http://metrics.watch | freeGoogleAnalyticsCourse.com. Use Gemnasium: gemnasium.com or enterprise.gemnasium.com