DevoxxFR 2021 - Systematic error management in application

[email protected] @fanf42 Systematic error management in application To make them
useful #DevoxxFR

Hi! devops automation/compliance app manage ten of thousands computers 2
François ARMAND CTO Founder Free Software Company “Stay Up”

Hi! devops automation/compliance app manage ten of thousands computers 3
François ARMAND CTO Founder Free Software Company “Stay Up” Developer

Developer ? • Model the world into code ◦ Try
to make it useful 4

to make it useful • Nominal case necessary (of course) 5

to make it useful • Nominal case necessary (of course) • But not suﬃcient (models are false) ◦ Bugs ◦ Misunderstanding of needs ◦ Open world 6

This talk systematic management of errors 7 (with the help
of types, functional programing… And well, systems)

This talk systematic management of errors • I’m a scala
dev, mainly ▪ expect Scala terminologie (ask if unclear!) ▪ statically typed language with sum types, interfaces • examples use ZIO - https://zio.dev 8

This talk • Scala pure functional programming framework • manage
concurrency, asynchronicity, resources, errors, ... • state of the art "principled effect management for everyone" • made with non-FP developers in mind (Java, etc) • Smells like Spring framework in 2006 ◦ opinionated core framework + domain oriented projects ◦ tries to tackles hard problem of the time ◦ the "80% of dev" in mind 9 ?

10 Not so popular opinions - 4 Hills I would
die on -

Our work as developers is to discover and assess failure
modes 11 Not so popular opinion 1/4

It’s YOUR work to choose the SEMANTIC between nominal case
and error and KEEP your PROMISES Not so popular opinion 2/4 12

ERRORS are a SOCIAL construction to give AGENCY to the
receiver of the error 13 Not so popular opinion 3/4

An application has always at least 3 kinds of users:
users ; ops ; and dev. Don’t forget any. 14 Not so popular opinion 4/4

15 4 principles: • Assess failure modes • You are
responsible to keep promises made. • Give agency to your users • and don’t forget any of them. In that talk, we are looking for interaction between things (APIs, not internal logic, tests, etc)

16 You are responsible to keep promises made. Don't lie
in your code, model with types I. Systematic error management - at micro scale, in code - at macro scale, in systems errors are a signal for users, ops, dev Assess failure modes Give agency to your users and don’t forget any of them. II. III. IV.

I 17 Assess errors Discover where API lies. Understand your
model. Assess its limits.

18 How to make contract WYSIWIG just with a naively
descriptive signature I.1

getUserFromDB(id: String): User 19 Assess failure mode: Don’t lie!

getUserFromDB(id: String): User 20 Assess failure mode: Don’t lie! Where
are the lies?

👍 rules of thumb: be naively explicit contract: • structure
inputs • enumerate outputs • no hidden constraint, dependency, or side effects getUserFromDB(id: String): User 21 Where are the lies? Assess failure mode: Don’t lie!

getUserFromDB(id: UserId): IO[RudderError, Option[User]] 22 In Rudder, we write it
like that! Assess failure mode: make your contract WYSIWYG Yep, longer. But naively explicit. Let's see why, step by step

23 Assess failure mode: WYSIWYG contract 1: structured data types
getUserFromDB(id: String): User

24 Assess failure mode: WYSIWYG contract 1: structured data types
• we don't get user by any string, but by ID. • Don't lie in your code. getUserFromDB(id: String): User

getUserFromDB(id: String): User 25 Assess failure mode: WYSIWYG contract 1:
structured data types • we don't get user by any string, but by ID. • Don't lie in your code • make your contract WYSIWYG getUserFromDB(id: UserID): User "I give you an user by it's ID, no way I will succeed if you give a random sentence".

26 Assess failure mode: WYSIWYG contract 2: total function getUserFromDB(id:
UserID): User

27 Assess failure mode: WYSIWYG contract 2: total function •
for some valid id, there's no user. • function is not total. It's always a problem. getUserFromDB(id: UserID): User

28 Assess failure mode: WYSIWYG contract 2: total function getUserFromDB(id:
UserID): User "I give you an user by it's ID only if it exists: sometime you will have to deal with nobody" getUserFromDB(id: UserID): Option[User] • for some valid id, there's no user. • function is not total. It's always a problem. • don't lie on your part of the contract:

29 Assess failure mode: WYSIWYG contract 3: control side effects
getUserFromDB(id: UserID): Option[User]

• sometimes, the environment fails • side effects are always an hidden error waiting to happen getUserFromDB(id: UserID): Option[User]

• sometimes, the environment fails • side effects are always an hidden error waiting to happen • make explicit that sometime, thing fails "I give you an user by it's ID if it exists and nothing fails, else YOU deal with the error." getUserFromDB(id: UserID): Option[User]

• sometimes, the environment fails • side effects are always an hidden error waiting to happen • make explicit that sometime, thing fails "I give you an user by it's ID if it exists and nothing fails, else YOU deal with the error." getUserFromDB(id: UserID): Option[User] getUserFromDB(id: UserId): IO[RudderError, Option[User]]

getUserFromDB(id: UserId): IO[RudderError, Option[User]] 33 Assess failure mode: make your
contract WYSIWYG • Ask yourself: "what are all the cases I have no idea how to deal with ?" • Model with types. • Assess failure mode by making your code contract WYSIWYG.

I.2 34 models are false by construction What are nominal
case, errors, defects ?

Model? Systems? 35 Code is a model of interacting systems.
Interaction can be expected (nominal case) or not (error) How do you decide what is what? getUserFromDB(id: UserId): IO[RudderError, Option[User]] • why is "no user for that id" a nominal case ?

Model? Systems? 36 Code is a model of interacting systems.
Interaction can be expected (nominal case) or not (error) How do you decide what is what? getUserFromDB(id: UserId): IO[RudderError, Option[User]] • why is "no user for that id" a nominal case ? It depends of the system !

Systems? 37 A school is a system

Systems? 38 ◦ BOUNDED group of things ◦ with a
NAME Interacting ◦ with others systems A school is a system

System interactions: nominal cases, non nominal cases Expected interaction or
error ? 39 ◦ Play marble: ◦ win or loose => both nominal cases ◦ marble broke => likely an error ◦ game interrupted => not sure ? A school is a system

Nominal cases vs Errors 40 getUserFromDB(id: UserId): IO[RudderError, Option[User]] ▪
why is "no user for that id" a nominal case ? It depends of the system !

Nominal cases vs Errors 41 getUserFromDB(id: UserId): IO[RudderError, Option[User]] ▪
why is "no user for that id" a nominal case ? It depends of the system ! You, the developer, decide what is nominal.

Nominal cases vs Errors 42 Nominal cases • expected output
NOT ONLY the "good one"! "the game can be lost or won" • reﬂected in types with enumeration Errors • expected non-nominal case "a teacher interrupted the game" • reﬂected in type with an error type

Nominal cases vs Errors 43 Nominal cases • expected output
NOT ONLY the "good one"! "the game can be lost or won" • reflected in types with enumeration Errors • expected non-nominal case "a teacher interrupted the game" • reflected in type with an error type Everything reflected in types?

Model everything? 44 getUserFromDB(id: UserId): IO[RudderError, Option[User]]

Model everything? 45 java.lang.SecurityException? (jvm permission to access network) getUserFromDB(id:
UserId): IO[RudderError, Option[User]]

Model everything? 46 ⟹ where do you put the limit?
getUserFromDB(id: UserId): IO[RudderError, Option[User]] java.lang.SecurityException? (jvm permission to access network)

Systems have horizon. 47 ◦ nothing exists beyond horizon

Systems have horizon. Horrors lie beyond. 48 ◦ nothing exists
beyond horizon ◦ Like with Lovecraft: if something from beyond interact with a system, the system becomes inconsistent

Errors vs Defects 49 Errors • expected non nominal case
• reflected in types • signal for users • social construction: you propose alternatives or error Defects • unexpected case: by definition, application is in an unknown state • not reflected in types • only choice is to stop as cleanly as possible (coredump)

Nominal vs Errors vs Defects 50 Errors • expected (modeled)
non nominal cases • reflected in types with error channel Defects • non expected (out of model) cases • not reflected in types Nominal cases • expected (modeled) nominal cases • reflected in types output with enumeration

non nominal cases • reflected in types with error channel Defects • non expected (out of model) cases • not reflected in types Nominal cases • expected (modeled) nominal cases • reflected in types output with enumeration But who choose what is what?

non nominal cases • reflected in types with error channel Defects • non expected (out of model) cases • not reflected in types Nominal cases • expected (modeled) nominal cases • reflected in types output with enumeration But who choose what is what? YOU

Horizon limit is your choice - by deﬁnition 53 java.lang.SecurityException?

54 java.lang.SecurityException? execScript(js: String): IOResult[String] In Rudder, we have a
JS engine (JS from users): Horizon limit is your choice - by deﬁnition

JS engine (JS from users): ⟹ SecurityException is an expected error case here Horizon limit is your choice - by deﬁnition

JS engine (JS from users): ⟹ SecurityException is an expected error case here … but nowhere else in Rudder. By our choice. Horizon limit is your choice - by deﬁnition

Your code should paint a clean and understandable model: •
Don't lie in your code: ◦ explicit data structure ◦ impossible state unrepresentable ◦ totale function ◦ control side effects • model code as systems with: ◦ nominal cases (enumeration) ◦ errors (either success "or") ◦ defects (out of model) 57 I. Take away: WYSIWYG-ify your code with data structures

II 58 Systematic error management Code: get the help of
the compiler Parse, don't validate Effects as ﬁrst class citizens Dedicated error channels

Systematic error management ? 59 Like ? Check error at
each lines ?

Systematic error management ? 60 kubernetes/pkg/controller/daemon/daemon_controller.go

Systematic error management ? 61 kubernetes/pkg/controller/daemon/daemon_controller.go It's REALLY good error
management

Systematic error management ? 62 kubernetes/pkg/controller/daemon/daemon_controller.go • 23 lines •
13 for error control • nothing automated: only rely on developer diligence It's REALLY good error management, but:

63 Systematic error management ? SEEMS EXTREMELY PAINFUL AND ERROR
PRONE kubernetes/pkg/controller/daemon/daemon_controller.go

64 • We did progress in the last 25 years
• Let the compiler do the job for you • two tools: ◦ parse, don't validate ▪ make unrepresentable state impossibles ◦ dedicated error channel ▪ effect as ﬁrst class citizens Systematic error management ?

II.1 65 Parse, don't validate Make your code more and
more precise and simple

Prevention 66 Make impossible states unrepresentable Reﬁne iteratively getUserFromDB(id: String):
User • not all strings are a valid user id. ◦ parse only one time at the edge ◦ let all your code know about it with a dedicated data structure

Prevention 67 Make impossible states unrepresentable Reﬁne iteratively getUserFromDB(id: String):
User • not all strings are a valid user id. ◦ parse only one time at the edge ◦ let all your code know about it with a dedicated data structure getUserFromDB(id: UserID): User

Prevention 68 https://lexi-lambda.github.io/blog/2019/11/05/parse-don-t-validate/ Make impossible states unrepresentable Reﬁne iteratively

Example - real code from Rudder 69 • Need to
parse license for plugin validation

parse license for plugin validation • we described all possible cases ◦ from unstructured (a binary blob) ◦ to checked license

parse license for plugin validation • we described all possible cases • then, iteratively reﬁne case

II.2 72 let the compiler helps you Effects as value
with explicit error channel

73 • Use the type system to automate classiﬁcation of
errors? Effect as ﬁrst class citizens - effect system

74 A type system is a tractable syntactic method for
proving the absence of certain program behaviors by classifying phrases according to the kinds of values they compute. Benjamin Pierce • Use the type system to automate classiﬁcation of errors? Let the compiler helps you

75 A type system is a tractable syntactic method for
proving the absence of certain program behaviors by classifying phrases according to the kinds of values they compute. Benjamin Pierce Let the compiler helps you By deﬁnition, a type system automatically categorize results ⟹ need for a dedicated error chanel + a common error trait

76 trait MyAppError // common properties of errors type PureResult[A]
= Either[MyAppError, A] def divide(a: Int, b: Int): PureResult[Int] Let the compiler helps you A type system is a tractable syntactic method for proving the absence of certain program behaviors by classifying phrases according to the kinds of values they compute. Benjamin Pierce By deﬁnition, a type system automatically categorize results ⟹ need for a dedicated error chanel + a common error trait

77 Effect as ﬁrst class citizens - effect system •
data structure to capture side effects with a dedicated error channel ? Same for effectful functions!

data structure to capture side effects with a dedicated error channel ? • this problem is hard. • (but we have a 25-years old solution in prod in thousands of big shop) Same for effectful functions!

solution: effects as ﬁrst class value (I will let you dig into effect systems, monads, referential transparency, etc - for now: let's just say we have tech so that:) getUserFromDB(id: UserId): IO[RudderError, Option[User]] • "IO" here means: your side effecting function is now pure, with an error channel.

80 • With a dedicated error channel ◦ ~ Either[E,
A] for pure code, ◦ else ~ IO[E, A] for effect management • and parent trait for common error properties… • we get automatic categorization of errors by compiler Let the compiler helps you

81 Let the compiler helps you • Remember the example
with reﬁned inputs? • error in any line stop the process • error is automatically returned ◦ no boilerplate • effects are values - possibility to augment error: ◦ aggregate similar cases, ◦ add your own control structure, ◦ decompose as you need

Systematic error management ? 82 kubernetes/pkg/controller/daemon/daemon_controller.go • 23 lines •
13 for error control • nothing automated: only rely on developer diligence It's REALLY good error management, but:

83 Systematic error management ? • 15 lines • 2
for errors (for) • error management automated: developer focus on nominal case It's REALLY good error management, and: daemon_controller.scala - idea about how it could be • automatic error management ! • getPod can be moved around without fear of unhandled error • catchAll is a built-in combinator of ZIO • notOptional() is a one-liner self-made combinator • contextualizeError() is an other

Compilers are now very potent, they can help you systematically
assess properties: • Parse, don't validate ◦ precise parameter types ◦ iteratively reﬁne from unstructured to structured data • Effects as values with dedicated error channel ◦ unbloat your error management ◦ let the compiler check it ◦ build your own combinators 84 II. Take away: Use types for automatic help from compiler

III 85 Systematic error management Macro: use systems to materialize
promises Program to strict interfaces and protocols

86 A bit more about systems We don't have a
compiler everywhere to help us.

We don't have a compiler everywhere to help us. Then,
we have system analysis. 87 A bit more about systems

Need for a systematic approach to error management 88 ◦
BOUNDED group of things ◦ with a NAME Interacting ◦ with others systems A school of systems A bit more about systems

A bit more about systems Need for a systematic approach
to error management 89 ◦ BOUNDED group of things ◦ with a NAME Interacting ◦ via INTERFACES ◦ by a PROTOCOL with other systems ◦ And PROMISING to have a behavior A school of systems

A bit more about systems 90 Systematic error management possible
with clear deﬁnition of consistent sub-system in interaction. Find out: • interfaces, protocoles, promises Write down expectations: • nominal cases, errors, out of model Look for consistency in: • lifecycle, constraints, actors, locations, dependencies, maturity, ...

Example? 91 Typical web application.

Example? 92 Typical web application. How to keep contradictory promises?
Promises to third parties about REST behaviour Promises to business and developers about code manageability

Make promises, Keep them 93 • systems allow to bound
responsibilities Look for consistency in: • lifecycle • actors

responsibilities

responsibilities Business Core sub-system: • own ADT / logic (mostly pure) • lifecycle bounded to developers understanding of needs (rapid changes)

responsibilities Business Core sub-system: • own ADT / logic (mostly pure) • lifecycle bounded to developers understanding of needs (rapid changes) Pattern: “A pure heart (core) surrounded by side effects”* * excuse my french

responsibilities Users of the API want stability and to know what errors can happen Business Core sub-system: • own ADT / logic (mostly pure) • lifecycle bounded to developers understanding of needs (rapid changes)

responsibilities Business Core sub-system: • own ADT / logic (mostly pure) • lifecycle bounded to developers understanding of needs (rapid changes) REST sub-system : • own ADT / logic (mostly effects) • lifecycle bounded to REST contract: strict versioning, changes are breaking changes Users of the API want stability and to know what errors can happen

responsibilities Business Core sub-system: • own ADT / logic (mostly pure) • lifecycle bounded to developers understanding of needs (rapid changes) REST sub-system : • own ADT / logic (mostly effects) • lifecycle bounded to REST contract: strict versioning, changes are breaking changes Stable API : interface, strict protocol & promises (nominal cases + errors) Users of the API have agency (able to react eﬃciently)

responsibilities Business Core sub-system: • own ADT / logic (mostly pure) • lifecycle bounded to developers understanding of needs (rapid changes) REST sub-system : • own ADT / logic (mostly effects) • lifecycle bounded to REST contract: strict versioning, changes are breaking changes Stable API : interface, strict protocol & promises (nominal cases + errors) Users of the API have agency (able to react eﬃciently) Translation between sub-systems: API: interface, protocol & promises!

Make promises, Keep them 101 • discover sub systems and
their limits ◦ explore how components are coupled ◦ find or create loosely coupled sub sustems • find nominal case and error, translate them between sub-systems ◦ make errors relevant to their users • It’s a model, it’s false - but useful ◦ there is NO definitive answer. ◦ discuss, share, iterate • the bigger the promises, the stricter the API

Your code, the IT on which it runs are interactive
systems: • look for their perimeters ◦ program to interface with protocols • understand life cycle ◦ parse at the edge ◦ core business need purity for rapid iteration ◦ use adapter subsystem to manage contradictory promises 102 III. Take away: look for system limits and contracts

IV. 103 Errors are a social construction to give agency
to dev, ops, users

104 It’s a signal

105 It’s a signal The only goal of an error
is to be analyzed by someone who will have to deal with the problem. Make that person* life easier. * it could be you. In the middle of the night.

• Don’t assume what’s obvious • It’s an open world
out there • Don’t force users to revert-engineer possible cases 106 It’s a signal make it unambiguous

Checked exceptions are a good signal for users 107 Unpopular
opinion (sure)

Checked exceptions are a good signal for users Are they
? 108 Unpopular opinion (sure)

• exceptions* are often a pile of useless ambiguity ◦
Error ? Fatal error ? Checked ? Unchecked ? ◦ most exceptions are just a message ◦ … or hidden behind a generic throws Exception • signal must be unambiguous and actionable 109 It’s a signal make it unambiguous * NPE anyones?

Error ? Fatal error ? Checked ? Unchecked ? ◦ most exceptions are just a message ◦ … or hidden behind a generic throws Exception • signal must be unambiguous and actionable 110 It’s a signal make it unambiguous ➢ be precise with your contracts and errors

Error ? Fatal error ? Checked ? Unchecked ? ◦ most exceptions are just a message ◦ … or hidden behind a generic throws Exception • signal must be unambiguous and actionable 111 It’s a signal make it unambiguous ➢ think "who will react to that case ?" User, ops or dev? ➢ be precise with your contracts and errors

• It's OK to not know how to deal with
a case at some point • give agency* to deal with it at the right time * capacity to inﬂuence environment 112 It’s a signal make it unambiguous give agency

113 • 👍 Rule of thumb ◦ app/service user: can
inﬂuence inputs. Be precise with your function parameters. ◦ ops: concerned with environment and system interaction. Likely what is in the error channel. ◦ developers: make model hypothesis, contract and limit unambiguous. On defect, core dump info. It’s a signal make it unambiguous give agency users ops dev

114 IV.Take away: give agency to dev, ops, users with
clear signals

115 Not so popular opinion 5/4 If it's NOT on
the path of least resistance, it won't be done consistently

What’s missing for good error management in code ? 116

What’s missing for good error management in code ? •
exceptions or Go errors are A PAIN to deal with ◦ nothing is automatable ▪ no help from compiler, no tooling, no inference, nothing ◦ no composition ▪ composition: • ability to build solutions to more complex problems from solutions to simpler ones. • and provably be sure that all properties checked in the small are kept in the result. ▪ loose referential transparency* 117 * the single biggest win regarding code comprehension

Make it a joy! 118 • fearless refactoring: focus on
domain logic, deconstruct problems as you need ◦ automatic error management ◦ composition (referential transparency…) • boilerplate free, makes the code extremely readable ◦ able to add all the combinators we need! ◦ it’s cheap and can ﬁt your domains • give access to higher level compositional tools ! ◦ ex: automatically manage resources in ALL cases ◦ simple async & concurrent structure: queues, etc Today, we have the tooling to make managing error enjoyable!

119 Let correct error management be the path of least
resistance. Make it a joy.

Question? Contact me / Chat with me! https://twitter.com/fanf42 https://github.com/fanf irc/freenode:
fanf @fanf42:matrix.org [email protected] 120 Ressources ◦ Error management: future vs ZIO A much more detailed presentation of ZIO error management capabilities https://www.slideshare.net/jdegoes/error-management-future-vs-zio ◦ Understand Things As Interacting Systems More insights on systems. https://medium.com/@fanf42/understand-things-as-interacting-systems-b273bdba5dec ◦ Parse, don't validate ! the reference article about making impossible state unrepresentable and getting help from the compiler https://lexi-lambda.github.io/blog/2019/11/05/parse-don-t-validate/ ◦ Effect tracking is commercially worthless https://degoes.net/articles/no-effect-tracking ◦ Stay Up! Journey of a Free Software Company. One decade in search for a sustainable model https://medium.com/@fanf42/stay-up-5b780511109d Images ◦ scientists checking logs: https://www.quantamagazine.org/hope-rekindled-for-abc-proof-20151221 ◦ mountains: Game arts for Forest of Liars https://imgur.com/t/gaming/zOcPpG1

121 Don't lie in your code, model with types: -
explicit data types - total and pure functions - knows your limits: Defects vs Errors Systematic error management: - At micro scale, in code: parse, don't validate; use dedicated error channel - At macro scale, in systems: program to strict interfaces and protocols errors are a signal for users, ops, dev: - users: agency to understand nominal case - ops: agency to correct errors - dev: agency to model deliberately (with joy) You are responsible to keep promises made. I. Assess failure modes Give agency to your users and don’t forget any of them. II. III. IV . Make it extremely convenient V.

Full example - real code from Rudder 122 • inference
just works • each sub-system add relevant information • simple combinators (in white) used as syntax sugar (None, msg) => Unexpected(msg) PureResult[A] => IOResult[A] (err: RudderError[A], msg) => Chained(msg, err) error contextualisation between systems

• What about making impossible state unrepresentable from the beginning?
◦ That’s a very good point and you should ALWAYS try to do so. The idea is to change method’s domain definition (ie, the parameter’s shape) to only work on inputs that can’t rise errors. Typically, in my trivial “divide” example, we should have use “non zero integer” for denominator input. ◦ Alexis King (@lexy_lambda) wrote a wonderful article on that, so just go read it, she explains it better than I can: “Parse, don’t validate” https://lexi-lambda.github.io/blog/2019/11/05/parse-don-t-validate/ ◦ We use that technique a lot in Rudder to drive understanding of what is possible. Each time we can restrict domain definition, we try to keep that information for latter use. ◦ Typical example: parsing plugin license (we have 4 “xxxLicenses” classes depending what we now about its state); Validating user policy (again several “SomethingPolicyDraft” with different hypothesis needed to build the “Something”). ◦ the general goal is the same than with error management: assess failure mode, give agency to users to react efficiently. ◦ There’s still plenty of cases where that technique is hard to use (fluzzy business cases…) or not what you are looking for (you just want to tell users that something is the nominal case, or not, and give them agency to react accordingly). Some questions asked after the talk 123

Some questions asked after the talk 124 • Is SystemError
used to catch / materialize failure ? ◦ no, SystemError is here to translate Error that need to be dealts with (like connection error to DB, FS related problem, etc) but are encoded in Java with an Exception. SystemError is not used to catch Java “OutOfMemoryError”. These exception kills Rudder. We use the JVM Thread.setDefaultUncaughtExceptionHandler to try to give more information to dev/ops and clean things before killing the app.

Some questions asked after the talk 125 • You have
only one parent type for errors. Don’t you lose a lot of details with all special errors in subsystems losing the specificities when they are seen as RudderError? ◦ this is a very pertinent question, and we spend a log of time pondering between the current design and one where all sub-systems would have their own error type (with no common super type). In the end, we settled on the current design because: ▪ no common super type means no automatic inference. You need to guide it with transformer, and even if ZIO provide tooling to map errors, that means a lot of useless boilerplate that pollute the readability of your code. ▪ there is common tooling that you really want to have in all errors (Chained, SystemError, but also “notOptional”, etc). You don’t want to rewrite them. Yes type class could be a solution, but you still have to write them, for no clear gain here. ▪ you are fighting the automatic categorization done by the compiler in place of leveraging it. ▪ The gain (detailed error) is actually almost never needed. When we switched to “only one super class for all error”, we saw that “Chained” is sufficient to deals with general trans-system cases, and in some very, very rare cases, you case build ad-hoc combinators when needed, it’s cheap. ◦ So all in all, the wins in convenience and joy of just having evering working without boilerplate clearly outpaced the not clear gain of having different error hierarchies. ◦ The problem would have been different if Rudder was not one monolithic app with a need of separated compilation between services. I think we would have made an “error” lib in that case.

Some questions asked after the talk 126 • We use
Future[Either[E,A]] + MTL, why should we switch to ZIO? ◦ Well, the decision to switch is yours, and I don’t know the speciﬁc context of your company to give an advice on that. Nonetheless, here is my personal opinion: ▪ ZIO stack seems simpler (less concepts) and work perfectly with inference. Thus it may be simpler to teach it to new people, and to maintain. YMMV. ▪ ZIO perf are excellent, especially regarding concurrent code. Fibers are a very nice abstraction to work with. ▪ ZIO enforce pure code, which is generally simpler to compose/refactor. ▪ ZIO tooling and linked construction (Managed resources, Async Queues, STM, etc) are a joy to code with. It removes a lot of pains in tedious, boring, complicated tasks (closing resources correctly, sync between concurrent access, etc) ▪ pertinent stack trace in concurrent code is a major win • But at the end of the day, you decide!

Some questions asked after the talk 127 • How long
did it took to port Rudder to ZIO? ◦ It’s complicated :). 1 month of part time (me), plus lots more time for teaching, refactoring, understanding new paradigm limits, etc ▪ 1/ we didn’t started from nowhere. We were using Box from liftweb, and a lot of the code in Rudder was already “shaped” to deal with errors as explain in the talk (see https://issues.rudder.io/issues/14870 for context) ▪ 2/ we didn’t ported all Rudder to ZIO. I estimated that we ported ~ 40% of the code (60k-70k lines ?). ▪ 3/ we did some major refactoring along the lines, using new combinators and higher level structures (like async queues) ▪ 4/ we started in end of 2018, when ZIO code was still moving a lot and we switch to new things we when became available (ZIO 1.0.0 is around the corner and it as been quite stable for months now) ▪ we spent quite some time looking for the best choice for errors between sub-system (see other question)

Some questions asked after the talk 128 • Your system
part is very interesting thank you but what about hexagonal architecture / clean code / onion architecture / etc ? ◦ I don't really care of the exact name, what is important for me, the core idea that need to be internalized and shared, is that in a any complex construction, there is specialized subparts that communicate between them, and that high coupling really means "no subparts". ◦ and the ﬁrst baby, huge step, is to identify that these subpart exists, and discussed what they are, and give them autonomy. ◦ and ﬁrst practice that helps for that is: make as explicit as possible your system interfaces

DevoxxFR 2021 - Systematic error management in ...

DevoxxFR 2021 - Systematic error management in application

More Decks by fanf42

Other Decks in Programming

Featured

Transcript