CodeCamp 2016 - How the web and browsers actually work

@jamesmacﬁe How the web and browsers actually work An in-depth
look at how things are rendered to the screen #codecamp

#codecamp This is a talk about some technical things we
often gloss over

#codecamp What happens when we enter in a domain? And
how is that information rendered to the screen?

Oh look, we’ve entered a url and pressed enter #codecamp

Before the browser does anything, it checks the url validity
#codecamp If there are invalid characters, the browser will use Punycode to make the URL valid. bücher.com xn--bcher-kva.com/ büücher.com xn--bcher-kvaa.com/ münchen.com xn--mnchen-3ya.com/

Does the browser need to force the request to use
https? #codecamp There’s an internal cache of sites that have requested to only be communicated with https, not http HTTP Strict Transport Security LISt

Now the browser can request the domain’s ip address #codecamp
Cache Hosts ﬁle Local network DNS resolver

#codecamp www. Domains resolve right to left google.COM

Example for www.google.com #codecamp Recursive resolver Root resolver “Hey, can
I have the IP address for www.google.com?” “Don’t have it - try xxx.xxx.xxx.xxx” “Don’t have it - try xxx.xxx.xxx.xxx” TLD resolver “Hey, can I have the IP address for www.google.com?” Domain resolver “Hey, can I have the IP address for www.google.com?” Sure, it’s -xxx.xxx.xxx.xxx”

#codecamp At this point, we still haven’t made a request
for any data

This is what a request and response looks like #codecamp
Request Response 200 OK Content-Length: 243 Content-Type: text/html [response headers] <html> <head> <title>Howdy</title> </head> GET / HTTP/1.1 Host: google.com Connection: close [other headers]

We use different ports for http and https #codecamp 80
HTTP 443 HTTPS

Now, what’s this tcp/ip I’ve heard about? #codecamp TCP/IP handles
how data is transferred 1 4 7 2 5 8 3 6 9 6 1 7 2 9 5 3 4 8 1 4 7 2 5 8 3 6 9

WOOHOO! Now we start getting the document contents

These are the different parts of a browser #codecamp User
interface Browser engine Rendering engine Networking Data persistence JavaScript UI Backend

#codecamp Let’s talk about the rendering engine

Rendering html basically does this #codecamp Parse HTML and create
the DOM tree Parse styles and create the render tree Layout the render tree Paint the render tree

#codecamp Apologies, but we’re going to go into a bit
of detail about parsing things

#codecamp In a nutshell, parsing is translating some input into
a structure that can be used in code

For example, 2 + 3 - 1 could return this
#codecamp Expression - Number 1 Expression + Number 2 Number 3

#codecamp Grammars, vocabulary, and syntax

XML is a context free grammar #codecamp <memo> <addressee>John</addressee> <sender>Carla</addressee>
<date>19980901</date> <title>New coffee maker</title> <body> The new coffee maker has been installed! Operation is simple: put a cup in the opening and press the red button. </body> </memo>

#codecamp parsing involves two processes - lexical analysis and syntax
analysis

Lexical analysis - breaking the input into tokens #codecamp Expression
- Number 1 Expression + Number 2 Number 3

Syntax analysis - converting tokens into a tree #codecamp Expression
- Number 1 Expression + Number 2 Number 3

#codecamp Ok, sure. but how is parsing html different?

#codecamp HTML is not a context free grammar so we
cannot use common parsing techniques

#codecamp How html is parsed is defined by the w3c

#codecamp There are two main reasons for this: - HTML
parsers are extremely fault tolerant - the html can be modified as it is being parsed

html’s forgiving nature is pretty nice really #codecamp <html> <div>
<p> </div> </span>Crappy HTML </p> </html> <html> <head></head>  <body> <div> <p></p> </div> Crappy HTML <p></p> </body> </html>

html’s forgiving nature is pretty nice really #codecamp <table> <table>
<tr><td>inner table</td></tr> </table> <tr><td>outer table</td></tr> </table> <table> <tr><td>outer table</td></tr> </table> <table> <tr><td>inner table</td></tr> </table>

HTML’s parsing process is reentrant #codecamp Dynamic code can modify
the HTML as it is being parsed. This can add extra tokens to the HTML. Think about a script tag that gets evaluated in the middle of the input which contains a document.write call

The html parsing flow #codecamp DOM tree construction Network data
Tokeniser Script execution DOM document.write()

#codecamp both the tokeniser and the tree constructor act like
a state machine

Let’s look at how the tokeniser would parse this #codecamp
<html> <body> Codecamp </body> </html> Current state: “Data”

<html> <body> Codecamp </body> </html> Current state: “Tag open”

<html> <body> Codecamp </body> </html> Current state: “Tag name”

<html> <body> Codecamp </body> </html> Token created: start-tag {name: html}

<html> <body> Codecamp </body> </html> Token created: character {data: C}

<html> <body> Codecamp </body> </html> Current state: “Tag open”

<html> <body> Codecamp </body> </html> Current state: “Close tag open”

We end up with twelve tokens #codecamp start-tag { name:
html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html }

Next the DOM tree is constructed from the tokens #codecamp
First, the root document node is created and all other nodes will be added to this. For each token, the spec deﬁnes which DOM element is relevant. These elements are added both to the DOM tree and also to the stack of open elements.

How does this get converted into the dom tree? #codecamp
start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: Initial

start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: Before html HTMLHtmlElement

start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: Before head HTMLHtmlElement HTMLHeadElement

start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: In head HTMLHtmlElement HTMLHeadElement

start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: After head HTMLHtmlElement HTMLHeadElement

start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: In body HTMLHtmlElement HTMLHeadElement HTMLBodyElement

start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: In body HTMLHtmlElement HTMLHeadElement HTMLBodyElement Text

start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: After body HTMLHtmlElement HTMLHeadElement HTMLBodyElement Text

start-tag { name: html } start-tag { name: body } character { data: C } character { data: o } character { data: d } character { data: e } character { data: c } character { data: a } character { data: m } character { data: p } end-tag { name: body } end-tag { name: html } State: After after body HTMLHtmlElement HTMLHeadElement HTMLBodyElement Text

This is our final dom tree #codecamp HTMLHtmlElement HTMLHeadElement HTMLBodyElement
Text <html> <body> Codecamp </body> </html>

slightly more complicated #codecamp HTMLHtmlElement <html> <body> <div> <p>Code</p> <p>camp</p>
</div> <div> <p> <span>2016</span> </p> </div> </body> </html> HTMLHeadElement HTMLBodyElement HTMLDivElement HTMLDivElement HTMLParagrahElement HTMLParagraphElement HTMLParagraphElement HTMLSpanElement Text Text Text

#codecamp AnD... We’re done parsing the html

We can now execute ‘defer’ scripts #codecamp <script> Ref: https://html.spec.whatwg.org/
Parse HTML Fetch script Execute script <script async> <script defer>

#codecamp Styles also pause the parsing of html. Well, mostly

#codecamp Not all of these styles will block <link href=“style.css"
rel=“stylesheet"> <link href=“style.css" rel="stylesheet" media=“all”> <link href="portrait.css" rel="stylesheet" media=“orientation:portrait"> <link href=“print.css" rel="stylesheet" media="print">

#codecamp WHILE the DOM tree is being constructed, so is
another tree - the render tree

#codecamp Unlike html, css is a context free grammar

For example - dom tree #codecamp <head></head> <body> <p> Codecamp
<span> 2016 </span> </p> <span> Wellington </span> <div> <img src=“codecamp.jpg” /> </div> </body> body span div img p span text text text head

For example - cssom #codecamp body {  font-size: 16px; }
p { font-weight: bold; } p span { display: none; } span { color: red; } img {  float: right; } img p span body span font-size: 16px; font-weight: bold; font-size: 16px; font-size: 16px; font-weight: bold; display: none; color: red; font-size: 16px; color: red; font-size: 16px; ﬂoat: right; Note - this doesn’t include browser styles so is incomplete, but hopefully you get the idea div font-size: 16px;

Not all dom nodes are rendered to the screen #codecamp
The <head> tag, for example, has no visual element. Neither does anything with display: none.

For example - render tree #codecamp div p span body
font-size: 16px; font-weight: bold; font-size: 16px; font-size: 16px; color: red; font-size: 16px; ﬂoat: right; text text img font-size: 16px;

BTW, some nodes have more than one element to render
#codecamp Take a select box for example Select an item Item number two Item number three Item number four Item number one Input Button Dropdown box

Some nodes are also present in a different position #codecamp
That is, the node is present both in the DOM tree and render tree, but not at the same point. Eg, an absolute or fixed positioned element

Previous example with an absolutely positioned img #codecamp body span
div p span img body {  position: relative } img {  position: absolute; }

Where does the visual property information come from? #codecamp It
comes from a few different places: - the browser’s defaults - user stylesheets - author stylesheets (those from the developer) - inline styles

#codecamp style rules can appear multiple times The order of
these rules is very important

#codecamp not only does the browser have to keep track
of the order, but also calculate the specificity

#codecamp Example - different sources of css /* Browser defaults
*/ body {  font-family: serif; } /* Author styles */ body {  font-family: sans-serif; } #header { font-family: Helvetica; } h1 { font-family: ‘Comic Sans MS’; } <body> <main> <h1 id=“header” style=“font- family: Impact;”>Hello!</h1> </main> </body>

#codecamp GECKO creates an extra couple of trees for styles
- a rule tree and a style context tree

#codecamp Example - here’s some html and some css /*
1 */ div { display: block; text-indent: 1em; } /* 2 */ h1 { display: block; font-size: 3em; } /* 3 */ span { display: block; } /* 4 */ .un { text-decoration: underline; } <body> <div> <h1>Birdy bird</h1> <span class=“un”>Name: <strong>Spotted shag</strong></span> <span>Family: <em class=“un”>Phalacrocoracidae</em></span> </div> </body>

#codecamp Example - This would be the dom tree <body>
<div> <h1>Birdy bird</h1> <span class=“un”>Name: <strong>Spotted shag</strong></span> <span>Family: <em class=“un”>Phalacrocoracidae</em></span> </div> </body> div span.un h1 em.un span body strong

#codecamp Example - th is would be the rule tree
/* 1 */ div { display: block; text-indent: 1em; } /* 2 */ h1 { display: block; font-size: 3em; } /* 3 */ span { display: block; } /* 4 */ .un { text-decoration: underline; } A (null) C: 2 E: 4 B: 1 F: 4 D: 3

#codecamp Example - this would be the style context tree
body div span.un h1 span em.un strong B: 1 C: 2 A: null E: 4 F: 4 D: 3

There are also hash maps for quickly looking up styles
#codecamp Both gecko and webkit implement a few different hash maps for storing references to styles - ids - classes - tags - general

#codecamp We have our styles, now we need to calculate
layout

determining layout positions is a recursive process #codecamp To ﬁgure
out the exact position of each node in the render tree, we start at the root and traverse it to compute the geometry of all nodes. <body> <main style=“width:50%”> <div style=“width:50%”>  Hello! </div> </main> </body> Hello! Viewport (size = device width) div (50%) div (50%)

Layout - briefly #codecamp 1. A node determines it’s own
width 2. Over all the nodes children: 1. A child x and y positions set 2. Layout is called on the child if necessary 3. The parent uses the childs accumulated height to determine it’s own height

#codecamp

#codecamp small changes to the dom should have a proportionally
small processing time

#codecamp Final step - paint all the things

#codecamp each node in the render is painted individually

Painting things happens in a certain order #codecamp 1. Background
colour 2. Background image (which includes gradients) 3. Border 4. Children 5. Outline

We’ve finished rendering! we can now interact with stuff

Places where this info came from #codecamp HTML5 Rocks article
(from 2011, but very in depth) http://www.html5rocks.com/en/tutorials/internals/howbrowserswork Google’s BlinkOn internal talks https://www.youtube.com/channel/UCIfQb9u7ALnOE4ZmexRecDg Google Developers https://developers.google.com/web/fundamentals/performance Mozilla - Gecko overview https://wiki.mozilla.org/Gecko:Overview

@jamesmacﬁe Thanks #codecamp

#codecamp

CodeCamp 2016 - How the web and browsers actual...

CodeCamp 2016 - How the web and browsers actually work

More Decks by James Macfie

Featured

Transcript