
WebXR: Beyond WebGL

yomotsu
November 25, 2018


Transcript

  1. WebXR: Beyond WebGL
    HTML5 Conference 2018 | 2018-11-25

  2. Akihiro Oyamada (@yomotsu)
    Frontend Engineer at PixelGrid, Inc.

  3. https://www.youtube.com/watch?v=ttDLGV5IJCM


  4. This will be possible on every mobile phone

  5. 1. WebXR Device API


  6. WebXR Device API
    • The API for both AR and VR
      (the WebVR API will be replaced by the WebXR Device API)
    • Has a close relationship with Khronos' OpenXR

  7. As of Nov. 2018
    • Only available in Chrome Canary, behind flags
    • Only works over https or on localhost

  8. Connect your Android Chrome to localhost

    with DevTools on desktop

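    (A typical setup, as an assumption rather than from the slides: open chrome://inspect#devices in desktop Chrome, enable port forwarding, and map a port such as 8080 to localhost:8080, so that the phone reaches the dev server running on the desktop.)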

  9. FYI: an Origin Trials token is required,
    except on localhost
    https://github.com/GoogleChrome/OriginTrials/blob/gh-pages/developer-guide.md
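
    The token is typically delivered via a meta tag (or an Origin-Trial HTTP response header); a minimal sketch, with a placeholder token:

    <meta http-equiv="origin-trial" content="YOUR_TOKEN_HERE">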


  12. Consists of two sources

  13. WebGL framebuffer =
    WebGL scene (under your control)
    + XR imagery (camera input)

  14. WebGL framebuffer


  15. How does it work?


  16. How does it work?
    const device = await navigator.xr.requestDevice();
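
    A hedged feature-detection sketch (the fallback handling is an assumption, not from the talk):

    // bail out gracefully when the WebXR Device API isn't available
    if ( 'xr' in navigator ) {
      navigator.xr.requestDevice()
        .then( ( device ) => { /* continue with session setup */ } )
        .catch( () => console.warn( 'No XR device found' ) );
    } else {
      console.warn( 'WebXR Device API is not available in this browser' );
    }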

  18. XRSession
    device.requestSession( {
      immersive: false,              // inline (non-immersive) session
      outputContext: xrContext,      // context the XR imagery is presented to
      environmentIntegration: true,  // enable AR (camera) integration
    } );

  19. XRSession
    // the WebGL context to be used as a source for XR imagery
    const glCanvas = document.createElement( 'canvas' );
    const gl = glCanvas.getContext( 'webgl', { xrCompatible: true } );
    xrSession.baseLayer = new XRWebGLLayer( xrSession, gl );

  20. The framebuffer can be filled with XR imagery
    gl.bindFramebuffer( gl.FRAMEBUFFER, xrSession.baseLayer.framebuffer );


  21. Draw your 3D scene on it



  23. frameOfReference:
    the real-world coordinate system

  24. XRFrame:
    a per-frame delta against the frameOfReference

  25. Position and orientation
    xrFrame.views[ n ].projectionMatrix

    const pose = xrFrame.getDevicePose( frameOfRef );
    pose.getViewMatrix( view );

    Each XRFrame provides the device pose (position and orientation)
    as a delta against the frameOfReference.

  26. Demo


  27. Dive into the code!


  28. const width = window.innerWidth;
    const height = window.innerHeight;

    navigator.xr.requestDevice().then( ( device ) => {

      const outputCanvas = document.getElementById( 'xrCanvas' );
      outputCanvas.width = width;
      outputCanvas.height = height;
      const xrContext = outputCanvas.getContext( 'xrpresent' );

      // the session request must be made from a user action, such as a click
      window.addEventListener( 'click', onEnterAR );

      async function onEnterAR() {

        const xrSession = await device.requestSession( {
          outputContext: xrContext,
          environmentIntegration: true,
        } );

        const renderer = new THREE.WebGLRenderer();
        renderer.autoClear = false;
        renderer.setSize( width, height );


  31. window.addEventListener( 'click', onEnterAR );

    async function onEnterAR() {

      const xrSession = await device.requestSession( {
        outputContext: xrContext,
        environmentIntegration: true,
      } );

      const renderer = new THREE.WebGLRenderer();
      renderer.autoClear = false;
      renderer.setSize( width, height );

      // bind the gl context to the XR session
      const gl = renderer.getContext();
      gl.setCompatibleXRDevice( xrSession.device );
      xrSession.baseLayer = new XRWebGLLayer( xrSession, gl );

      const scene = new THREE.Scene();
      const camera = new THREE.PerspectiveCamera();
      camera.matrixAutoUpdate = false; // the camera matrix is driven by the device pose
      const box = new THREE.Mesh(
        new THREE.BoxBufferGeometry( .2, .2, .2 ),
        new THREE.MeshNormalMaterial()


  34. xrSession.baseLayer = new XRWebGLLayer( xrSession, gl );

    const scene = new THREE.Scene();
    const camera = new THREE.PerspectiveCamera();
    camera.matrixAutoUpdate = false;
    const box = new THREE.Mesh(
      new THREE.BoxBufferGeometry( .2, .2, .2 ),
      new THREE.MeshNormalMaterial()
    );
    scene.add( box );

    const frameOfRef = await xrSession.requestFrameOfReference( 'eye-level' );
    xrSession.requestAnimationFrame( onDrawFrame );

    function onDrawFrame( timestamp, xrFrame ) {

      const session = xrFrame.session; // xrSession === xrFrame.session
      const pose = xrFrame.getDevicePose( frameOfRef );
      session.requestAnimationFrame( onDrawFrame );
      gl.bindFramebuffer( gl.FRAMEBUFFER, session.baseLayer.framebuffer );

  35. scene.add( box );

    const frameOfRef = await xrSession.requestFrameOfReference( 'eye-level' );
    xrSession.requestAnimationFrame( onDrawFrame );

    function onDrawFrame( timestamp, xrFrame ) {

      const session = xrFrame.session; // xrSession === xrFrame.session
      const pose = xrFrame.getDevicePose( frameOfRef );
      session.requestAnimationFrame( onDrawFrame );
      gl.bindFramebuffer( gl.FRAMEBUFFER, session.baseLayer.framebuffer );

      if ( ! pose ) return;

      // if the session serves both eyes, views has a length of 2; otherwise 1
      xrFrame.views.forEach( ( view ) => {

        const viewport = session.baseLayer.getViewport( view );
        renderer.setSize( viewport.width, viewport.height );


  37. function onDrawFrame( timestamp, xrFrame ) {

      const session = xrFrame.session; // xrSession === xrFrame.session
      const pose = xrFrame.getDevicePose( frameOfRef );
      session.requestAnimationFrame( onDrawFrame );
      gl.bindFramebuffer( gl.FRAMEBUFFER, session.baseLayer.framebuffer );

      if ( ! pose ) return;

      // if the session serves both eyes, views has a length of 2; otherwise 1
      xrFrame.views.forEach( ( view ) => {

        const viewport = session.baseLayer.getViewport( view );
        renderer.setSize( viewport.width, viewport.height );
        camera.projectionMatrix.fromArray( view.projectionMatrix );
        const viewMatrix = new THREE.Matrix4().fromArray( pose.getViewMatrix( view ) );
        camera.matrix.getInverse( viewMatrix ); // camera matrix = inverse of the view matrix
        camera.updateMatrixWorld( true );
        renderer.clearDepth();
        renderer.render( scene, camera );


  39. if ( ! pose ) return;

      // if the session serves both eyes, views has a length of 2; otherwise 1
      xrFrame.views.forEach( ( view ) => {

        const viewport = session.baseLayer.getViewport( view );
        renderer.setSize( viewport.width, viewport.height );
        camera.projectionMatrix.fromArray( view.projectionMatrix );
        const viewMatrix = new THREE.Matrix4().fromArray( pose.getViewMatrix( view ) );
        camera.matrix.getInverse( viewMatrix );
        camera.updateMatrixWorld( true );
        renderer.clearDepth();
        renderer.render( scene, camera );

      } ); // end of views.forEach
    }      // end of onDrawFrame
    }      // end of onEnterAR
    } );   // end of requestDevice().then

  40. Hit test


  41. Find the intersection point of a ray with a real-world surface
    source: https://codelabs.developers.google.com/codelabs/ar-with-webxr/index.html#4

  42. Demo


  43. The code


  44. The same as the previous example…
    const width = window.innerWidth;
    const height = window.innerHeight;
    const startButton = document.getElementById( 'startButton' );

    navigator.xr.requestDevice().then( ( device ) => {

      const outputCanvas = document.getElementById( 'xrCanvas' );
      outputCanvas.width = width;
      outputCanvas.height = height;
      const xrContext = outputCanvas.getContext( 'xrpresent' );

      // the session request must be made from a user action, such as a click
      startButton.addEventListener( 'click', onEnterAR );

      async function onEnterAR() {

        startButton.style.display = 'none';
        const xrSession = await device.requestSession( {

  45. Add a click action
        renderer.render( scene, camera );

      } );
    }

    window.addEventListener( 'click', onClick );

    // use a Raycaster to build the ray origin and direction
    const raycaster = new THREE.Raycaster();

    // onClick must be async, since the hit test is awaited
    async function onClick() {

      const x = 0;
      const y = 0;
      raycaster.setFromCamera( { x, y }, camera );
      const origin = new Float32Array( raycaster.ray.origin.toArray() );
      const direction = new Float32Array( raycaster.ray.direction.toArray() );
      const hits = await xrSession.requestHitTest( origin, direction, frameOfRef );

      if ( hits.length ) {


  47. window.addEventListener( 'click', onClick );

    // use a Raycaster to build the ray origin and direction
    const raycaster = new THREE.Raycaster();

    // onClick must be async, since the hit test is awaited
    async function onClick() {

      const x = 0;
      const y = 0;
      raycaster.setFromCamera( { x, y }, camera );
      const origin = new Float32Array( raycaster.ray.origin.toArray() );
      const direction = new Float32Array( raycaster.ray.direction.toArray() );
      const hits = await xrSession.requestHitTest( origin, direction, frameOfRef );

      if ( hits.length ) {

        const hit = hits[ 0 ];
        const hitMatrix = new THREE.Matrix4().fromArray( hit.hitMatrix );
        const box = new THREE.Mesh(

  48. const raycaster = new THREE.Raycaster();

    // onClick must be async, since the hit test is awaited
    async function onClick() {

      const x = 0;
      const y = 0;
      raycaster.setFromCamera( { x, y }, camera );
      const origin = new Float32Array( raycaster.ray.origin.toArray() );
      const direction = new Float32Array( raycaster.ray.direction.toArray() );
      const hits = await xrSession.requestHitTest( origin, direction, frameOfRef );

      if ( hits.length ) {

        const hit = hits[ 0 ];
        const hitMatrix = new THREE.Matrix4().fromArray( hit.hitMatrix );
        const box = new THREE.Mesh(
          new THREE.BoxBufferGeometry( .2, .2, .2 ),
          new THREE.MeshNormalMaterial()
        );


  50. raycaster.setFromCamera( { x, y }, camera );
    const origin = new Float32Array( raycaster.ray.origin.toArray() );
    const direction = new Float32Array( raycaster.ray.direction.toArray() );
    const hits = await xrSession.requestHitTest( origin, direction, frameOfRef );

    if ( hits.length ) {

      const hit = hits[ 0 ];
      const hitMatrix = new THREE.Matrix4().fromArray( hit.hitMatrix );
      const box = new THREE.Mesh(
        new THREE.BoxBufferGeometry( .2, .2, .2 ),
        new THREE.MeshNormalMaterial()
      );
      box.position.setFromMatrixPosition( hitMatrix ); // place the box at the hit point
      scene.add( box );

    }   // end of if ( hits.length )
    }   // end of onClick
    }   // end of onEnterAR


  52. Demo

  53. How about iOS?


  54. 2. AR Quick Look


  55. AR Quick Look
    • Introduced with the release of iOS 12
    • Works exclusively in Safari
      (doesn't work even in Chrome for iOS)
    • Uses special HTML syntax
    • Apple's proprietary technology




  57. Link to the usdz file with rel="ar"
    The link must contain one <img> or <picture> as a direct child
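
    A minimal sketch of the markup; the file names are placeholders:

    <a rel="ar" href="model.usdz">
      <img src="poster.png" alt="View this model in AR">
    </a>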

  58. Demo

  59. USDZ


  60. USDZ
    • Stands for "Universal Scene Description",
      archived with zip
    • The 3D model format for AR Quick Look
    • Created by Apple and Pixar

  61. How to prepare USDZ
    • With "usdz_converter",
      an Xcode command-line tool (macOS only)
    • With Vectary (a web service);
      details such as size cannot be configured

  62. Limitations
    • Up to 10M polygons
    • Up to 10 seconds of animation
    • Up to 2048×2048 texture size

  63. USDZ Converter (Terminal.app)
    $ xcrun usdz_converter ./my-model.obj my-model.usdz \
        -color_map albedo.jpg -metallic_map metallic.jpg \
        -roughness_map roughness.jpg -normal_map normal.jpg \
        -ao_map ao.jpg -emissive_map emissive.jpg
    The first argument is the input file, the second is the output name;
    each map option is an option name followed by its value.

  66. Vectary
    https://www.vectary.com/

  68. USDZ export
    https://www.vectary.com/

  69. • Apple's proprietary technology
      (hopefully a temporary spec until the WebXR Device API lands)
    • Some limitations in USDZ
    • Just pops up and shows the model in AR
      (cannot be used for games or other interactive content)

  70. 3D model format for Web Apps?

  71. 3. glTF


  72. What is glTF
    • Stands for GL Transmission Format
    • An open-standard 3D model format
    • The "JPEG of 3D"
    • Maintained by Khronos

  73. What is glTF
    • JSON as the container, with binary payloads,
      or packed into a single binary file called glb
    • Supports animation
    • Extensible, just like the WebGL spec
      (like Adobe Fireworks PNG)
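
    To give a feel for the container, a heavily trimmed sketch of a .gltf JSON file (the values are illustrative only):

    {
      "asset": { "version": "2.0" },
      "scenes": [ { "nodes": [ 0 ] } ],
      "nodes": [ { "mesh": 0 } ],
      "meshes": [ { "primitives": [ { "attributes": { "POSITION": 0 } } ] } ],
      "accessors": [ … ],
      "bufferViews": [ … ],
      "buffers": [ { "uri": "model.bin", "byteLength": 1024 } ]
    }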


  75. glTF of the present
    • Supported by many 3D modeling tools
    • Several WebGL libraries support glTF loading
      (such as three.js, Babylon.js, Cesium)
    • Microsoft Paint 3D, Office and others use glb as their 3D model format
    • Adobe Animate has a glTF exporter
    • Facebook's 3D posts use glb
    • VRM: a glTF-based format extended for humanoid avatars
      (for virtual YouTubers, VRChat and others)

  76. https://www.youtube.com/watch?v=H2XoeQmkchw


  77. https://www.youtube.com/watch?v=zzDM42PdqZk


  78. Load glTF in three.js


  79. const width = window.innerWidth;
    const height = window.innerHeight;

    const scene = new THREE.Scene();
    const camera = new THREE.PerspectiveCamera( 45, width / height, 0.001, 100 );
    camera.position.set( 0, 0, 0.5 );

    const renderer = new THREE.WebGLRenderer();
    renderer.setSize( width, height );
    renderer.gammaInput = true;
    renderer.gammaOutput = true;
    document.body.appendChild( renderer.domElement );

    scene.add( new THREE.HemisphereLight( 0xffffff, 0x332222 ) );

  80. document.body.appendChild( renderer.domElement );
    scene.add( new THREE.HemisphereLight( 0xffffff, 0x332222 ) );

    const loader = new THREE.GLTFLoader();
    loader.load(
      './models/barger/barger.gltf',
      function ( gltf ) {

        scene.add( gltf.scene );

      }
    );

    ( function anim () {

      requestAnimationFrame( anim );

  81.   scene.add( gltf.scene );

      }
    );

    ( function anim () {

      requestAnimationFrame( anim );
      renderer.render( scene, camera );

    } )();
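
    Note: THREE.GLTFLoader is not part of the three.js core; it ships separately (under examples/js/loaders/ in the three.js repository) and must be included alongside three.js itself.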


  83. Demo

  84. glTF Animation

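
    The talk demonstrates this live; a minimal sketch of playing a glTF animation clip in three.js (the clip index and model path are assumptions):

    // play the first animation clip of a loaded glTF
    const clock = new THREE.Clock();
    let mixer;

    loader.load( './models/barger/barger.gltf', function ( gltf ) {

      scene.add( gltf.scene );
      mixer = new THREE.AnimationMixer( gltf.scene );
      mixer.clipAction( gltf.animations[ 0 ] ).play();

    } );

    ( function anim () {

      requestAnimationFrame( anim );
      if ( mixer ) mixer.update( clock.getDelta() ); // advance the animation
      renderer.render( scene, camera );

    } )();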

  85. Demo


  86. glTF is widely adopted as the standard

  87. Security issues


  88. • Gaze tracking:
      what a user types on a virtual keyboard can be inferred from gaze direction in a VR environment
    • Trusted environment:
      motion sickness caused by low frame rates and pose-tracking errors
    • Fingerprinting:
      a user's room shape, or even face shape, can be identified from depth data

  89. Conclusion


  90. XR is coming to the Web


  91. WebXR Device API
    • A Web API (in development)
    • For both VR and AR
    • Works right in web browsers:
      no add-ons or installation required

  92. AR Quick Look
    • AR for iOS
    • Uses USDZ
    • Basic AR features

  93. glTF
    • The standard
    • A 3D model format in JSON or binary
    • Can be seen in many places
    • Loaders are available in JavaScript

  94. The Web will be connected to the real world
    Source: https://www.netflix.com/jp/title/80182418
    © Shirow Masamune, Production I.G / Kodansha, Ghost in the Shell Production Committee

  95. gl.finish();
    @yomotsu
