Slide 1

Slide 1 text

No content

Slide 2

Slide 2 text

BézierSketch: A generative model for scalable vector sketches Ayan Das1,2, Yongxin Yang1,2, Timothy Hospedales1,3, Tao Xiang1,2 and Yi-Zhe Song1,2 1 SketchX, CVSSP, University of Surrey, United Kingdom 2 iFlyTek-Surrey Joint Research Centre on Artificial Intelligence 3 University of Edinburgh, United Kingdom ECCV 2020 (Poster)

Slide 3

Slide 3 text

Traditional Sketch generator ➢ Prior works: ❑ Sketch-RNN[1] by Ha et. al. for free-hand doodles ❑ Handwriting generation by A. Graves[2] [1] Ha, D., Eck, D.: A neural representation of sketch drawings. In: ICLR (2018) [2] Graves, A.: Generating Sequences With Recurrent Neural Networks. (2013)

Slide 4

Slide 4 text

Traditional Sketch generator ➢ Prior works: ❑ Sketch-RNN[1] by Ha et. al. for free-hand doodles ❑ Handwriting generation by A. Graves[2] [1] Ha, D., Eck, D.: A neural representation of sketch drawings. In: ICLR (2018) [2] Graves, A.: Generating Sequences With Recurrent Neural Networks. (2013) ➢ Generates explicit waypoints, i.e., sampled coordinates from the continuous drawing trajectory ❑ Relatively longer representation ❑ More temporal correlation ❑ Fixed resolution (depends on sampling rate) ❑ Bad quality, at least for artistic applications

Slide 5

Slide 5 text

➢ Prior works: ❑ Learning to synthesize parametric curves[1] (unsupervised) ❑ Font generation[2] (supervised) [1] Ganin, Y., Kulkarni, T., Babuschkin, I., Eslami, S.M.A., Vinyals, O.: Synthesizing programs for images using reinforced adversarial learning. In: ICML (2018) [2] Lopes, R.G., Ha, D., Eck, D., Shlens, J.: A learned representation for scalable vector graphics. In: ICCV (2019) Vectorized Sketch generator

Slide 6

Slide 6 text

➢ Prior works: ❑ Learning to synthesize parametric curves[1] (unsupervised) ❑ Font generation[2] (supervised) [1] Ganin, Y., Kulkarni, T., Babuschkin, I., Eslami, S.M.A., Vinyals, O.: Synthesizing programs for images using reinforced adversarial learning. In: ICML (2018) [2] Lopes, R.G., Ha, D., Eck, D., Shlens, J.: A learned representation for scalable vector graphics. In: ICCV (2019) ➢ Generates parametric curve specifications, i.e., control points of Bézier curves, B-Splines, etc. ❑ Relatively shorter representation ❑ Inherently arbitrary resolution ❑ High quality for artistic uses. Vectorized Sketch generator

Slide 7

Slide 7 text

➢ Prior works: ❑ Learning to synthesize parametric curves[1] (unsupervised) ❑ Font generation[2] (supervised) [1] Ganin, Y., Kulkarni, T., Babuschkin, I., Eslami, S.M.A., Vinyals, O.: Synthesizing programs for images using reinforced adversarial learning. In: ICML (2018) [2] Lopes, R.G., Ha, D., Eck, D., Shlens, J.: A learned representation for scalable vector graphics. In: ICCV (2019) ➢ Generates parametric curve specifications, i.e., control points of Bézier curves, B-Splines, etc. ❑ Relatively shorter representation ❑ Inherently arbitrary resolution ❑ High quality for artistic uses. Vectorized Sketch generator

Slide 8

Slide 8 text

❑ Converts traditional waypoint-based representation into Bézier curves[1] ❑ The model is learnable, high-quality and faster than standard fitting-based algorithms BézierEncoder: An inverse-graphics based encoder for Bézierrepresentations [1] Salomon, D.: Curves and surfaces for computer graphics. Springer Science & Business Media (2007) Softmax+

Slide 9

Slide 9 text

❑ Converts traditional waypoint-based representation into Bézier curves[1] ❑ The model is learnable, high-quality and faster than standard fitting-based algorithms BézierEncoder: An inverse-graphics based encoder for Bézierrepresentations [1] Salomon, D.: Curves and surfaces for computer graphics. Springer Science & Business Media (2007) Softmax+

Slide 10

Slide 10 text

❑ Converts traditional waypoint-based representation into Bézier curves[1] ❑ The model is learnable, high-quality and faster than standard fitting-based algorithms BézierEncoder: An inverse-graphics based encoder for Bézierrepresentations [1] Salomon, D.: Curves and surfaces for computer graphics. Springer Science & Business Media (2007) Softmax+

Slide 11

Slide 11 text

❑ Converts traditional waypoint-based representation into Bézier curves[1] ❑ The model is learnable, high-quality and faster than standard fitting-based algorithms BézierEncoder: An inverse-graphics based encoder for Bézierrepresentations [1] Salomon, D.: Curves and surfaces for computer graphics. Springer Science & Business Media (2007) Softmax+

Slide 12

Slide 12 text

❑ Converts traditional waypoint-based representation into Bézier curves[1] ❑ The model is learnable, high-quality and faster than standard fitting-based algorithms BézierEncoder: An inverse-graphics based encoder for Bézierrepresentations [1] Salomon, D.: Curves and surfaces for computer graphics. Springer Science & Business Media (2007) Softmax+

Slide 13

Slide 13 text

❑ Converts traditional waypoint-based representation into Bézier curves[1] ❑ The model is learnable, high-quality and faster than standard fitting-based algorithms BézierEncoder: An inverse-graphics based encoder for Bézierrepresentations [1] Salomon, D.: Curves and surfaces for computer graphics. Springer Science & Business Media (2007) Softmax+

Slide 14

Slide 14 text

❑ Converts traditional waypoint-based representation into Bézier curves[1] ❑ The model is learnable, high-quality and faster than standard fitting-based algorithms BézierEncoder: An inverse-graphics based encoder for Bézierrepresentations [1] Salomon, D.: Curves and surfaces for computer graphics. Springer Science & Business Media (2007) Softmax+ Bézier curve formulation

Slide 15

Slide 15 text

❑ Converts traditional waypoint-based representation into Bézier curves[1] ❑ The model is learnable, high-quality and faster than standard fitting-based algorithms BézierEncoder: An inverse-graphics based encoder for Bézierrepresentations [1] Salomon, D.: Curves and surfaces for computer graphics. Springer Science & Business Media (2007) Softmax+ Bézier curve formulation Non-parametric Decoder Parametric Encoder

Slide 16

Slide 16 text

BézierEncoder: Training ➢ We simply minimize reconstruction loss :

Slide 17

Slide 17 text

BézierEncoder: Training ➢ We simply minimize reconstruction loss : ➢ Furthermore, we extend this simple formulation by considering an ensemble of losses with different degrees of Bézier curves

Slide 18

Slide 18 text

BézierEncoder: Training ➢ We simply minimize reconstruction loss : ➢ Furthermore, we extend this simple formulation by considering an ensemble of losses with different degrees of Bézier curves ➢ Restricting consecutive control points to be closer has regularizing effect

Slide 19

Slide 19 text

Waypoint based representation Bézier curve representation BézierEncoder: Inference BézierEncoder

Slide 20

Slide 20 text

Waypoint based representation Bézier curve representation BézierEncoder: Inference BézierEncoder

Slide 21

Slide 21 text

BézierSketch: Control Point mode ΔP1 [0,0] [0,0] Pi-1 ❑ At each time-step, it predicts difference vector of successive control points. ΔP1

Slide 22

Slide 22 text

BézierSketch: Control Point mode ΔP1 [0,0] ΔPi Δpi-1 [0,0] ΔPi Pi-1 Pi … ΔPi ❑ At each time-step, it predicts difference vector of successive control points. ΔP1

Slide 23

Slide 23 text

BézierSketch: Control Point mode ΔP1 [0,0] ΔPi Δpi-1 [0,0] ΔPi Pi-1 Pi … ΔPi ❑ At each time-step, it predicts difference vector of successive control points. ΔP1 ❑ Also predicts “stroke-end” and “sketch- end” bit, just like Sketch-RNN

Slide 24

Slide 24 text

BézierSketch: Control Point mode ΔP1 [0,0] ΔPi Δpi-1 [0,0] ΔPi Pi-1 Pi … ΔPi ❑ At each time-step, it predicts difference vector of successive control points. ΔP1 ❑ Also predicts “stroke-end” and “sketch- end” bit, just like Sketch-RNN ❑ Average length goes down by 33% Stroke lengths Sketch lengths

Slide 25

Slide 25 text

S1 Si Si-1 … S0 BézierSketch: Stroke mode ❑ At each time-step, it predicts the next stroke

Slide 26

Slide 26 text

S1 Si Si-1 … S0 Si BézierSketch: Stroke mode ❑ At each time-step, it predicts the next stroke

Slide 27

Slide 27 text

S1 Si Si-1 … S0 Si BézierSketch: Stroke mode ❑ At each time-step, it predicts the next stroke ❑ Fixed-degree Bézier representation used. Predicts 2N real numbers (N being the degree of Bézier curve)

Slide 28

Slide 28 text

S1 Si Si-1 … S0 Si BézierSketch: Stroke mode ❑ At each time-step, it predicts the next stroke ❑ Fixed-degree Bézier representation used. Predicts 2N real numbers (N being the degree of Bézier curve) ❑ RNN time-steps is the number of strokes in a sketch

Slide 29

Slide 29 text

Qualitative results Unconditional “Control point mode” generation Unconditional “Stroke mode” generation Conditional “Control point mode” generation Conditional “Stroke mode” generation

Slide 30

Slide 30 text

Future works

Slide 31

Slide 31 text

Future works ❑ Use more sophisticated parametric curves, like BSpline

Slide 32

Slide 32 text

Future works ❑ Use more sophisticated parametric curves, like BSpline ❑ Build encoder that predicts parametric curves directly from raster images

Slide 33

Slide 33 text

Thank You