Slide 1

Slide 1 text

OpenAI Generative Pre-Training … and friends: ULMFiT, ELMo, GPT, BERT, GPT-2

Robin Ranjit Singh Chauhan
https://twitter.com/robinc

Slide 2

Slide 2 text

Point?

“With only 100 labeled examples, it matches the performance of training from scratch on 100x more data” -- ULMFiT paper

“Our goal is to learn a universal representation that transfers with little adaptation to a wide range of tasks” -- GPT paper

● Heavy: Unsupervised pre-training
  ○ Like days on large TPU clusters
● Light: Supervised fine-tuning per task
  ○ Like hours on a GPU

Slide 3

Slide 3 text

ImageNet moment for NLP?

● OLD Deep Learning for NLP
  ○ Download stale, static word vectors
  ○ Build what pitiful models we can from such lowly inputs
● Deep Learning for images
  ○ Download a fat network pre-trained on ImageNet
  ○ Fine-tune for freshness
● NEW Deep Learning for NLP
  ○ Download a fat, happy network pre-trained as a language model + fine-tune for freshness (see the sketch below)
  ○ (OR: GPT-2: zero-shot transfer)
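Below is a minimal sketch of the "download a pre-trained language model, then fine-tune" recipe. It assumes the Hugging Face transformers library (not something used in the deck) and a toy one-example "dataset"; it illustrates the shape of the workflow, not the exact procedure from any of the papers.

```python
# Sketch: load weights from unsupervised pre-training, then fine-tune on a tiny
# task-specific text. Assumes `pip install torch transformers`.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")   # the "fat, happy" pre-trained network

# Supervised fine-tuning step: keep the language-modeling objective, but train on
# task-formatted text (here a single toy sentiment example).
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
batch = tokenizer("review: a delightful film. sentiment: positive", return_tensors="pt")
loss = model(**batch, labels=batch["input_ids"]).loss   # next-token prediction loss
loss.backward()
optimizer.step()
```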

Slide 4

Slide 4 text

The next AI revolution will not be supervised or purely reinforced. The future is self-supervised learning with massive amounts of data and very large networks. --Yann LeCun

Slide 5

Slide 5 text

The Secret to Intelligence is... Peek-a-boo??

Slide 6

Slide 6 text

Genealogy ● Dai & Le 2015 [Google] ○ use unlabeled data to improve sequence learning with LSTM RNN ● ULMFiT: Howard and Ruder 2018 [fast.ai] ○ LSTM-based ● ELMo [UWash] 2018 ○ Embeddings from Language Models; LSTM based ● New generation: LSTM -> Transformer ○ GPT [OpenAI] ■ BERT [Google] ■ GPT-2 [OpenAI] ■ BigBird (BERT-based) [Microsoft]

Slide 7

Slide 7 text

GPT: Unsupervised Pre-Training

● BooksCorpus
  ○ 7,000 unique unpublished books
  ○ Adventure, Fantasy, and Romance genres
● Pre-training data has long stretches of contiguous text
  ○ Allows the generative model to learn to condition on long-range information
  ○ ELMo: similar-size data, but sentences only -- no long-range structure

Slide 8

Slide 8 text

No content

Slide 9

Slide 9 text

Transformer (orig. for Translation)

● Many layers of multi-head attention (see the sketch below)
● Runs all at once (for each output token)
● Position encoding
  ○ Sinusoidal encoding of token position (intuitively, a continuous analogue of binary counting)
● Latest version
  ○ Universal Transformers, Dehghani et al. 2019

[Diagram: Input attention supplies Values + Keys; the output-so-far attention supplies Queries; self-attention uses Keys + Values + Queries]

GPT uses only the “Decoder” part of the Transformer.
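To make the multi-head attention bullet concrete, here is a minimal NumPy sketch of the scaled dot-product attention that each head computes. Shapes and random inputs are illustrative only; a real Transformer adds learned projections, multiple heads, masking, and position encodings.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Q, K, V: (seq_len, d_k) query, key, and value matrices."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                    # all-pairs similarities
    scores -= scores.max(axis=-1, keepdims=True)       # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)     # softmax over the keys
    return weights @ V                                 # weighted sum of values

seq_len, d_k = 4, 8
x = np.random.randn(seq_len, d_k)
out = scaled_dot_product_attention(x, x, x)            # self-attention: Q = K = V
print(out.shape)                                       # (4, 8)
```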

Slide 10

Slide 10 text

Transformer model vs classic RNN

● Inductive power
  ○ RNNs are weak with distant dependencies
● Computational concerns
  ○ Processes the whole sequence at once
  ○ Each timestep can be run in parallel
  ○ But: computation grows as the square of sequence length (see the sketch below)
    ■ Very expensive for longer sequences
    ■ GPT-2 context: 1024 tokens
    ■ BERT Large context: 512 tokens
● Attention: each token is a complete sample for backprop
  ○ Whereas an RNN requires multi-step backprop through time

See Yannic Kilcher’s video on Attention Is All You Need: https://www.youtube.com/watch?v=iDulhoQ2pro
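A rough back-of-the-envelope sketch of the quadratic cost: every head in every layer scores all pairs of positions, so the attention score matrix alone grows as seq_len². The head counts below are illustrative (12 as in the smallest GPT-2, 16 as in BERT Large).

```python
def attention_scores_per_layer(seq_len, n_heads):
    # One seq_len x seq_len score matrix per head, per layer.
    return n_heads * seq_len * seq_len

print(attention_scores_per_layer(1024, n_heads=12))  # 12,582,912 scores (GPT-2-like context)
print(attention_scores_per_layer(512, n_heads=16))   #  4,194,304 scores (BERT-Large-like context)
print(attention_scores_per_layer(2048, n_heads=12))  # doubling seq_len quadruples the count
```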

Slide 11

Slide 11 text

GPT: Transformer

Task-specific GPT variants aim to minimize custom per-task components.

Slide 12

Slide 12 text

Byte Pair Encoding

“open-vocabulary translation by encoding rare and unknown words as sequences of subword units”
-- Neural Machine Translation of Rare Words with Subword Units, Sennrich et al. 2015
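A minimal sketch of the BPE idea from Sennrich et al.: start from characters (plus an end-of-word marker) and repeatedly merge the most frequent adjacent symbol pair into a new subword unit. The toy corpus and merge count are made up for illustration; real tokenizers also apply the learned merges to encode unseen words.

```python
from collections import Counter

def learn_bpe(word_freqs, num_merges):
    """word_freqs: {tuple_of_symbols: count}. Returns the learned merge list."""
    merges = []
    for _ in range(num_merges):
        pair_counts = Counter()
        for symbols, freq in word_freqs.items():
            for pair in zip(symbols, symbols[1:]):
                pair_counts[pair] += freq
        if not pair_counts:
            break
        best = max(pair_counts, key=pair_counts.get)   # most frequent adjacent pair
        merges.append(best)
        updated = {}
        for symbols, freq in word_freqs.items():       # replace the pair everywhere
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            updated[tuple(out)] = freq
        word_freqs = updated
    return merges

toy = {("l", "o", "w", "</w>"): 5,
       ("l", "o", "w", "e", "r", "</w>"): 2,
       ("n", "e", "w", "e", "s", "t", "</w>"): 6}
print(learn_bpe(toy, num_merges=3))   # e.g. [('w', 'e'), ...] -- frequent pairs become subwords
```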

Slide 13

Slide 13 text

BERT: Bidirectional Encoder Representations from Transformers

● Bidirectionality; fine-tuning with additional layers
● Explicitly designed to be similar to GPT
  ○ “... many of the design decisions in BERT were intentionally chosen to be as close to GPT as possible so that the two methods could be minimally compared ...
  ○ ... The core argument of this work is that the two novel pre-training tasks presented in Section 3.3 account for the majority of the empirical improvements” -- BERT paper
● Pre-trained on
  ○ BooksCorpus (800M words)
  ○ Wikipedia (2,500M words)

Slide 14

Slide 14 text

BERT: Pre-training Tasks

Masked Language Model (see the sketch below)
● Bidirectional design -> can’t use left-to-right generative pre-training
● Fixed word masks

Next Sentence Prediction
● Given two sentences A and B: is B the sentence directly after A?

Bidirectionality gives more information for GLUE tasks, but precludes text generation.

Image from http://jalammar.github.io/illustrated-bert/
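A minimal sketch of how the masked language model input is built: a fraction of tokens is replaced by a [MASK] symbol and only those positions are scored against the original tokens. The 15% rate follows the BERT paper, but this sketch omits BERT's 80/10/10 mask/random/keep refinement and uses whole words rather than WordPiece tokens.

```python
import random

def mask_tokens(tokens, mask_prob=0.15, mask_token="[MASK]"):
    inputs, labels = [], []
    for tok in tokens:
        if random.random() < mask_prob:
            inputs.append(mask_token)   # model sees the mask ...
            labels.append(tok)          # ... and must predict the original token
        else:
            inputs.append(tok)
            labels.append(None)         # position not scored in the loss
    return inputs, labels

print(mask_tokens("the unicorns spoke perfect english".split()))
```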

Slide 15

Slide 15 text

GPT-2

● “Zero-shots to SOTA on 7 out of 8 tested language modeling datasets…”
  ○ No fine-tuning step!
● 1.5B-parameter Transformer
● Trained on 40GB of internet text
  ○ Reddit outlinks with 3+ upvotes as a proxy for human-curated content
● Larger, plus very minor changes vs. GPT

Slide 16

Slide 16 text

GPT-2: Zero-Shot SOTA

● Summarization
  ○ Append “TL;DR:” and let the model continue
● Translation (see the sketch below)
  ○ Condition the language model on a context of example pairs of the format
    ■ english sentence = french sentence
  ○ and then a final prompt of
    ■ english sentence =
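A sketch of the zero-shot prompt formats described above. The helper names are hypothetical; the only mechanism is that the language model continues the text, and the continuation is read off as the summary or translation.

```python
def summarization_prompt(article: str) -> str:
    # Append the TL;DR: cue and let the model continue with a summary.
    return article + "\nTL;DR:"

def translation_prompt(example_pairs, english_sentence: str) -> str:
    # Condition on "english = french" example pairs, then prompt with "english =".
    lines = [f"{en} = {fr}" for en, fr in example_pairs]
    lines.append(f"{english_sentence} =")
    return "\n".join(lines)

print(translation_prompt([("good morning", "bonjour"), ("thank you", "merci")],
                         "where is the library?"))
```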

Slide 17

Slide 17 text

GPT-2: Zero-shot results

[Chart: language modeling benchmark results for GPT1, BERT Large, …, GPT2]

Slide 18

Slide 18 text

No content

Slide 19

Slide 19 text

Input:
In a shocking finding, scientist discovered a herd of unicorns living in a remote, previously unexplored valley, in the Andes Mountains. Even more surprising to the researchers was the fact that the unicorns spoke perfect English.

Generated by GPT-2 XL:
The scientist named the population, after their distinctive horn, Ovid’s Unicorn. These four-horned, silver-white unicorns were previously unknown to science. Now, after almost two centuries, the mystery of what sparked this odd phenomenon is finally solved. Dr. Jorge Pérez, an evolutionary biologist from the University of La Paz, and several companions, were exploring the Andes Mountains when they found a small valley, with no other animals or humans. Pérez noticed that the valley had what appeared to be a natural fountain, surrounded by two peaks of rock and silver snow. However, Pérez also pointed out that it is likely that the only way of knowing for sure if unicorns are indeed the descendants of a lost alien race is through DNA. “But they seem to be able to communicate in English quite well, which I

Input:
Data Science Lightning Presentation Rules
- The subject should be something about data science.
- Total Length: 360 to 400 seconds
- Slide length: 10 to 40 seconds
- Prepare for 10 minutes of questions after each presentation.

Generated by GPT-2 small (talktotransformer.com):
- For each lecture, two slides should be provided. No single slide.
- For each session, two presentations should be provided. Each presentation should be 15 minutes long.
- At least one lecture should be on a topic specific to the topic at hand.
- No more than two questions should be asked in a single lecture.
- For each session, we need to create a question about any part of our dataset and ask it directly. The number of questions should be based upon the volume of data analyzed.
- The question should be short enough that anyone with a background in data science knows what it is.
- We should present each segment of the problem with some numbers:
  - The size of the dataset:
  - The number of points of each dataset as well as its number of data elements (like lines).
  - The distribution of all data points as follows:
    * - Random numbers.
    * - Normal

Slide 20

Slide 20 text

Prompt (human-written):
Recycling is good for the world. NO! YOU COULD NOT BE MORE WRONG!!

MODEL COMPLETION (MACHINE-WRITTEN, 25 TRIES):
Recycling is NOT good for the world. It is bad for the environment, it is bad for our health, and it is bad for our economy. I’m not kidding. Recycling is not good for the environment. It is destructive to the earth and it is a major contributor to global warming. Recycling is not good for our health. It contributes to obesity and diseases like heart disease and cancer. Recycling is bad for our economy. It increases the cost of a product, and in turn, the price of everything that is made with that product. Recycling is not good for our nation. We pay a tremendous price for the privilege of having the world’s most advanced and efficient recycling system. Recycling is a huge, colossal waste of time, energy, money, and resources. And THAT is why we need to get back to basics and get back to basics in our recycling efforts. One of the best ways to start is to look at the process of creating a paper product. When you make a paper product, it is basically a long chain of materials. Everything from the raw materials (wood, cardboard, paper, etc.), to the reagents (dyes, solvents, etc.) to ….

We can also imagine the application of these models for malicious purposes, including the following (or other applications we can’t yet anticipate):
● Generate misleading news articles
● Impersonate others online
● Automate the production of abusive or faked content to post on social media
● Automate the production of spam/phishing content

https://openai.com/blog/better-language-models

Slide 21

Slide 21 text

Future …