Slide 1

Slide 1 text

No content

Slide 2

Slide 2 text

Me Strata Conference Hadoop Summit Big Data Spain

Slide 3

Slide 3 text

Miguel Romero [short bio] @donkelito • Working in SDG Group. • My professional experience. • 7 years as Data Engineer • 6 years as Business Developer and Solution Architect in BIG Data & Advanced Analytics practice • 1 year as Big Data Head • Executive MBA • Not working on my PhD  • Big Data Master Professor • Member of M team. • Agile mode in each instant • Currently: Developing my speaker facet! #BDS15 #BDS17

Slide 4

Slide 4 text

W TF i s a m em e? • Big Data technology has brought an unprecedented explosion in unstructured data. • A meme (neologism modeled on gene) that is a shortening of mimeme (greek) that means imitated thing, is an attempt to explain the way cultural information spreads in terms of evolutionary principles between humans; • Internet memes are – a subset of this general meme concept specific to the culture and environment of the Internet. – a piece of art which spreads from Person to Person via the Internet which carries an additional property that ordinary memes do not (the media through which they propagate that renders them traceable and analyzable)

Slide 5

Slide 5 text

No content

Slide 6

Slide 6 text

No content

Slide 7

Slide 7 text

No content

Slide 8

Slide 8 text

No content

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

No content

Slide 11

Slide 11 text

No content

Slide 12

Slide 12 text

No content

Slide 13

Slide 13 text

No content

Slide 14

Slide 14 text

No content

Slide 15

Slide 15 text

No content

Slide 16

Slide 16 text

No content

Slide 17

Slide 17 text

No content

Slide 18

Slide 18 text

No content

Slide 19

Slide 19 text

No content

Slide 20

Slide 20 text

No content

Slide 21

Slide 21 text

No content

Slide 22

Slide 22 text

No content

Slide 23

Slide 23 text

Memes could be your advisor [fads & sensations]

Slide 24

Slide 24 text

The Bi g M em e I ndex Analyzing fads and sensations on the Internet

Slide 25

Slide 25 text

tw i tter D ata Sources A PI s # m em e Si tes D ata Col l ect D ata Fast A nal yti cs Stream M edi a Processor R D D D Stream FA Stream Media Collector [http source] FA Stream Media Collector [http source] M em e R eposi tory R D D { photo: ’ di stracted- boyfri end’ , sense: ‘ opi ni onchange’ , w ords: [{w : ‘ H I V E’ , p: 1}, {w : ’ D S’ , p: 2}, {w : ’ SPA R K ’ , p: 3}]} # spark # ds # hi ve Enri ch R eposi tory R D D D Stream D ata Servi ces m i croservi ces What do you have? What do you like? Breadth of the Influencers according to trend memes Cultural VS Education VS Trending Stream M edi a B uffer

Slide 26

Slide 26 text

Template matching is a technique for finding areas of an image that match (are similar) to a template image (patch). • Use the OpenCV function matchTemplate to search for matches between an image patch and an input image • Use the OpenCV function minMaxLoc to find the maximum and minimum values (as well as their positions) in a given array Tem pl ate m atchi ng

Slide 27

Slide 27 text

tesseract.exe staktreak_meme1_youwanadata.png staktreak_meme1_youwanadata.txt --oem 2 -l eng –psm 6 A You want data? 3 im} 3~ Here‘s your data O penCV i nterface Tesseract [plain text]

Slide 28

Slide 28 text

tw i tter D ata Sources A PI s # m em e Si tes D ata Col l ect D ata Fast A nal yti cs Stream M edi a Processor R D D D Stream FA Stream Media Collector [http source] FA Stream Media Collector [http source] M em e R eposi tory R D D { photo: ’ di stracted- boyfri end’ , sense: ‘ opi ni onchange’ , w ords: [{w : ‘ H I V E’ , p: 1}, {w : ’ D S’ , p: 2}, {w : ’ SPA R K ’ , p: 3}]} # spark # ds # hi ve Enri ch R eposi tory R D D D Stream D ata Servi ces m i croservi ces What do you have? What do you like? Breadth of the Influencers according to trend memes Cultural VS Education VS Trending Stream M edi a B uffer

Slide 29

Slide 29 text

Takeaw ay poi nt s • Memes are a vehicle whose information could be used as advisory • Memes are unstructured data and until now was not easy to analyze • With Big Data and Advance Analytics technology an architecture to analyze memes (image + text) in NRT or even video on live can be built.

Slide 30

Slide 30 text

B i g D ata Spai n 2018 i dea! W hat i s there i n com m on am ong the m usi c that data-l overs l i sten to?

Slide 31

Slide 31 text

Me #BDS17 #BDS18 2017