Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Y Soft Technology Hour - Ultimate Commodity Sup...

Sponsored · Ship Features Fearlessly Turn features on and off without deploys. Used by thousands of Ruby developers.

Y Soft Technology Hour - Ultimate Commodity Supercomputer

Presentation by Jiri Matela and Martin Jirman - Comprimato.

Source code to this presentation can be found at https://github.com/ysoftdevs/th/tree/master/2014_06_19_Comprimato_Ultimate_Commodity_supercomputer

Avatar for Y Soft Developers

Y Soft Developers

June 19, 2014
Tweet

More Decks by Y Soft Developers

Other Decks in Programming

Transcript

  1. The  Evolu-on  of  Compu-ng   Intel  ASCI  Red    

    1  TFLOPS     7904  CPUs     850  KW     150  m2  
  2. The  Evolu-on  of  Compu-ng   NVIDIA  GeForce     5.1

     TFLOPS     250W     296  cm2  
  3. The  Evolu-on  of  Compu-ng   Mobile  GPU     0.36

     TFLOPS     5W     0.14  cm2  
  4. Where  GPUs  Shine   Neural  networks  (Ne_lix)     GPU

     accelerated  database  query  (PgOpenCL)     Physics  (Games)     Ray  tracing  (FurryBall,  NVIDIA  Op-X)     Linear  Algebra  (CUBLAS)     Video  Compression      
  5. General-­‐purpose  compu-ng  on   graphics  processing  units  (GPU)   • 

    Video  Controller  -­‐>  GPU  (nvidia)   -­‐>  GPGPU   •  Shaders  Languages  -­‐>  CUDA    /  OpenCL   •  CUDA   – Compu-ng  architecture   – Programming  language  
  6. CUDA  Quick  Start  SLIDE   •  NVIDIA  GPU  –  GeForce

     (mobile),  Tesla,  Quadro     •  Win  /  Lin  /  Mac   •  NVIDIA  Driver   •  NVIDIA  Installer*   –  Toolkit   –  Samples   –  Tools   *  developer.nvidia.com/cuda-­‐downloads  
  7. CPU  Basic  Single  Thread  Implementa-on       GPU  Basic

     Implementa-on   (1pixel  =  1thread)  
  8. GPU  –  Final     1  thread  =  mul-ple  pixels

      private  array  (registers)   #pragma  unroll  
  9. Conclusion   •  Gaussian  blur   •  CPU  160  -­‐>

     100  ms   – Core  i5  –  4  Cores   •  GPU  16  -­‐>  3ms   – GeFroce  740m   – 2  SM   – 368  cuda  cores