pandas.from[0]

First presented at Kariera IT on 2021-03-06
Updated for Warsaw IT Days on 2021-04-07

Vitaliy Rudnytskiy

March 06, 2021

Transcript

  1. @sygyzmundovych, aka Vitaliy Rudnytskiy, Віталій Рудницький

    - Developer Advocate at SAP
    - All things data, with a focus on SAP HANA, SAP Data Intelligence, and analytics
    - Based in Wrocław, Poland (51°04'40.3"N 16°57'48.8"E, WGS84)
    - Organizer of Wrocław SAP Meetup
    - https://people.sap.com/vitaliy.rudnytskiy
    - Witalij Rudnicki (@Sygyzmundovych)
  2. What do pandas have in common with the black hole?

    source: https://iopscience.iop.org/article/10.3847/2041-8213/ab0c57
  3. What is pandas?

    “[…] we are concerned with data structures and tools for working with data sets in-memory, […]”

    “We hope that pandas will help make scientific Python a more attractive and practical statistical computing environment for academic and industry practitioners alike.”

    “pandas is a new Python library of data structures and statistical tools initially developed for quantitative finance applications. Most of our examples here stem from time series and cross-sectional data arising in financial modeling.”

    source: https://conference.scipy.org/proceedings/scipy2010/pdfs/mckinney.pdf
  4. What is pandas?

    “As a bit of background, I started building pandas in early 2008 during my tenure at AQR Capital Management, a quantitative investment management firm. At the time, I had a distinct set of requirements that were not well addressed by any single tool at my disposal…”

    “The pandas name itself is derived from panel data, an econometrics term for multidimensional structured datasets, and a play on the phrase Python data analysis itself.”

    “pandas provides high-level data structures and functions designed to make working with structured or tabular data fast, easy, and expressive.”

    source: https://pandas.pydata.org/about/
  5. What is pandas?

    - data analysis / manipulation in Python
    - working with structured or tabular data
    - open source
    - high-level data structures and functions
    - data sets in-memory
    - the most powerful … tool available in any language
  6. Traditional Data Analytics vs. Exploratory Data Analysis

    - Traditional Data Analytics: Modelling (Schema) → Data Collection → Data Analysis → Data Storytelling
    - Exploratory Data Analysis: Data Collection → Data Analysis → Modelling → Data Storytelling
  7. Data Analysis?? I want to be the Data Scientist!!

    source: https://www.forbes.com/sites/gilpress/2016/03/23/data-preparation-most-time-consuming-least-enjoyable-data-science-task-survey-says/
  9. The Global CTO Survey 2020 Report by STX Next

    source: https://www.stxnext.com/cto-survey-2020-report/
  13. How is it used in companies like SAP?

    - By our own data engineers, data scientists and data analysts, obviously
    - By our software engineers:
      - https://pypi.org/project/contextual-ai/
      - https://pypi.org/project/sailor/
      - https://pypi.org/project/hana-ml/

    https://www.youtube.com/watch?v=fSiVmL4S00w&list=PLSXNnd21oW416dQIZu5-XGx9K0806AUKN
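
The "What is pandas?" slide describes high-level data structures and functions for working with structured or tabular data held in memory. A minimal sketch of what that can look like in practice, assuming pandas is installed; the sample data and variable names below are invented for illustration and do not come from the talk:

```python
import pandas as pd

# Hypothetical sample data, invented for illustration only.
talks = pd.DataFrame(
    {
        "city": ["Wrocław", "Warsaw", "Wrocław", "Kraków"],
        "attendees": [40, 120, 55, 80],
    }
)

# A DataFrame is a table held in memory; a single column is a Series.
attendees = talks["attendees"]

# Aggregate: total attendees per city.
per_city = talks.groupby("city")["attendees"].sum()

# Filter: keep only the rows where attendance exceeds 50.
large = talks[talks["attendees"] > 50]

print(per_city)
print(large)
```

`DataFrame` and `Series` are the two core pandas structures; `groupby` and boolean-mask filtering are only two of many operations and were picked here because they are short enough to read on one screen.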