Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Seminar #41 - An Introduction to Exploratory v6.5 New Features

Seminar #41 - An Introduction to Exploratory v6.5 New Features

We have released Exploratory v6.5 on 4/19 with many exciting new features. Kan will introduce some of the new features including the following.

- Summary View: New Chart Types
- Analytics: Time Series Clustering
- Analytics: Updates for Correlation
- Multiple Excel / CSV Files Import
- Google Drive / S3 Support
- Contents Search inside Project

Kan Nishida

April 21, 2021
Tweet

More Decks by Kan Nishida

Other Decks in Technology

Transcript

  1. Kan Nishida CEO/co-founder Exploratory Summary In Spring 2016, launched Exploratory,

    Inc. to democratize Data Science. Prior to Exploratory, Kan was a director of product development at Oracle leading teams to build various Data Science products in areas including Machine Learning, BI, Data Visualization, Mobile Analytics, Big Data, etc. While at Oracle, Kan also provided training and consulting services to help organizations transform with data. @KanAugust Speaker
  2. 3 Data Science is not just for Engineers and Statisticians.

    Exploratory makes it possible for Everyone to do Data Science. The Third Wave
  3. 4 Questions Communication Data Access Data Wrangling Visualization Analytics (Statistics

    / Machine Learning) Data Analysis Data Science Workflow
  4. 5 Questions Communication (Dashboard, Note, Slides) Data Access Data Wrangling

    Visualization Analytics (Statistics / Machine Learning) Data Analysis ExploratoryɹModern & Simple UI
  5. • Project • Summary View • Analytics • Chart (Visualization)

    • Data Wrangling • Data Source Areas of New Features / Enhancements
  6. Now all the charts are shown inside the pop-up and

    you can scroll through when there are too many charts!
  7. Data Frame Export You can export a data frame including

    all the charts, analytics, and branches. Note that the data frames that are joined or merged with the exported data frame won’t be exported. They will need to be exported as separate files.
  8. You can import the exported data frame to reproduce the

    data frame with charts, analytics, and branches.
  9. • Multiple Excel / CSV Files Import • Google Drive

    • Amazon S3 • Enhancements for Google BigQuery Data Source
  10. If you want to apply the same setting to all

    the files instead of configuring one by one then you can click ‘OK for All’ button.
  11. You can ‘Skip’ to cancel a particular file import or

    ‘Skip All’ to cancel all the remaining files.
  12. Google Drive You can import the files (CSV / Excel)

    that are saved at Google Drive now!
  13. You can import the files as separate data frames or

    as a single data frame by merging them together.
  14. You can simply click on ‘Re-import’ button to import all

    the files that matches with the file selection condition.
  15. The Aggregation type shows you the aggregated values (mean or

    ratio) for each numeric or categorical value.
  16. The Distribution type shows you a distribution of a target

    variable for each numerical or categorical value.
  17. The Uncertainty type shows you the mean or the ratio

    with the confidence interval for each numerical or categorical value so that you can see if a given difference is significant or not.
  18. • Time Series Clustering under Analytics View • Correlation -

    Significance Test • Variable Importance with FIRM algorithm Analytics
  19. When you want to cluster the data based on the

    similarities of the raw values you want to set ‘Normalize Value’ to FALSE.
  20. But some times, you want to cluster the data not

    based on the similarity of t values but based on the similarity of the trend (ups and downs).
  21. Then, you can see how they are clustered under the

    ‘Time Series (Normalized)’ tab.
  22. New ‘Significance’ tab shows P Value and color each combination

    based on whether it is statistically significant or not.
  23. FIRM - Feature Importance Ranking Measure A new algorithm ‘FIRM’

    is added for calculating the variable importance.
  24. • Sorting Support for Stack Bar Chart • Enhancements for

    Word Cloud • Table: Support Date / Time Formatting • Summarize Table / Pivot Table: Grand Total Calculation Timing • Sorting Support for Cumulative Ratio Chart (Visualization)
  25. Let’s say you want to know ‘What are the countries

    that are top 80% of your sales?’
  26. With Exploratory Server, you can Share, Schedule, and Interact with

    all your Data and Insights (Chart, Analytics, Dashboard, Notes, and Slides). Exploratory Server
  27. You might have data or insights that you want to

    be updated when the data is updated (scheduled). You can now subscribe the email notification.