Slide 1

Slide 1 text

With Mark Hornick, Senior Director, Product Management, Data Science and Machine Learning @MarkHornick Marcos Arancibia, Product Mgr. Data Science and Big Data @MarcosArancibia oracle.com/goto/machinelearning Oracle Machine Learning Office Hours Oracle Machine Learning for Python Copyright © 2020, Oracle and/or its affiliates. All rights reserved

Slide 2

Slide 2 text

Topics • Upcoming session • Web Questions • Speaker Mark Hornick – Oracle Machine Learning for Python • Q&A Copyright © 2019 Oracle and/or its affiliates.

Slide 3

Slide 3 text

Next Session to look for: February 12th, 2020: Oracle Machine Office Hours, 9AM US Pacific Oracle Machine Learning for Spark Oracle Machine Learning for Spark offers interfaces to run Machine Learning algorithms on top of Data Lakes, using Spark to distribute computation across Nodes, and brings integration with the Big Data ecosystem that allows for manipulation tables in HIVE and Impala, as well as integration with HDFS and the Oracle Database, using the R language as front-end. Marcos Arancibia, Product Manager, Oracle Data Science and Big Data Mark Hornick, Senior Director, Product Management, Data Science and Machine Learning Copyright © 2019, Oracle and/or its affiliates. All rights reserved

Slide 4

Slide 4 text

Web Questions • “I'd love to join the Machine Learning for Python Office Hours, but it is at 4am my local time. Will it be recorded?” • How to get OML desktop/cloud version for testing? Copyright © 2019 Oracle and/or its affiliates.

Slide 5

Slide 5 text

For product downloads and more info… https://www.oracle.com/goto/machinelearning Copyright © 2020 Oracle and/or its affiliates.

Slide 6

Slide 6 text

Copyright © 2020 Oracle and/or its affiliates. https://www.oracle.com/cloud/free/

Slide 7

Slide 7 text

Today’s Session: Oracle Machine Learning for Python, with Demos with Mark Hornick • Learn about Oracle Machine Learning for Python and how it integrates the advantages of Python with the scalability and performance of Oracle Database while enabling Python functionality on database data as if they were native Python objects. Copyright © 2019, Oracle and/or its affiliates. All rights reserved

Slide 8

Slide 8 text

Oracle Machine Learning for Python (OML4Py) Mark Hornick Senior Director, Product Management Data Science and Machine Learning Copyright © 2020 Oracle and/or its affiliates. Oracle Machine Learning Office Hours

Slide 9

Slide 9 text

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, timing, and pricing of any features or functionality described for Oracle’s products may change and remains at the sole discretion of Oracle Corporation. Safe Harbor Copyright © 2020 Oracle and/or its affiliates.

Slide 10

Slide 10 text

What is Python? An interpreted, object-oriented, high level, general purpose programming language Designed for rapid application development and scripting to connect existing components Open source: https://www.python.org Created in the late 1980s World-wide usage Widely taught in Universities Many Data Scientists know and use Python Thousands of open source packages to enhance productivity, especially in the areas of data science and machine learning Copyright © 2020 Oracle and/or its affiliates.

Slide 11

Slide 11 text

d Oracle Machine Learning OML Microservices* Supporting Oracle Applications Image, Text, Scoring, Deployment, Model Management * Coming soon OML4SQL Oracle Advanced Analytics SQL API OML4Py* Python API OML4R Oracle R Enterprise R API OML Notebooks with Apache Zeppelin on Autonomous Database OML4Spark Oracle R Advanced Analytics for Hadoop Oracle Data Miner Oracle SQL Developer extension Copyright © 2020 Oracle and/or its affiliates.

Slide 12

Slide 12 text

Traditional Python and Database Interaction Access latency Memory limitation – data size Single threaded Paradigm shift: Python  SQL  Python Issues for backup, recovery, security Ad hoc production deployment Database Flat Files extract / export read export load SQL mxODBC, pyodbc, turboodbc, JayDeBeApi, cx_Oracle Copyright © 2020 Oracle and/or its affiliates.

Slide 13

Slide 13 text

Oracle Machine Learning for Python Coming soon to Oracle Autonomous Database and Oracle Database Use Oracle Database as HPC environment Derive even more value from Oracle Database / Exadata investment Transform data faster and at scale Use in-database parallel and distributed ML algorithms Build more models on more data – faster Score large volume data – faster Increased productivity from automated data preparation, partitioned models, integrated text mining Manage Python scripts and Python objects in Oracle Database Collaborate: Hand-off data science products from data scientist to application developer easily New automated machine learning Enhance data scientist productivity Enable non-experts to use and benefit from machine learning Produce more accurate models faster Database Server Machine Client SQL Interfaces SQL*Plus SQLDeveloper OML4Py Copyright © 2020 Oracle and/or its affiliates. On-premises architecture Empower data scientists with open source environments

Slide 14

Slide 14 text

Transparency layer Leverage proxy objects so data remain in database Overload native functions translating functionality to SQL Use familiar Python syntax on database data Parallel, distributed algorithms Scalability and performance Exposes in-database algorithms available from OML4SQL Embedded execution Manage and invoke Python user-defined functions in Oracle Database Data-parallel, task-parallel, and non-parallel execution Use open source packages to augment functionality OML4Py AutoML Model selection, feature selection, hyper-parameter tuning Database Server Machine SQL Interfaces SQL*Plus SQLDeveloper OML4Py Oracle Machine Learning for Python Copyright © 2020 Oracle and/or its affiliates. On-premises architecture Client

Slide 15

Slide 15 text

Demonstration Using open source Zeppelin notebook environment Copyright © 2020 Oracle and/or its affiliates. OML4Py Beta installed on Exadata with Oracle Database

Slide 16

Slide 16 text

For more information… https://www.oracle.com/goto/machinelearning Copyright © 2020 Oracle and/or its affiliates.

Slide 17

Slide 17 text

Thank You Mark Hornick | [email protected] Senior Director, Product Management Data Science and Machine Learning