Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Oracle Machine Learning for Python

Oracle Machine Learning for Python

Learn about the Oracle Machine Learning for Python and how it integrates the advantages of Python with the scalability and performance of Oracle Database while enabling Python functionality on database data as if they were native Python objects.

Users transparently move from Python functions written for single core to overloaded functions leveraging database parallelism and scalability. Oracle Machine Learning for Python allows users to manipulate data in Oracle Database tables and views using Python syntax and functions, but translating Python functionality into SQL for in-database execution.

Users develop and operationalize comprehensive scripts for analytical applications without leaving the Python environment. Directly integrate user-defined Python scripts into applications and dashboards by immediately invoking Python scripts from SQL. This drastically reduces time-to-market by eliminating porting Python code and developing custom infrastructure, while enabling immediate updates to application code.

Marcos Arancibia

January 08, 2020
Tweet

More Decks by Marcos Arancibia

Other Decks in Technology

Transcript

  1. With Mark Hornick, Senior Director, Product Management, Data Science and

    Machine Learning @MarkHornick Marcos Arancibia, Product Mgr. Data Science and Big Data @MarcosArancibia oracle.com/goto/machinelearning Oracle Machine Learning Office Hours Oracle Machine Learning for Python Copyright © 2020, Oracle and/or its affiliates. All rights reserved
  2. Topics • Upcoming session • Web Questions • Speaker Mark

    Hornick – Oracle Machine Learning for Python • Q&A Copyright © 2019 Oracle and/or its affiliates.
  3. Next Session to look for: February 12th, 2020: Oracle Machine

    Office Hours, 9AM US Pacific Oracle Machine Learning for Spark Oracle Machine Learning for Spark offers interfaces to run Machine Learning algorithms on top of Data Lakes, using Spark to distribute computation across Nodes, and brings integration with the Big Data ecosystem that allows for manipulation tables in HIVE and Impala, as well as integration with HDFS and the Oracle Database, using the R language as front-end. Marcos Arancibia, Product Manager, Oracle Data Science and Big Data Mark Hornick, Senior Director, Product Management, Data Science and Machine Learning Copyright © 2019, Oracle and/or its affiliates. All rights reserved
  4. Web Questions • “I'd love to join the Machine Learning

    for Python Office Hours, but it is at 4am my local time. Will it be recorded?” • How to get OML desktop/cloud version for testing? Copyright © 2019 Oracle and/or its affiliates.
  5. Today’s Session: Oracle Machine Learning for Python, with Demos with

    Mark Hornick • Learn about Oracle Machine Learning for Python and how it integrates the advantages of Python with the scalability and performance of Oracle Database while enabling Python functionality on database data as if they were native Python objects. Copyright © 2019, Oracle and/or its affiliates. All rights reserved
  6. Oracle Machine Learning for Python (OML4Py) Mark Hornick Senior Director,

    Product Management Data Science and Machine Learning Copyright © 2020 Oracle and/or its affiliates. Oracle Machine Learning Office Hours
  7. The following is intended to outline our general product direction.

    It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, timing, and pricing of any features or functionality described for Oracle’s products may change and remains at the sole discretion of Oracle Corporation. Safe Harbor Copyright © 2020 Oracle and/or its affiliates.
  8. What is Python? An interpreted, object-oriented, high level, general purpose

    programming language Designed for rapid application development and scripting to connect existing components Open source: https://www.python.org Created in the late 1980s World-wide usage Widely taught in Universities Many Data Scientists know and use Python Thousands of open source packages to enhance productivity, especially in the areas of data science and machine learning Copyright © 2020 Oracle and/or its affiliates.
  9. d Oracle Machine Learning OML Microservices* Supporting Oracle Applications Image,

    Text, Scoring, Deployment, Model Management * Coming soon OML4SQL Oracle Advanced Analytics SQL API OML4Py* Python API OML4R Oracle R Enterprise R API OML Notebooks with Apache Zeppelin on Autonomous Database OML4Spark Oracle R Advanced Analytics for Hadoop Oracle Data Miner Oracle SQL Developer extension Copyright © 2020 Oracle and/or its affiliates.
  10. Traditional Python and Database Interaction Access latency Memory limitation –

    data size Single threaded Paradigm shift: Python  SQL  Python Issues for backup, recovery, security Ad hoc production deployment Database Flat Files extract / export read export load SQL mxODBC, pyodbc, turboodbc, JayDeBeApi, cx_Oracle Copyright © 2020 Oracle and/or its affiliates.
  11. Oracle Machine Learning for Python Coming soon to Oracle Autonomous

    Database and Oracle Database Use Oracle Database as HPC environment Derive even more value from Oracle Database / Exadata investment Transform data faster and at scale Use in-database parallel and distributed ML algorithms Build more models on more data – faster Score large volume data – faster Increased productivity from automated data preparation, partitioned models, integrated text mining Manage Python scripts and Python objects in Oracle Database Collaborate: Hand-off data science products from data scientist to application developer easily New automated machine learning Enhance data scientist productivity Enable non-experts to use and benefit from machine learning Produce more accurate models faster Database Server Machine Client SQL Interfaces SQL*Plus SQLDeveloper OML4Py Copyright © 2020 Oracle and/or its affiliates. On-premises architecture Empower data scientists with open source environments
  12. Transparency layer Leverage proxy objects so data remain in database

    Overload native functions translating functionality to SQL Use familiar Python syntax on database data Parallel, distributed algorithms Scalability and performance Exposes in-database algorithms available from OML4SQL Embedded execution Manage and invoke Python user-defined functions in Oracle Database Data-parallel, task-parallel, and non-parallel execution Use open source packages to augment functionality OML4Py AutoML Model selection, feature selection, hyper-parameter tuning Database Server Machine SQL Interfaces SQL*Plus SQLDeveloper OML4Py Oracle Machine Learning for Python Copyright © 2020 Oracle and/or its affiliates. On-premises architecture Client
  13. Demonstration Using open source Zeppelin notebook environment Copyright © 2020

    Oracle and/or its affiliates. OML4Py Beta installed on Exadata with Oracle Database