Skip to Main Content

Oracle Machine Learning Office Hours

Free tips and training every month! Subscribe for reminders and more from Office Hours. FAQ

Header container

August 03

15:00 UTC   Start Times Around the World

Subscribe to be notified of changes to sessions and give us feedback!

Having trouble watching the video on this page? Open the video in your browser.

Description

ML Concepts - Using Cross-Validation with OML in-Database and with Embedded Python Execution
On this weekly Office Hours for Oracle Machine Learning on Autonomous Database, Jie Liu, Data Scientist for Oracle Machine Learning, will cover Cross-Validation methods, why they are useful and how to run it using in-Database methods using OML and also Embedded Python Execution open-source methods. He will also present a demo running on a notebook with OML4Py.

The Oracle Machine Learning product family supports data scientists, analysts, developers, and IT to achieve data science project goals faster while taking full advantage of the Oracle platform.

The Oracle Machine Learning Notebooks offers an easy-to-use, interactive, multi-user, collaborative interface based on Apache Zeppelin notebook technology, and support SQL, PL/SQL, Python and Markdown interpreters. It is available on all Autonomous Database versions and Tiers, including the always-free editions.

OML includes AutoML, which provides automated machine learning algorithm features for algorithm selection, feature selection and model tuning, in addition to a specialized AutoML UI exclusive to the Autonomous Database.

OML Services is also included in Autonomous Database, where you can deploy and manage native in-database OML models as well as ONNX ML models (for classification and regression) built using third-party engines, and can also invoke cognitive text analytics.

Video Highlights:
00:51 Topics for today
01:18 OML Services Cognitive Text new Italian Language capabilities
03:47 Main Topic - Cross-Validation in OML4Py
04:48 Motivation for Cross-Validation
07:00 Techniques for Validation: Train/Test Split
10:43 Techniques for Validation: Leave one Out Validation
11:37 Techniques for Validation: K-Fold Cross-Validation
14:00 K-Fold Cross-Validation in OML4Py
14:58 Live Demo of OML4Py Cross-validation Notebook
26:26 Q&A

Your Experts

  • #SELECTION#
    Marcos Arancibia

    Marcos Arancibia   

    Marcos Arancibia is the Product Manager for Oracle Machine Learning, working with Machine Learning in the Oracle Database and on Spark. He develops product strategy, roadmap prioritization, product positioning and product evangelization, helping define the product roadmap for Oracle Machine Learning. Before joining Oracle in 2010 he spent 13 years at SAS Institute Inc., from Country Manager in LAD to Regional Data Mining lead in the US. He holds a bachelor's degree with additional courses in the master's degree, both in Statistics from UNICAMP in Brazil. He has Certifications from Stanford on AI and Machine Learning, and from the University of Washington on Computational Neuroscience.
    #MISC#
    #ACTIONS#
  • #SELECTION#
    Jie Liu

    Jie Liu

    Jie Liu is a data scientist. He works with Oracle Machine Learning Product Management team to develop marketing content for OML products and deliver data science solutions for customers inside and outside Oracle. Before joining Oracle, he was a data scientist in Epsilon developing machine learning driven real time bidding strategy and application for online advertisement. He obtained his PhD in Electrical Engineering from University of Notre Dame.
    #MISC#
    #ACTIONS#
  • #SELECTION#
    Mark Hornick

    Mark Hornick   

    Mark Hornick is the Senior Director of Product Management for the Oracle Machine Learning (OML) family of products. He leads the OML PM team and works closely with Product Development on product strategy, positioning, and evangelization, Mark has over 20 years of experience with integrating and leveraging machine learning with Oracle technologies, working with internal and external customers in the application of Oracle’s machine learning technologies for scalable and deployable data science projects. Mark is Oracle’s representative on the R Consortium’s Board of Directors, an Oracle Adviser and founding member of the Business Intelligence Warehousing and Analytics (BIWA) User Community, and Content Selection Committee Chair for the Analytics and Data Summits.
    #MISC#
    #ACTIONS#

All Sessions