Skip to Main Content

Oracle Machine Learning Office Hours

Free tips and training every month! Subscribe for reminders and more from Office Hours.

Header container

February 18, 2020

Oracle Machine Learning for Spark
Oracle Machine Learning for Spark offers interfaces to run Machine Learning algorithms on top of Data Lakes, using Spark to distribute computation across Nodes, and brings integration with the Big Data ecosystem that allows for manipulation tables in HIVE and Impala, as well as integration with HDFS and the Oracle Database, using the R language as front-end.

It makes the open source R scripting language and environment ready for the enterprise and big data. Designed for problems involving both large and small volumes of data, Oracle Machine Learning for Spark integrates R with Data Lakes, allowing users to execute R commands and scripts for data processing, statistical and machine learning analytics on HIVE, IMPALA, Spark DataFrame tables and views using R and Spark SQL syntax. Many familiar R functions are overloaded and translate R functions into SQL for in-Data Lake execution.

Oracle Machine Learning consists of complementary components supporting scalable machine learning algorithms for in-database and big data environments (including Cloud and on-premises), notebook technology, SQL, Python and R APIs, and Hadoop/Spark environments.

This session's over, but a recording of the session is not yet available.

Check out the resources at the bottom of the page to explore great websites packed with lots more information - and answers to questions.

Ask a Question

Ask The Experts - Right Now!

Do you have a question about Oracle Machine Learning you'd like our experts to answer in their next session? Sign in and submit it here.

Please note that we cannot guarantee to answer all questions. We cannot help you with open Service Requests or account/licensing issues.

Experts

Your Experts
Marcos Arancibia
Marcos Arancibia, Product Manager, Data Science and Big Data    
Marcos Arancibia is the Product Manager for Oracle Data Science and Big Data. He works with Machine Learning in the Oracle Database and on Big Data clusters under Hadoop and Spark, on premises and in the Oracle Cloud. He works within Product Management to develop product strategy, roadmap prioritization, product positioning and product evangelization, working closely with the engineering team in defining the product roadmaps for Oracle Machine Learning and Big Data in the Cloud. Before joining Oracle 9 years ago he was at SAS Institute Inc. for 13 years as a Data Mining architect and expert in the US and Latin America. He holds a Bachelor Degree of Science in Statistics with additional courses in the Master of Science in Statistics, both from UNICAMP in Brazil. He has Certifications from Stanford on AI and Machine Learning, and from the University of Washington on Computational Neuroscience. He is an expert on Deep Learning and passionate about Machine Learning.
Mark Hornick
Mark Hornick, Senior Director, Product Management, Data Science and Machine Learning    
Mark Hornick is the Senior Director of Product Management for the Oracle Machine Learning (OML) family of products. He leads the OML PM team and works closely with Product Development on product strategy, positioning, and evangelization, Mark has over 20 years of experience with integrating and leveraging machine learning with Oracle technologies, working with internal and external customers in the application of Oracle’s machine learning technologies for scalable and deployable data science projects. Mark is Oracle’s representative on the R Consortium’s Board of Directors, an Oracle Adviser and founding member of the Business Intelligence Warehousing and Analytics (BIWA) User Community, and Content Selection Committee Chair for the Analytics and Data Summits.