Learn with Luke from Advancing Analytics
As an extension of the popular YouTube channel, Advancing Spark, the Advancing Analytics Academy delves deeper into data science and data engineering, offering self-paced courses to enhance your skills.
Luke Menzies is a Senior Data Scientist who's worked at Advancing Analytics for 2 years.
Why Machine Learning with Spark?
This course is designed to give you an overview on using Spark for Machine Learning. It includes an intro to Machine Learning and data science, before delving deeper into feature engineering and selection, metrics, pipelines, tuning, and ensemble modelling. The course provides real-world examples to allow you to understand Machine Learning in Spark in applied context.
We use a three-step method - all concepts are explained, then demonstrated, before you try it out for yourselves.
Example Curriculum
- Module Introduction (2:58)
- Problems encountered by data scientists (19:39)
- Terminology (8:32)
- Introduction to regression (11:02)
- Introduction to classification (9:04)
- Introduction to clustering (9:12)
- Labs i - Introduction to regression (14:07)
- Labs ii - Introduction to classification (14:12)
- Labs iii - Introduction to clustering (14:03)
- Module Introduction (3:30)
- Feature scaling and encoding (8:41)
- Labs - Feature scaling and encoding (18:46)
- Handling and imputing missing values (5:03)
- Labs - Handling and imputing missing values (9:05)
- Feature selection and dimensionality reduction (15:50)
- Labs - Feature selection and dimentionality reduction (16:01)
Our other courses
Why not check out some of our other courses?