Get Started with RapidMiner & Machine Learning
This track of RapidMiner Tutorials introduces you to RapidMiner's Studio and platform, while also explaining many important data science concepts.
Sections
-
General
RapidMiner Studio - GUI Intro
Elective Video
Viewing time: ~6m Find out what features the graphical user interface (GUI) of RapidMiner provides and how the different elements can be used to start your first process.
Visualizing Data In RapidMiner
Elective Video
Viewing time: ~7m Learn how to visualize results in RapidMiner Studio to help you find outliers and get interesting stats with data analytics.
The RapidMiner MarketPlace - Extensions
Elective Video
Viewing time: ~3m Extensions allow you to add new functionalities to processes in Studio, like text mining, web crawling, or integration with Python and R. Learn how to install an extension in this video.
-
Data access and preparation
Data Loading via a Process
Elective Video
Viewing time: ~6m Learn how to open an Excel spreadsheet in RapidMiner Studio. Using the 'Read Excel' operator, you can always get the latest data for your project.
Importing Data in RapidMiner Studio
Elective Video
Viewing time: ~7m See how to import an Excel spreadsheet into RapidMiner Studio via the generic 'Import Data' functionality.
Turbo Prep - Introduction
Elective Video
Viewing time: ~6m This is an introduction to RapidMiner's guided ETL tool, Turbo Prep, showing how it can save you a lot of data preparation time.
Turbo Prep - Data Cleansing
Elective Video
Viewing time: ~7m RapidMiner's Turbo Prep allows you to fix most of your data quality issues very quickly via a GUI. It can even apply automatic cleansing for you.
Turbo Prep - Merging Data
Elective Video
Viewing time: ~5m Watch this video to learn how to join (add columns) and append (add rows) data to your primary data set. You can also join multiple data sets if you want.
Turbo Prep - Data Pivoting
Elective Video
Viewing time: ~3m Pivoting with Turbo Prep is a fast and easy way to identify patterns by calculating aggregates or to rearrange your data to fit your analytics needs.
Data Preparation
Elective Video
Viewing time: ~9m Find out how a set of data preparation and cleansing techniques is applied to a data set with common issues like leading and trailing blanks, duplicates,…
Connecting to Databases
Elective Video
Viewing time: ~11m This video shows how to configure database connection objects and how to create a connection which can be shared and reused as a template.
-
Machine Learning
Creating a 'Decision Tree' Model
Elective Video
Viewing time: ~6m In order to create a decision tree model, we first clean out missing examples; then we are ready to create and interpret a first model.
Applying the Model
Elective Video
Viewing time: ~6m Here we apply a model to data that does not have any flags yet. This is called scoring and is another very important step in machine learning.
Testing a Model
Elective Video
Viewing time: ~5m Once a model is created, you will want to know how well it performs. Assessing and testing its performance is what this video is about. Prepare for a surprise!
Validating a Model
Elective Video
Viewing time: ~7m Using the same data for model training and testing is wrong, except with cross-validation, which splits the data into multiple training and testing sets.
Finding the Right Model
Elective Video
Viewing time: ~8m Configuring one model is already a task, but there are many models which could work well for your problem. This video shows how to narrow down your choice.
Optimization of the Model Parameters
Elective Video
Viewing time: ~6m If you want a high-performing model, you will need to tune its parameters. This module shows how to automate the tuning.
Automated Model Selection and Optimization
Elective Video
Viewing time: ~7m Automatically optimize the parameters of your selected models and then find the best one. Watch this video if you want RapidMiner Studio to do it all for you.
Auto Model - Classification
Elective Video
Viewing time: ~9m Guided classifications or rapid prototyping can be easily achieved with RapidMiner's Auto Model. It can also be used for model comparison and optimization.
Auto Model - Clustering & Outliers
Elective Video
Viewing time: ~5m Clustering is often used to group customers or clients. If you want to try it, Auto Model is the tool for you. The second half is all about outlier detection.
Interactive Decision Trees
Elective
-
Deployment and scaling
Collaborating using AI Studio and AI Hub - Intro
Elective Video
Viewing time: ~10m This is an introductory demo on how to collaborate and then deploy your Studio processes using AI Hub.
Panopticon Visualizations
Elective Video
Viewing time: ~4m Use Altair RapidMiner Panopticon to visualize the results of endpoint deployment on Altair RapidMiner AI Hub.
AI Hub Components Overview
Elective Video
Viewing time: ~5m Developing processes is step one, but then what? This module provides an intro for power users to get an understanding of the available options in AI Hub.
AI Hub Deployment with Docker
Elective Video
Viewing time: ~5m In this tutorial we demo a full RapidMiner AI Hub deployment using Docker Compose. For easy setup you can download the .yml and .env files from our docs pages.
Introducing RapidMiner Radoop
Elective Video
Viewing time: ~9m See how you can use RapidMiner Studio's visual workflow design for code-free data prep & machine learning, leveraging the power of your Hadoop cluster.