Machine Learning in R (advanced course)

Malvika Sharan   2018-06-12

Date(s) - 2018-06-12 - 2018-06-13
09:30 CEST - 17:30 CEST

Tutors and helpers

Organised by the Bio-IT Project

Course Information

This two-day course, delivered by experts in programming for data analysis, will teach participants the principles of machine learning and their implementation in R using the mlr package. The main goal of mlr is to provide a unified interface for machine learning tasks such as classification, regression, cluster analysis and survival analysis in R.

Sessions will be driven by many practical exercises and case studies. The schedule and course materials will be added here.

Prior to the course, on June 11, 2018, at 2:00 PM in the Small Operon, Prof. Bernd Bischl will deliver an open seminar. Title TBA.

Course Content

This 2-day course will cover the following topics and sessions:

Sessions on Day-1

  1. Introduction to Machine Learning, mlr, and KNN, and their application to a biological dataset
  2. Linear models, regression, regularization, and trees
  3. Evaluation (train, test, ROC), and its application to microbiome-based cancer detection
  4. Hands-on session: application to a new dataset (see prerequisite #3)

Sessions on Day-2

  1. Forests and boosting with a demo
  2. Tuning and nested resampling with a demo
  3. Interpretable machine learning and feature selection
  4. Hands-on session: application to the dataset from Session 4 of Day-1


Prerequisites

    1. The course is aimed at participants, preferably with some knowledge of statistics and data modeling, who want to learn more about machine learning and its application and implementation through hands-on sessions and use cases. Participants are expected to understand the concepts described in these materials before the workshop.
    2. Participants are expected to bring their own laptop with R version 3.3.2 or later installed.
    3. Please create a Kaggle account for the hands-on sessions.

    Optional: Participants can look at the mlr tutorial to get a head start, although this material will also be covered in the lectures.



Please note that the maximum capacity for the course is 30 participants and registration is required to secure a place. If you have any questions, please contact Malvika Sharan and Bernd Klaus.

Course Fee:

This course will be offered for free to all EMBL members.

External participants will be charged a course fee of 100 Euros. The invoice details will be shared via email.

Cancellation and No-Show:

Registration can be canceled free of charge until June 2, 2018.

Participants who cancel after June 2, 2018, or who do not show up, will be charged a fee of 50 Euros. The invoice details will be shared via email.

Course Material / Etherpad

The course material can be accessed via this ownCloud link:

About Malvika Sharan

I am a Community Outreach Coordinator for Bio-IT and Training Coordinator for ELIXIR Germany. Find me at EMBL-HD office 101 or at the Bio-IT drop-in sessions on Tuesdays, 10:00-12:00, at the EMBL staff lounge.