Skip to main content

Data Mining in Practice (3 cr)

Code: TX00FB50-3002

General information


Enrollment

02.05.2023 - 03.08.2023

Timing

07.08.2023 - 11.08.2023

Number of ECTS credits allocated

3 op

Mode of delivery

Contact teaching

Unit

School of ICT

Campus

Leiritie 1

Teaching languages

  • English

Seats

0 - 40

Degree programmes

  • Degree Programme in Information Technology

Teachers

  • Daniel Rodriguez Garcia

Groups

  • ICTSUMMER
    ICT Summer School

Objective

By the end of the module, students should be able to:
- Develop an appreciation for what is involved in machine learning (data mining) from data
- Understand a wide variety of learning algorithms
- Understand how to evaluate models generated from data
- Apply the algorithms to solve real problems, optimize the models learned and report on the expected performance

Transferable skills:
- Mathematical analysis of learning methods.
- Evaluation of algorithms.
- Programming skills in Python

Content

This course aims to provide students with an in-depth introduction to the main topics of Machine Learning.

It will cover some of the main models and algorithms for regression, classification and clustering. Topics such as linear and logistic regression, classification trees, rules, SVMs, neural networks, clustering, feature selection and dimensionality reduction. Visualisation and evaluation of machine
learning models.

Materials

Bibliography
Jake VanderPlas. Python Data Science Handbook,
https://jakevdp.github.io/PythonDataScienceHandbook/
Ian Witten, Eibe Frank, Mark Hall and Chris Pal, Data Mining: Practical Machine Learning Tools
and Techniques, 4th Edt, 2016

Other bibliography
Mitchell T, Machine Learning, McGraw-Hill, 1997
S. Rogers and M. Girolami, A first course in Machine Learning, CRC Press, 2011
C. Bishop, Pattern Recognition and Machine Learning, 2007
D. Barber, Bayesian Reasoning and Machine Learning, 2012

Other online references
https://www.w3schools.com/python/python_ml_getting_started.asp
https://github.com/rasbt/python-machine-learning-book-3rd-edition

Further information

Students should bring their own laptop.

Evaluation scale

0-5

Prerequisites

The course will use Python and/or R programming languages.
Some familiarity with linear algebra, probability theory.