Building a Microservice Based Big Data PipelineLaajuus (3 ECTS)

Course unit code: TX00EL48

General information

Credits: 3 ECTS

Objective

This practical, hands-on course will introduce you to the world of big data. The idea is to create a complete pipeline. Starting with gathering the data and ending with simple data analysis and visualizations. The components will be set-up in a microservice based architecture.

At the beginning of each session we will first see some theoretical background information. We will always keep a practical approach in mind. You will learn new technology and software concepts. Critical questions will challenge you and give you new insights.

Content

Linux, daily use:
- Introduction to Ubuntu Server, the command line interface (CLI)
- Shortcuts and tools, the ones you really should know about!
- Basic text-processing tools

Containerization with Docker:
- Containers & Docker Containers
- Microservices vs. Monoliths
- Virtualization vs. Containerization
- Docker eco-system
- Best-practices

Message queue – Kafka:
- What is a message queue?
- Kafka key concepts
- Microservice example: image recognition

Elastic Stack:
- Components, quick introduction
- Working with Elasticsearch and Kibana
- Simple text-analysis
- Simple visualizations

Qualifications

Basic Python programming skills
Linux (Ubuntu) knowledge is a plus

Assessment criteria, satisfactory (1)

Attending all the labs, completing all the assignments in-time. Able to explain the created pipeline.

Assessment criteria, good (3)

Attending all the labs, completing all the assignments in-time. Able to explain the created pipeline.
Completed some of the extra exercise.

Assessment criteria, excellent (5)

Attending all the labs, completing all the assignments in-time. Able to explain the created pipeline.
Completed all the extra exercises.

Assessment criteria, approved/failed

Active attendance on all lab sessions.
Completing practical exercises.