Big Data (2021/2022)

Course code
cod wi: DT000243
Name of lecturer
Damiano Carra
Coordinator
Damiano Carra
Number of ECTS credits allocated
3
Academic sector
ING-INF/05 - INFORMATION PROCESSING SYSTEMS
Language of instruction
Italian
Location
VERONA
Period
A.A. 21/22 dottorato dal Oct 1, 2021 al Sep 30, 2022.

Lesson timetable

Go to lesson schedule

Learning outcomes

The course offers an overview of the fundamental concepts of distributed computing systems that deal with very large datasets, together with the programming paradigms adopted by these systems. In particular, it will discuss the MapReduce paradigm, and its implementation in Spark. In addition, the system aspects of the distributed computation will be presented, including the data center architectures, and the solutions for storing such large datasets.

Syllabus

- Introduction to the course
- The MapReduce programming paradigm
- Apache Hadoop and Apache Spark
- Non-relational databases for Big Data
- Datacenter architectures

Reference books

Vedi la bibliografia dell'insegnamento

Assessment methods and criteria

The exam consists in carrying out a project in which the principles presented in class are applied.