Big Data Systems and Analytics (2019/2020)

Course code
DT000049
Name of lecturers
Elisa Quintarelli, Sara Migliorini
Coordinator
Elisa Quintarelli
Number of ECTS credits allocated
5
Academic sector
ING-INF/05 - INFORMATION PROCESSING SYSTEMS
Language of instruction
Italian
Location
VERONA
Period
Academic year 2019/20, PhD programme, from Oct 1, 2019 to Sep 30, 2020.

Learning outcomes

The course offers an overview of the characteristics and challenges of Big Data problems, applications, and systems. Starting from the so-called 5 Vs of Big Data (volume, velocity, variety, variability, and value), it focuses on the most widely used framework, Apache Hadoop, and on next-generation systems such as Apache Spark, highlighting the differences between a traditional Database Management System and a Big Data Management System. The course also introduces a spatial extension of Hadoop.

Syllabus

• Introduction to the course
• The MapReduce programming paradigm and Apache Hadoop
• Apache Spark
• The Hadoop Ecosystem
• SpatialHadoop: a spatial extension to Apache Hadoop
• Advanced Indexing and Partitioning in Hadoop
• DBMSs for Big Data
o Relational and non-relational databases for Big Data
o MongoDB: an example of a NoSQL DBMS
• Challenges in the Big Data Era

The course will cover both theoretical and practical aspects.