Learning objectives
This course shows how to help organizations collect, store, analyze and interpret large-scale data in order to develop informed business strategies. It provides a framework that improves students' understanding of data analytics and strengthens their critical thinking and decision making. In particular, students will learn to recognize business problems, understand data collection techniques and the principles of data analysis, take data from the technical domain and bridge the gap between that domain and business analysts, analyze data and present valuable findings, and recommend actions to business leaders.
Prerequisites
Statistics, computer programming, machine learning
Expected learning outcomes
Knowledge and understanding:
The course presents tools to collect, organize, model and process data from both a theoretical and a practical point of view. On the practical side, software tools are used extensively throughout the course.
Applying knowledge and understanding:
Students are expected to use the tools presented in the course to work with data from several fields. On the one hand, they should be able to process the data to extract useful information; on the other, they should be able to present their findings and use them to inform business decisions. To develop and evaluate these skills, students will take part in several practical lab sessions and work on project assignments.
Making judgements:
Students are expected to identify the problem they face and define it properly.
They should be able to decide which models are most suitable for the defined problem and how to use them to identify and process useful data.
Communication skills:
Students are expected to organize and present their findings clearly. They should understand the language and tools of the technical domain and be able to provide recommendations supported by quantitative results.
Learning skills:
The course is intended to give students the tools to cope with “real world” scenarios. After the course they should have sharpened their critical thinking and become more independent in approaching problems. They should be able to support their arguments with evidence based on data and mathematical models.
Course contents
The course focuses on the collection, exploration, analysis and visualization of data and on the presentation of results, with a hands-on approach.
Emphasis will be given to applications.
During the course the R programming language will be presented and extensively used.
Reference texts
Jank W. (2011) Exploring and Discovering Data. In: Business Analytics for Managers. Use R! Springer, New York, NY.
James G., Witten D., Hastie T., Tibshirani R. (2017) An Introduction to Statistical Learning with Applications in R. Springer, New York, NY.
Teaching methods
Lectures; lab sessions
Assessment methods
Two individual quizzes (15% overall)
Midterm group project (35%)
Final group project assignment and oral presentation (50%)
Criteria for assigning the final thesis
Interview with the candidate
Week 1
Introduction to the R programming language
Practical lab sessions
Week 2
Data collection and preparation
Practical lab sessions
Week 3
Linear regression
Practical lab sessions
Week 4
Multiple linear regression
Practical lab sessions
Week 5
Classification methods, clustering
Practical lab sessions
Week 6
Model assessment, resampling methods
Practical lab sessions
Week 7
Regularization methods
Practical lab sessions
Week 8
Midterm week
Week 9
Model tuning
Practical lab sessions
Week 10
Nonlinear methods
Practical lab sessions
Week 11
Tree-based methods
Practical lab sessions
Week 12
Reporting and presentation of results