BIG DATA SCIENCE & GOVERNANCE
Instructional goals
As part of the major in Strategic Management of Innovation and Sustainability, the course in Big Data Science and Governance offers an integrated and multidisciplinary approach to data-driven innovation and regulatory frameworks. In the first part of the course, students will acquire advanced quantitative methods for analyzing and extracting value from large-scale datasets and continuous data flows, equipping them with the analytical tools needed to integrate data-driven methods in the decision-making process. Emphasis will be placed on data collection, processing, visualization, and predictive modeling using real-world data sources from both public and private sectors. The second part of the course focuses on the evolving European regulatory and policy framework governing data. Students will engage with key instruments such as the General Data Protection Regulation (GDPR), the Data Governance Act (DGA), the Data Act, the Digital Services Act (DSA), the Digital Markets Act (DMA), the Open Data Directive, and the development of European Data Spaces. The European Union is progressively shaping an internal data market where data are no longer viewed solely through a protectionist or conservative lens — focused on privacy or intellectual property — but are increasingly recognized as a strategic asset, to be leveraged through sharing and reuse mechanisms that enable new opportunities for innovation, value creation, and business growth. Through case-based learning and discussions with experts, the course will explore the legal, ethical, and strategic implications of these frameworks, highlighting both synergies and tensions in their application—particularly in relation to innovation, competition, and sustainability goals. By bridging technical and regulatory perspectives, the course aims to prepare students to navigate and shape the data economy in alignment with European values and global challenges.
Intended learning outcomes
Knowledge and comprehension: having demonstrable knowledge and understanding of the subject that builds on and exceeds the level achieved in the first three years; operating at a level where, with the support of specialized literature, there are some aspects that require knowledge of the latest developments in the field.
Applied knowledge and comprehension: being able to apply the acquired knowledge and understanding in such a way that shows a professional approach to one’s work, also having competencies for drawing up and deepening arguments, and for solving problems in the field.
Making autonomous judgements: being able to collect and interpret relevant information with the aim of building an opinion based on proficiently weighing relevant social, legal, economic and ethical aspects.
Communication skills: being able to communicate information, ideas and solutions to an audience consisting of specialists or non-specialists.
Ability to learn: having the necessary learning skills to enter a follow-up study that presupposes a high level of autonomy.
Course Contents
First Part (Big Data Science):
Big Data: sources and management
Data Summarization
Introduction to machine learning
Natural Language Processing
Network Analysis
Second Part (Big Data Governance):
The value of data in the EU and the evolution of related regulation
From personal data protection to data governance
The regulation of the data market in the EU
From an EU data strategy to a corporate data strategy
Reference Books
First Part (Big Data Science):
Slides, lecture notes, and datasets provided by the lecturer.
Second Part (Big Data Governance): Handouts, slides, and materials provided directly by the Chair.
Teaching Methods
Acquisition: lectures, podcasts and online quizzes
Practice: guest speakers, case study and simulation
Investigation: analyzing ideas and information in a range of materials and resources, using legal databases to collect and analyze data and comparing texts
Collaboration: discussing output and building joint output
Discussion: seminars, group based class discussion, online forums and synchronous and asynchronous discussion
Production: essays, reports and presentations
Assessment Method
15% of the final grade is based on attendance and active participation in class (7.5% for attendance and 7.5% for participation through contributions, opinions, initiatives, questions, and comments). In case of justified reasons for non-attendance, these must be communicated and validated by the Chair at the beginning of the course.
35% of the final grade is based on the first written mid-term test (Data Science);
35% of the final grade is based on the second written mid-term test (Data Governance);
15% of the final grade is based on the final oral exam.
Non-attending students or students who have failed or refused the grade of one or both mid-term tests may retake them under the same conditions on the day of the oral exam.
Thesis assignment criteria
Genuine interest in the subject and intellectual appreciation by the Chair.
Week 1
Course Introduction and Objectives (both Proff. Federico and Brozzetti)
- Prof. Federico -
Introduction to Big Data: types of data, data sources and data management.
Week 2
- Prof. Federico -
Data Summarization: data preprocessing, exploratory analysis, descriptive statistics, data visualization.
Week 3
- Prof. Federico -
Basics of machine learning: linear, logistic and Poisson regression, unsupervised clustering, introduction to neural networks.
Week 4
- Prof. Federico -
Basics of machine learning (continued): linear, logistic and Poisson regression, unsupervised clustering, introduction to neural networks.
Week 5
- Prof. Federico -
Textual data and NLP: keyword extraction, sentiment analysis, topic modeling.
Week 6
- Prof. Federico -
Network analysis: descriptive statistics on networks, network visualization, and community detection.
First Mid-Term Test
Week 7
- Prof. Brozzetti –
Data in the EU Regulatory Framework:
From the European Digital Agenda to Europe’s Digital Decade, through the Lisbon Treaty (1990–2030)
Regulation of personal data processing
The GDPR: key concepts
Week 8
- Prof. Brozzetti –
Personal Data Protection:
The GDPR: data management and data security
International data transfers: the U.S.–EU dispute
Week 9
- Prof. Brozzetti –
Data Sovereignty:
From Data Free Flow with Trust to the return of Data Localization in cloud policy
The EU’s strategic autonomy in the geopolitics of data: the Digital Strategy and the Data Strategy
Week 10
- Prof. Brozzetti –
The Data Market:
Data assets in the Digital Services Act (DSA) and the Digital Markets Act (DMA)
The issue of personal data monetization
Week 11
- Prof. Brozzetti –
The Data Market (continued):
From Data Protection to Data Governance: the Data Governance Act (DGA) and data reuse
Data Act: data sharing and data altruism
European Data Spaces
EU Open Data Strategy
Workshop: Synthetic Data (Guest Speaker with both Proff. Brozzetti and Federico)
Week 12
- Prof. Brozzetti –
A New Vision of Data Value:
Toward a proactive and dynamic strategy for corporate data asset management
An integrated legal approach to generate value and business from a company’s information assets
Second Mid-Term test