Degree | Type | Year |
---|---|---|
Modelización para la Ciencia y la Ingeniería / Modelling for Science and Engineering | OP | 1 |
You can view this information at the end of this document.
Students should have basic knowledge of linear algebra, probability, statistical inference, and linear models. Experience using R and Python is also highly recommended.
Huge amounts of data are generated today across a wide range of fields, including health, engineering, the social sciences, economics, etc. While this exponential growth of data presents significant challenges, it also creates opportunities to extract relevant information and facilitate evidence-based decision-making, process optimisation, and the generation of new knowledge. This course aims to provide students with the mathematical, statistical, and computational knowledge, as well as the necessary tools for processing, analysing, and modelling large datasets. Special emphasis is also placed on interpreting the information obtained and using it to transform data into meaningful knowledge, leading to more accurate conclusions and better decision-making. The course focuses particularly on learning and applying some mathematical, statistical, and computational methods to identify patterns, trends, and relationships within massive and complex datasets.
Block 1. Text Mining (10 h):
Block 2. Statistics for Big Data (18 h):
Block 3. Deep Learning (10 h):
Title | Hours | ECTS | Learning Outcomes |
---|---|---|---|
Type: Directed | |||
Lecture sessions | 19 | 0.76 | CA28, CA29, KA21, KA22, CA28 |
Problem-solving and practical sessions | 11 | 0.44 | CA27, CA28, CA29, CA30, SA27, SA28, CA27 |
Type: Supervised | |||
Problem-solving and practical sessions | 8 | 0.32 | CA27, CA28, CA29, CA30, SA27, SA28, CA27 |
Type: Autonomous | |||
Self-directed learning to deepen understanding of lecture topics | 43 | 1.72 | CA28, KA21, KA22, SA28, CA28 |
Tasks to practise the concepts introduced during in-person classes | 50 | 2 | CA27, CA28, CA29, CA30, KA21, KA22, SA27, CA27 |
The course is organised into three independent blocks, each taught by a different professor. While the first block of the course (10 hours) introduces concepts related to data mining, which are generally applied to large datasets, the second block (18 hours) focuses on the statistical methods and knowledge required for modelling such large volumes of data. Particular emphasis is placed on fitting and making inferences with classical linear models and generalised linear models (logistic and Poisson models) when working with large amounts of information. Finally, the third block of the course (10 hours) introduces students to some of the most relevant methods in deep learning, with a special focus on neural networks and their applications.
Each block generally combines lecture sessions introducing theory and technical concepts and with laboratory and problem-solving sessions, which may be instructor-led or based on independent student work. During lecture sessions, professors may use slides, which will be shared via the Moodle course page. Similarly, examples in R and/or Python may be presented during laboratory and problem-solving sessions, and these will generally be made available via Moodle. Students are also expected to independently review supplementary materials shared via Moodle in order to deepen their understanding of the concepts introduced in the in-person sessions.
Annotation: Within the schedule set by the centre or degree programme, 15 minutes of one class will be reserved for students to evaluate their lecturers and their courses or modules through questionnaires.
Title | Weighting | Hours | ECTS | Learning Outcomes |
---|---|---|---|---|
Projects Block 1 | 26 | 5 | 0.2 | CA27, CA28, CA29, KA22, SA27, SA28 |
Projects Block 2 | 48 | 9 | 0.36 | CA27, CA28, CA29, KA22, SA27, SA28 |
Projects Block 3 | 26 | 5 | 0.2 | CA27, CA28, CA29, CA30, KA21, KA22, SA27, SA28 |
The course evaluation is carried out independently for each of the described blocks. Each block's weighting in the final grade corresponds to its proportion of the total course hours. Each block typically consists of a number of projects, which can be completed either individually or in groups. In some cases, these projects will require an oral presentation of the content and results. The projects in blocks 1 and 3 each account for 26% of the final grade of the course, while those in block 2 account for 48% of the final grade of the course.
For each project, a document will be published on the Moodle page, containing the project description and requirements, as well as all necessary materials for its completion (datasets, additional information sources, etc.), the submission deadline and procedure, and any other information that the professor considers relevant. Grades for each project, as well as the final grades for each block and for the course as a whole, will also be published on the Moodle page.
Basic references:
Complementary references:
The professors may provide other interesting references for each block, which will be available via the Moodle page.
R Core Team (2021). R: A language and environment for statistical computing.
R Foundation for Statistical Computing, Vienna, Austria.
URL https://www.R-project.org/.
Python
Please note that this information is provisional until 30 November 2025. You can check it through this link. To consult the language you will need to enter the CODE of the subject.
Name | Group | Language | Semester | Turn |
---|---|---|---|---|
(TEm) Theory (master) | 1 | English | second semester | afternoon |