Logo UAB
2022/2023

Probability and Statistics

Code: 106552 ECTS Credits: 6
Degree Type Year Semester
2504392 Artificial Intelligence FB 1 2

Contact

Name:
Noelia Viles Cuadros
Email:
noelia.viles@uab.cat

Use of Languages

Principal working language:
english (eng)
Some groups entirely in English:
Yes
Some groups entirely in Catalan:
No
Some groups entirely in Spanish:
No

Prerequisites

There are no prerequisites besides the main contents of Mathematics from high school.

Objectives and Contextualisation

The goal of the course is to introduce the basic tools of probability and statistics used to analyse data coming from either natural, experimental, social or economic phenomena. A special focus will be given to the correct use of these tools and the interpretation of the results by providing the student with the required theoretical background. Moreover, a part of the course will be dedicated to introduce and familiarize the student with the use of the most common computer tools for statistical analysis.

Competences

  • Introduce changes to methods and processes in the field of knowledge in order to provide innovative responses to society's needs and demands.
  • Know, understand, use and apply appropriately the mathematical foundations necessary to develop systems for reasoning, learning and data manipulation.
  • Students must be capable of collecting and interpreting relevant data (usually within their area of study) in order to make statements that reflect social, scientific or ethical relevant issues.
  • Work independently, with responsibility and initiative, planning and managing time and available resources, and adapting to unforeseen situations.

Learning Outcomes

  1. Identify the main statistical distributions and how they are applied to different problems.
  2. Perform hypothesis tests and correctly interpret the results.
  3. Propose new methods or informed alternative solutions.
  4. Students must be capable of collecting and interpreting relevant data (usually within their area of study) in order to make statements that reflect social, scientific or ethical relevant issues.
  5. Understand and apply the basic concepts of probability theory.
  6. Understand the concept of regression and its application.
  7. Work independently, with responsibility and initiative, planning and managing time and available resources, and adapting to unforeseen situations.

Content

  • Topic 1.Descriptive statistics.
    • Descriptive study in one variable: frequency distributions, graphic representation and numerical summaries.
    • Descriptive study in two variables: correlation and regression. Contingency tables.
  • Topic 2.  Probability.

    • Notion of probability and main properties. Conditional probability. Bayes Formula. Independence of events.
    • Expectation and variance of a random variable.
    • Discrete random variables: Bernoulli, Binomial and Poisson distributions.
    • Continuous random variables: Normal and Exponential distributions. 
    • Independence of random variables.
    • Central limit theorem.
  • Topic 3.  Statistical inference.
    • Sample and population. Most frequent statistics.
    • Confidence intervals for a single mean or proportion.
    • Hypothesis tests. Main theoretical concepts; p-values.
    • Test for the mean and for the variance of a Normal population,. Test for comparison of means, proportions...
    • Chi-square goodness of fit test for independence.

Methodology

Besides the mandatory student’s personal work, the course will have three distinguished types of activities: The core and main theoretical lessons, problem sessions, and practice in the computer lab. With the correct combination of these activities the specific skills will be achieved.

New material will be mainly introduced in lectures where the professor will explain the main theoretical results backing the tools that will be used throughout. Although these explanations can be complemented with support exercises, those will be mainly worked during the problem sessions where the teacher will provide an oriented resolution of some proposed problems. The student is encouraged to attend the sessions having previously tried the exercises by himself. 

The main theoretical part together with typical exercises will have a partial evaluation halfway along the semester. This should provide the student with a measure of his or her progression. 

There will be practice sessions with specialized computer software for statistical analysis. These sessions will have the double effect of introducing the student with the typical procedures of data analysis, as well as providing lots of examples illustrating the subject. 

The practical sessions will have an independent evaluation with assignments to hand in, possibly at the end of some of the sessions.

In all the evaluations, special attention will be given to the correctness and validity of the assertions and arguments used. These include vocabulary, mathematical correctness and clarity in writing. 

*The proposed teaching methodology may experience some modifications depending on the restrictions to face-to-face activities enforced by health authorities.

Annotation: Within the schedule set by the centre or degree programme, 15 minutes of oneclass will be reserved for students to evaluate their lecturers and their courses or modules through questionnaires.

 

Annotation: Within the schedule set by the centre or degree programme, 15 minutes of one class will be reserved for students to evaluate their lecturers and their courses or modules through questionnaires.

Activities

Title Hours ECTS Learning Outcomes
Type: Directed      
Practices at the Computer Lab 12 0.48
Problem sessions 12 0.48
Theoretical lectures 26 1.04
Type: Supervised      
Tutoring and consultations 10 0.4
Type: Autonomous      
Independent study and preparation 60 2.4

Assessment

The evaluation of theory and problems will consist of two partial exams. The first one, with a  weight of 30% and the second one with a weight of 40%. For these evaluations there will be a second-chance exam at the end of the semester. The remaining 30% of the evaluation weight will come from the computer practices. This will be obtained from different assignments delivered throughout the course, for which there will not be a second chance evaluation.

In order to attend the recovery examination, students must have been previously evaluated in a series of activities whose weight equals at least two thirds of the total.

A weighted average of a minimum of 4 out of 10 is required in the partial exams or in their recovery. A minimum grade of 4 out of 10 is also required in the average grade of the practice assignments. If the minimum of each module is reached, the final grade is the weighted mean. Otherwise, the final grade is the minimum between the weighted means and 4.5 (out of 10).

Those who have not taken tests that add up to 50% of the course will be considered Non-Assessable.

In order to pass the course with honours, the final grade must be equal to or higher than 9 (out of 10). This will be given to students that, according to the criterion of the professor, have reached in a brilliant manner all the goals of the subject.

The dates for the assessments and delivery of assignments will be published in a dedicated webpage for the course.

Assessment Activities

Title Weighting Hours ECTS Learning Outcomes
First partial exam 30% 2 0.08 5, 6, 1, 3, 4, 2, 7
Hand in assignments 30% 20 0.8 5, 6, 1, 3, 4, 2, 7
Recovery exam 70% 5 0.2 5, 6, 1, 3, 4, 2, 7
Second partial exam 40% 3 0.12 5, 6, 1, 3, 4, 2, 7

Bibliography

Bardina, X. Farré, M. Estadística descriptiva. Manuals UAB, 2009.

Besalú, M. Rovira C. Probabilitats i estadística. Publicacions i Edicions de la Universitat de Barcelona, 2013.

Delgado, R. Probabilidad y Estadística para ciencias e ingenierías. Delta, Publicaciones Universitarias. 2008.

Devore, J. L. Probabilidad y estadística para ingeniería y ciencias. Thomson, 2005.

Montgomery, D. C. Runger, G. C. Probabilidad y estadística aplicadas a la ingeniería. Limusa Wiley, 2002.

Walpole, R. Myers, R. H. Myers, S. L. Probabilidad y estadística para ingenieros. Prentice Hall, 1999.

Software

One of the main tools for statistical analysis and development and which has gained a growing popularity in academia is the R language. The R project is a free software environment providing a large set of libraries and tools aimed at statistical computing and graphical representation of data.

The student will learn the basics of the R language though the use of the integrated development environment RStudio

  R:https://www.r-project.org/          

  Rstudio: https://www.rstudio.com/

No special version of the software is required for the aims of this course.