Logo UAB
2023/2024

Probability and Statistics

Code: 106552 ECTS Credits: 6
Degree Type Year Semester
2504392 Artificial Intelligence FB 1 2

Contact

Name:
Jozsef Zoltan Farkas
Email:
jozsefzoltan.farkas@uab.cat

Teaching groups languages

You can check it through this link. To consult the language you will need to enter the CODE of the subject. Please note that this information is provisional until 30 November 2023.


Prerequisites

There are no prerequisites besides the main contents of Mathematics from high school.


Objectives and Contextualisation

The goal of the course is to introduce the basic tools of probability and statistics used to analyse data coming from either natural, experimental, social or economic phenomena. A special focus will be given to the correct use of these tools and the interpretation of the results by providing the student with the required theoretical background. Moreover, a part of the course will be dedicated to introduce and familiarize the student with the use of the most common computer tools for statistical analysis.


Competences

  • Introduce changes to methods and processes in the field of knowledge in order to provide innovative responses to society's needs and demands.
  • Know, understand, use and apply appropriately the mathematical foundations necessary to develop systems for reasoning, learning and data manipulation.
  • Students must be capable of collecting and interpreting relevant data (usually within their area of study) in order to make statements that reflect social, scientific or ethical relevant issues.
  • Work independently, with responsibility and initiative, planning and managing time and available resources, and adapting to unforeseen situations.

Learning Outcomes

  1. Identify the main statistical distributions and how they are applied to different problems.
  2. Perform hypothesis tests and correctly interpret the results.
  3. Propose new methods or informed alternative solutions.
  4. Students must be capable of collecting and interpreting relevant data (usually within their area of study) in order to make statements that reflect social, scientific or ethical relevant issues.
  5. Understand and apply the basic concepts of probability theory.
  6. Understand the concept of regression and its application.
  7. Work independently, with responsibility and initiative, planning and managing time and available resources, and adapting to unforeseen situations.

Content

 

  • Topic 1.  Probability.

    • Introduction, sets, Venn diagrams, sample spaces.
    • Axioms of probability and main properties. Permutations, combinations.
    • Conditional probability. Bayes theorem. Independent events.
    • Random variables and distributions, expected value and variance.
    • Discrete random variables: Binomial and Poisson distributions.
    • Continuous random variables: Uniform, Normal and Exponential distributions. 
    • Central limit theorem and applications.

 

  • Topic 2.  Statistics
    • Descriptive statistics, visualisation techniques.
    • Joint distributions, covariance and correlation.
    • Point estimation. General concepts, unbiased estimators. Maximum likelihood estimation.
    • Interval estimation. Confidence intervals for a single mean or proportion.
    • Confidence interval for the mean and variance of a normal population, t distribution.
    • Prediction interval for a future value, chi-squared distribution
    • Hypothesis testing, general concepts. Type I and II errors. Error estimation.
    • Test for a population proportion. Test for the mean of a normal population.
    • Large sample test for a normal population with unknown variance.

Methodology

Besides the mandatory student’s personal work, the course will have three distinguished types of activities: The core and main theoretical lessons, problem sessions, and practice in the computer lab. With the correct combination of these activities the specific skills will be achieved.

New material will be mainly introduced in lectures where the professor will explain the main theoretical results backing the tools that will be used throughout. These theoretical results will be complemented by exercises.  More examples and exercises will be discussed during the problem sessions where the teacher will provide an oriented resolution of the proposed problems. Students are encouraged to attend the sessions having previously tried the exercises by themselves. 

The main theoretical part together with typical exercises will have a partial evaluation halfway along the semester. This should provide the student with a measure of their progression. 

There will be practice sessions with specialized computer software for statistical analysis. These sessions will have the double effect of introducing students to the typical procedures of data analysis, as well as providing lots of examples illustrating the subject. 

The practical sessions will have an independent evaluation with assignments to hand in, possibly at the end of some of the sessions.

In all the evaluations, special attention will be given to the correctness and validity of the assertions and arguments used. These include vocabulary, mathematical correctness and clarity in writing. 

*The proposed teaching methodology may experience some modifications depending on the restrictions to face-to-face activities enforced by health authorities.

Annotation: Within the schedule set by the centre or degree programme, 15 minutes ofoneclass will be reserved for students to evaluate their lecturers and their courses or modules through questionnaires.

 

Annotation: Within the schedule set by the centre or degree programme, 15 minutes of one class will be reserved for students to evaluate their lecturers and their courses or modules through questionnaires.


Activities

Title Hours ECTS Learning Outcomes
Type: Directed      
Practices at the Computer Lab 12 0.48
Problem sessions 12 0.48
Theoretical lectures 26 1.04
Type: Supervised      
Tutoring and consultations 10 0.4
Type: Autonomous      
Independent study and preparation 60 2.4

Assessment

The evaluation of theory and problems will consist of two partial exams. The first one, with a  weight of 30% and the second one with a weight of 40%. For these evaluations there will be a second-chance exam at the end of the semester. The remaining 30% of the evaluation weight will come from the computer practices. This will be obtained from different assignments delivered throughout the course, for which there will not be a second chance evaluation.

In order to attend the recovery examination, students must have been previously evaluated in a series of activities whose weight equals at least two thirds of the total.

A weighted average of a minimum of 4 out of 10 is required in the partial exams or in their recovery. A minimum grade of 4 out of 10 is also required in the average grade of the practice assignments. If the minimum of each module is reached, the final grade is the weighted mean. Otherwise, the final grade is the minimum between the weighted means and 4.5 (out of 10).

Those who have not taken tests that add up to 50% of the course will be considered Non-Assessable.

In order to pass the course with honours, the final grade must be equal to or higher than 9 (out of 10). This will be given to students that, according to the criterion of the professor, have reached in a brilliant manner all the goals of the subject.

The dates for the assessments and delivery of assignments will be published in a dedicated webpage for the course.


Assessment Activities

Title Weighting Hours ECTS Learning Outcomes
First partial exam 30% 2 0.08 5, 6, 1, 3, 4, 2, 7
Hand in assignments 30% 20 0.8 5, 6, 1, 3, 4, 2, 7
Recovery exam 70% 5 0.2 5, 6, 1, 3, 4, 2, 7
Second partial exam 40% 3 0.12 5, 6, 1, 3, 4, 2, 7

Bibliography

 Main textbook:

Jay L. Devore; Probability and Statistics for engineering and the sciences

 

Further reading:

Bardina, X. Farré, M. Estadística descriptiva. Manuals UAB, 2009.

Besalú, M. Rovira C. Probabilitats i estadística. Publicacions i Edicions de la Universitat de Barcelona, 2013.

Delgado, R. Probabilidad y Estadística para ciencias e ingenierías. Delta, Publicaciones Universitarias. 2008.

Montgomery, D. C. Runger, G. C. Probabilidad y estadística aplicadas a la ingeniería. Limusa Wiley, 2002.

Walpole, R. Myers, R. H. Myers, S. L. Probabilidad y estadística para ingenieros. Prentice Hall, 1999.


Software

One of the main tools for statistical analysis and development and which has gained a growing popularity in academia is the R language. The R project is a free software environment providing a large set of libraries and tools aimed at statistical computing and graphical representation of data.

The student will learn the basics of the R language though the use of the integrated development environment RStudio

  R:https://www.r-project.org/          

  Rstudio: https://www.rstudio.com/

No special version of the software is required for the aims of this course.