
Vision and Learning

Code: 106582 ECTS Credits: 6
2025/2026
Degree Type Year
Artificial Intelligence OT 3
Artificial Intelligence OT 4

Contact

Name:
Debora Gil Resina
Email:
debora.gil@uab.cat

Teachers

Guillermo Eduardo Torres
Debora Gil Resina

Teaching groups languages

You can view this information at the end of this document.


Prerequisites

Students should have taken the subjects Fundamentals of Machine Learning, Fundamentals of Programming, Fundamentals of Computer Vision, Probability and Statistics, and Neural Networks and Deep Learning.

It is recommended that students have knowledge of and skills in:

  • Programming in the Python programming language
  • Signal, Image and Video Processing
  • Statistical validation
  • Computational Learning and Deep Learning

 


Objectives and Contextualisation

Roughly every decade, a technological tsunami transforms multiple industries. Artificial Intelligence (AI) is the wave currently sweeping the technological world. If you have ever wondered:

  • How do computers perform face detection in crowds?
  • How do video calling apps blur the background or replace it with other images?
  • How do autonomous cars move safely in an urban environment?
  • How is the ball tracked with such precision in televised sporting events like tennis, soccer, and basketball?
  • Can we know the most effective cancer treatment from multimodal patient data?
  • Can we know a person's emotions from a video?
  • How do machines learn?


If we have aroused your curiosity, this course is what you need. In this course, we will learn about topics in Computer Vision such as Object Tracking, Image Classification, Personalized Medicine, Face Detection, Optical Flow, Human Pose Estimation and many more.

Unlike other computer vision courses, this course approaches computer vision in a more practical, experiential and intuitive way. Its main component is a set of projects that must be developed by students divided into teams. All you need is a working knowledge of the Python programming language.

We will use Python which allows us to incorporate different computer vision libraries. It is used by thousands of companies, products, and devices and is tested every day for scalability and performance. We will also learn to design and adapt specific networks and choose the most appropriate processing method according to the requirements and restrictions of each application.

In summary, Vision and Learning is an eminently practical and interdisciplinary subject that stands on the bridge between artificial intelligence and the real world and that aims to cross this bridge in both directions.


Competences

    Artificial Intelligence
  • Analyse and solve problems effectively, generating innovative and creative proposals to achieve objectives.
  • Conceptualize and model alternatives of complex solutions to problems of application of artificial intelligence in different fields and create prototypes that demonstrate the validity of the proposed system.
  • Develop critical thinking to analyse alternatives and proposals, both one's own and those of others, in a well-founded and argued manner.
  • Develop strategies to formulate and solve different learning problems in a scientific, creative, critical and systematic way, knowing the capabilities and limitations of the different existing methods and tools.
  • Introduce changes to methods and processes in the field of knowledge in order to provide innovative responses to society's needs and demands.
  • Students can apply their knowledge to their work or vocation in a professional manner and have the competences usually demonstrated by preparing and defending arguments and solving problems within their area of study.
  • Work cooperatively to achieve common objectives, assuming own responsibility and respecting the role of the different members of the team.

Learning Outcomes

  1. Analyse a situation and identify areas for improvement.
  2. Analyse and solve problems effectively, generating innovative and creative proposals to achieve objectives.
  3. Design the best convolutional network architectures for solving image sequence problems.
  4. Develop critical thinking to analyse alternatives and proposals, both one's own and those of others, in a well-founded and argued manner.
  5. Identify the basic concepts of computational learning and adequately apply its techniques to image recognition.
  6. Plan, develop, evaluate and implement a solution to a particular visual recognition problem.
  7. Propose new methods or informed alternative solutions.
  8. Select and design the best data sets for training networks.
  9. Select and design the best methods for training neural networks.
  10. Select and design the best techniques for evaluating the results of training methods or networks.
  11. Students can apply their knowledge to their work or vocation in a professional manner and have the competences usually demonstrated by preparing and defending arguments and solving problems within their area of study.
  12. Use optimization techniques to plan, develop, evaluate, and implement a solution to a particular problem.
  13. Work cooperatively to achieve common objectives, assuming own responsibility and respecting the role of the different members of the team.

Content

1. Introduction to Computational Learning in Computer Vision

2. Classification of Images

3. Object Detection

4. Segmentation of Regions

5. Indexing and Retrieval

6. Image Generation

7. Multimodal Learning

 


Activities and Methodology

Title Hours ECTS Learning Outcomes
Type: Directed      
Theory lectures 10 0.4 5, 6, 8, 9, 10, 12
Type: Supervised      
Working seminars 20 0.8 2, 1, 3, 11, 8, 9, 10, 12
Type: Autonomous      
Personal work 115 4.6 2, 1, 4, 3, 6, 7, 11, 13, 12

The subject will be managed through the Caronte document manager (http://caronte.uab.cat/), which will be used to manage the work teams, submit deliverables, view grades, communicate with teachers, etc. To use it, take the following steps:

  1. Register as a user by giving your name, NIU, and a passport photo in JPG format. If you have already registered for another subject, you do not need to register again and can go to the next step.
  2. Enroll in the type of teaching "VISION AND LEARNING", giving as subject code the one provided on the first day of class.


The course will follow a teaching and learning methodology called Project Based Learning (PBL). The PBL methodology aims to empower and motivate students in their learning. Groups of 5 or 6 students will be formed and entrusted with carrying out a set of medium-sized projects throughout the semester. There will be weekly follow-up and both group and individual tutoring of the students.

The projects are set by the teaching staff in such a way that they meet the following conditions: be as real as possible; be treatable by elementary tools; not have an associated standard solution algorithm.

It is also essential to understand that the goal is not to find an algorithm that works in 100% of cases (often there is no such thing), but simply to give a reasonable solution proposal.

Projects should be developed by each team with the maximum possible autonomy. Each team will be assigned a tutor who will follow its progress but will, in principle, refrain from imposing their own ideas. Students must also be clear that the aim is not to look for the solution to the problem elsewhere, but to make an original contribution. This does not mean giving up the information available in the bibliography or on the Internet; but when it is used, the teacher must be informed and it must be explained in the report.

Each project must end in a program and a final report. In addition to being delivered in written form, the results of this report will be the subject of an oral presentation. Both the written report and the oral presentation must be addressed mainly to the entity, most likely hypothetical, that would have proposed the problem. As a general rule, technicalities will be relegated to specific sections of the written report.

The whole class is expected to attend the oral presentations of the projects and to take part through questions and observations.

Note: Within the schedule set by the centre or degree programme, 15 minutes of one class will be reserved for students to complete the surveys evaluating the performance of the teaching staff and the subject/module.


Assessment

Continuous Assessment Activities

Title Weighting Hours ECTS Learning Outcomes
Group Grade 60% 0 0 2, 1, 4, 3, 5, 6, 7, 8, 9, 10, 13, 12
Individual Grade 40% 5 0.2 2, 3, 5, 6, 11, 8, 9, 10, 12

The subject has 2 assessment activities:

  1. Group projects (60%)
  2. Individual written test (40%)

 Project Evaluation

The subject has 3 projects of increasing difficulty. At the end of each project, students will submit a report of the work carried out that will be evaluated by the professors of the subject, whether or not they are the tutors. The following INSTRUMENTS and ACTIVITIES will be used for the evaluation:

    • PROJECT REPORT: Document explaining the development of the work carried out: project approach, minutes of meetings, information searched, explanation of the implemented application with a small user manual, and tests carried out. It is mandatory to follow the model provided in the CV in terms of the form, structure and content of each section.
    • APPLICATION: developed program.
    • CLASS MONITORING: Evaluation based on the observations made by the tutors in the tutored sessions, where the student's attitude, initiative, participation, attendance and punctuality at the group sessions will be taken into account...

Given that the projects are developed throughout the course, their evaluation is continuous, and there is no resit for their final grade.

Individual Test

At the end of the course, an individual written test will be taken where the student must demonstrate that they have understood the contents and methodologies used in the projects carried out.

Grading

The grade of the subject is the weighted average between the grade of the projects and the grade of the individual test:

Final grade = 0.6 * Project grade + 0.4 * individual grade

The project grade is the average of the 3 projects carried out. A minimum of 5 in the individual grade is required for the weighted average to be computed. The subject is passed if the final grade is >= 5.
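
As an illustration of the grading rule above, the following minimal Python sketch computes the final grade from the three project grades and the individual grade. The function and constant names are hypothetical, and returning None when the individual grade is below 5 is an assumption: the guide only says that the average is not computed in that case, not which grade is recorded.

```python
from statistics import mean
from typing import Optional, Sequence

PROJECT_WEIGHT = 0.6        # weight of the project grade (from the guide)
INDIVIDUAL_WEIGHT = 0.4     # weight of the individual test (from the guide)
MIN_INDIVIDUAL_GRADE = 5.0  # minimum individual grade needed to average
PASS_MARK = 5.0             # the subject is passed if the final grade is >= 5


def final_grade(project_grades: Sequence[float],
                individual_grade: float) -> Optional[float]:
    """Return the weighted final grade, or None if the average is not computed."""
    if len(project_grades) != 3:
        raise ValueError("expected the grades of the 3 projects")
    if individual_grade < MIN_INDIVIDUAL_GRADE:
        # Assumption: the guide does not say which grade is recorded here,
        # so this sketch simply reports that no average is available.
        return None
    project_grade = mean(project_grades)
    return PROJECT_WEIGHT * project_grade + INDIVIDUAL_WEIGHT * individual_grade


def is_passed(grade: Optional[float]) -> bool:
    """A subject is passed only when an average exists and reaches the pass mark."""
    return grade is not None and grade >= PASS_MARK
```

For example, project grades of 6, 7 and 8 with an individual grade of 5.5 give a project grade of 7 and a final grade of about 6.4, which is a pass. With an individual grade of 4, no average is computed, whatever the project grades are.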

To distinguish between 'failed' and 'not presented', a deadline is set for students to withdraw from the assessment, in which case they will appear as 'not presented'. To withdraw, it will be necessary to notify the teacher, in writing or by email, and obtain an acknowledgement of receipt.

If it is proven that any of the content of a project has been plagiarized, produced by a third person other than the student, or generated by AI, the project will automatically be failed.

Single Assessment: This subject does not offer the single assessment system.

Use of AI: AI tools may be used as learning support tools (e.g. to improve writing, style, expository clarity and linguistic correctness, or for technical assistance). In no case may they replace the student's own learning activity or their acquisition of the specific knowledge of the subject.

It is not acceptable to use artificial intelligence tools to generate work content that is subject to evaluation. Evaluable tasks or activities suspected of having been generated by an AI instead of by the student will be considered copied and will be graded with a 0.

 

 


Bibliography

- Richard Szeliski, Computer Vision: Algorithms and Applications, 2nd Edition. Springer (Texts in Computer Science), 2021. (http://szeliski.org/Book/)

- Ian Goodfellow and Yoshua Bengio and Aaron Courville, Deep Learning, MIT Press, 2016. (http://www.deeplearningbook.org)

- Adrian Kaehler, Gary Bradski, Learning OpenCV 3: Computer Vision in C++ with the OpenCV Library, O'Reilly, 2016.

- Aurélien Géron, Hands-On Machine Learning with Scikit-Learn & TensorFlow, O'Reilly, 2017.

- Eli Stevens, Luca Antiga, Thomas Viehmann, Deep Learning with PyTorch, Manning Publications, 2020. (https://pytorch.org/assets/deep-learning/Deep-Learning-with-PyTorch.pdf)

- François Chollet, Deep Learning with Python, Manning Publications, 2021. (https://github.com/fchollet/deep-learning-with-python-notebooks)


Software

To develop the different computer vision systems, in both practical and problem-solving sessions, the Python programming language will be used, working with Jupyter Notebooks.


Groups and Languages

Please note that this information is provisional until 30 November 2025. You can check it through this link. To consult the language you will need to enter the CODE of the subject.

Name Group Language Semester Turn
(PAUL) Classroom practices 711 English first semester afternoon
(PLAB) Practical laboratories 711 English first semester afternoon
(TE) Theory 71 English first semester afternoon