This version of the course guide is provisional until the period for editing the new course guides ends.

Logo UAB

3D Vision

Code: 44775 ECTS Credits: 6
2024/2025
Degree Type Year
4318299 Computer Vision OB 0

Contact

Name:
Maria Isabel Vanrell Martorell
Email:
maria.vanrell@uab.cat

Teachers

Josep Ramon Casas Pla
Javier Ruiz Hidalgo
Gloria Haro Ortega
Antonio Agudo Martínez
Marc Pérez Quintana
Federico Sukno

Teaching groups languages

You can view this information at the end of this document.


Prerequisites

Degree in Engineering, Maths, Physics or similar


Objectives and Contextualisation

Module Coordinator: Dr. Gloria Haro

The goal of this module is to learn the principles of the 3D reconstruction of an object or a scene from multiple images or stereoscopic videos. For that, the basic concepts of the projective geometry and the 3D space are firstly introduced. The rest of the theoretical aspects and applications are built upon these basic tools. The mapping from the 3D world to the image plane will be studied, for that we will introduce different camera models, their parameters and how to estimate them (camera calibration and auto-calibration). The geometry that relates a pair of views will be analyzed. All these concepts will be applied to obtain a 3D reconstruction in the two main possible settings: calibrated or uncalibrated cameras. In particular, we will learn how to: estimate the depth of image points, extract the underlying 3D points given a set of point correspondences in the images, generate novel views, estimate the 3D object given a set of calibrated color images or binary images, and estimate a sparse set of 3D points given a set of uncalibrated images. The 3D representation in voxels and meshes will be studied. We will explain the reconstruction and modeling from Kinect data, as a particular model of sensors that provide an image of the scene together with its depths. Finally, we will see some techniques for processing 3D point clouds. The concepts and techniques learnt in this module are used in real applications ranging from augmented reality, object scanning, motion capture, new view synthesis, bullet-time effect, robotics, etc.


Learning Outcomes

  1. CA01 (Competence) Integrate the formulation of all the components of a complete system of 3D information recovery and synthesis.
  2. CA06 (Competence) Achieve the objectives of a project of vision carried out in a team.
  3. KA04 (Knowledge) Identify the basic problems to be solved in a case of 3D information recovery from a scene.
  4. KA12 (Knowledge) Provide the best geometric information needed to model all parts of a problem of recovering 3D information from a scene.
  5. SA04 (Skill) Solve a problem of 3D information recovery and evaluate the results.
  6. SA10 (Skill) Define the best data sets for training 3D vision architectures.
  7. SA15 (Skill) Prepare a report that describes, justifies and illustrates the development of a project of vision.
  8. SA17 (Skill) Prepare oral presentations that allow debate of the results of a project of vision.

Content

  1. Introduction and applications.
  2. 2D projective geometry. Planar transformations. 
  3. Homography estimation. Affine and metric rectification
  4. 3D projective geometry and transformations. Camera models
  5. Camera calibration. Pose estimation.
  6. Epipolar geometry. Fundamental matrix. Essential matrix. Extraction of camera matrices.
  7. Computation of the fundamental matrix. Image rectification
  8. Triangulation methods. Depth computation. New View Synthesis
  9. Multi-view stereo. Structure from motion.
  10. Auto-calibration. Bundle adjustment.
  11. 3D sensors (kinect).
  12. Point cloud processing.

Activities and Methodology

Title Hours ECTS Learning Outcomes
Type: Directed      
Lecture sessions 20 0.8 CA01, CA06, KA04, KA12, SA04, SA10, SA15, SA17
Type: Supervised      
Project follow-up sessions 8 0.32 CA01, CA06, KA04, KA12, SA04, SA10, SA15, SA17
Type: Autonomous      
Homework 113 4.52 CA01, CA06, KA04, KA12, SA04, SA10, SA15, SA17

Supervised sessions: (Some of these sessions could be synchronous on-line sessions)

  • Lecture Sessions, where the lecturers will explain general contents about the topics. Some of them will be used to solve the problems.

Directed sessions: 

  • Project  Sessions, where the problems and goals of the projects will be presented and discussed, students will interact with the project coordinator about problems and ideas on solving the project (approx. 1 hour/week)
  • Presentation Session, where the students give an oral presentation about how they have solved the project and a demo of the results.
  • Exam Session, where the students are evaluated individually. Knowledge achievements and problem-solving skills

Autonomous work:

  • Student will autonomously study and work with the materials derived from the lectures.
  • Student will work in groups to solve the problems of the projects with deliverables:
    • Code
    • Reports
    • Oral presentations

Annotation: Within the schedule set by the centre or degree programme, 15 minutes of one class will be reserved for students to evaluate their lecturers and their courses or modules through questionnaires.


Assessment

Continous Assessment Activities

Title Weighting Hours ECTS Learning Outcomes
Exam 0.4 2.5 0.1 CA01, CA06, KA04, KA12, SA04, SA10, SA15, SA17
Project 0.55 6 0.24 CA01, CA06, KA04, KA12, SA04, SA10, SA15, SA17
Session attendance 0.05 0.5 0.02 CA01, CA06, KA04, KA12, SA04, SA10, SA15, SA17

The final mark for this module will be computed with the following formula

Final Mark = 0.4 x Exam + 0.55 x Project+ 0.05 x Attendance

where,

Exam:  is the mark obtained in the Module Exam (must be >= 3).

Attendance: is the mark derived from the control of attendance at lectures (minimum 70%)

Projects:  is the mark provided by the project coordinator based on the weekly follow-up of the project and deliverables (must be >= 5). All accordingly with specific criteria such as:

  • Participation in discussion sessions and in team work (inter-member evaluations)
  • Delivery of mandatory and optional exercises.
  • Code development (style, comments, etc.)
  • Report (justification of the decisions in your project development)
  • Presentation and exam (Talk, explanations and demonstrations on your project)

Only those students that fail (Final Mark < 5.0) can do a retake exam.


Bibliography

Books: 

  1. O. Faugeras, Three-dimensional computer vision: a geometric viewpoint, MIT Press, cop. 1993.
  2. O. Faugeras, Q.T. Loung, The geometry of multiple images, MIT Press, 2001.
  3. D. A. Forsyth, J. Ponce, Computer vision: a modern approach, Prentice Hall, 2003.
  4. R. I. Hartley, A. Zisserman, Multiple view geometry in computer vision, Cambridge University Press, 2000.
  5. R. Szeliski, Computer Vision: Algorithms and Applications, Springer, 2011.  

Tutorials:

  1. Y. Furukawa and C. Hernández, Multi-View Stereo: A Tutorial, Foundations and Trends® in Computer Graphics and Vision, vol. 9, no. 1-2, pp.1-148, 2013.
  2. T. Moons, L. Van Gool, M. Vergauwen, 3D Reconstruction from Multiple Images Part 1, Principles, Foundations and Trends® in Computer Graphics and Vision, vol. 4: no. 4, pp 287-404, 2010. 

Software

Python Programming Tools with special attention to computer vision and image processing libraries

 


Language list

Name Group Language Semester Turn
(PLABm) Practical laboratories (master) 1 English first semester morning-mixed
(TEm) Theory (master) 1 English first semester morning-mixed