Logo UAB
2022/2023

Scenarios and Uses of Big Data

Code: 104749 ECTS Credits: 6
Degree Type Year Semester
2503873 Interactive Communication OB 4 1

Contact

Name:
Laura Cervi
Email:
laura.cervi@uab.cat

Use of Languages

Principal working language:
catalan (cat)
Some groups entirely in English:
No
Some groups entirely in Catalan:
Yes
Some groups entirely in Spanish:
No

Teachers

Marta Portales Oliva

Prerequisites

1. Student should have passed "Introduction to Big Data"
2. Students must have a sufficient working knowledge of Catalan and Spanish, the languages in which the classes are taught, and have at least knowledge of English at the reading level (the language in which most of the teaching materials are found).

Objectives and Contextualisation

The goal of the subject is the study and practice of Big Data architectures (Environment Hadoop/MapReduce-Spark). Cloud solutions and Big Data usage scenarios (geographical intelligence, social analytics, open data paradigm)
 

Competences

  • Act with ethical responsibility and respect for fundamental rights and duties, diversity and democratic values.
  • Act within one's own area of knowledge, evaluating sex/gender-based inequalities.
  • Determine and plan the technological infrastructure necessary for the creation, storage, analysis and distribution of interactive multimedia and social-networking products.
  • Introduce changes in the methods and processes of the field of knowledge to provide innovative responses to the needs and demands of society.
  • Manage time efficiently and plan for short-, medium- and long-term tasks.
  • Promote and launch new products and services based on massive-scale mining and analysis of data from the Media.
  • Search for, select and rank any type of source and document that is useful for creating messages, academic papers, presentations, etc.
  • Students must be capable of applying their knowledge to their work or vocation in a professional way and they should have building arguments and problem resolution skills within their area of study.
  • Students must be capable of communicating information, ideas, problems and solutions to both specialised and non-specialised audiences.
  • Students must develop the necessary learning skills to undertake further training with a high degree of autonomy.
  • Take account of social, economic and environmental impacts when operating within one's own area of knowledge.

Learning Outcomes

  1. Analyse a situation and identify its points for improvement.
  2. Communicate using language that is not sexist or discriminatory.
  3. Critically analyse the principles, values and procedures that govern the exercise of the profession.
  4. Cross-check information to establish its veracity, using evaluation criteria.
  5. Describe the infrastructure needed to store big data.
  6. Differentiate between the various types of existing architectures for working with big data.
  7. Distinguish the salient features in all types of documents within the subject.
  8. Evaluate the impact of problems, prejudices and discrimination that could be included in actions and projects in the short or medium term in relation to certain people or groups.
  9. Explain the characteristics of the infrastructure needed to recover big data.
  10. Explain the explicit or implicit deontological code in your area of knowledge.
  11. Explain the infrastructure needed to process big data.
  12. Extract large volumes of data from social networks and the new digital media in particular.
  13. Identify situations in which a change or improvement is needed.
  14. Identify the social, economic and environmental implications of academic and professional activities within one's own area of knowledge.
  15. Plan and execute academic projects in the field of big data.
  16. Propose new methods or well-founded alternative solutions.
  17. Propose projects and actions that are in accordance with the principles of ethical responsibility and respect for fundamental rights and obligations, diversity and democratic values.
  18. Propose projects and actions that incorporate the gender perspective.
  19. Propose viable projects and actions to boost social, economic and environmental benefits.
  20. Share experiences with the group as a path to learning, in order to work subsequently in multidisciplinary groups.
  21. Solve basic problems in big data.
  22. Submit course assignments on time, showing the individual and/or group planning involved.
  23. Weigh up the risks and opportunities of both one's own and other people's proposals for improvement.

Content

Unit 1. Introduction to the scenarios and uses of Big Data
										
											
										
											Unit 2: Study and practice of Big Data architectures (Hadoop/MapReduce- Spark Environment).
										
											
										
											
										
											Unit 3. Cloud solutions and Big Data usage scenarios
										
											3.1 Geographic intelligence
										
											3.2 Social analytics
										
											3.3 Open data paradigm.
										
											
										
											The detailed program will be provided on the first day of class

Methodology

The methodology is based on  the following activities:
Theoretical sessions: Introduction theoretical sessions to concepts
Laboratory practices: individual or team works in which practical activities are carried out with one point deliverable with time limit. Students must apply knowledge, distribute time and prepare deliveries within the classroom and in the hours spent in practice under the teacher's guidance.
Seminars: individual or teamwork in which more extensive practical activities are carried out and with deliverables open to student creativity. There are no limited time in the classroom, but deadlines for delivery. Students must apply knowledge, distribute time and prepare submissions by starting their work within the classroom, but continue it in the form of activities supervised by the teaching team.
Course final work: practical group assessment exercise in which students must solve, during course development, a practical application problem linked to the subject's objectives. Students must raise the problem and perform the four processes to provide a solution proposal based on large amounts of data: search, extraction, analysis and publication of data report including a proposal for a decision based on the information collected and analysed.

Annotation: Within the schedule set by the centre or degree programme, 15 minutes of one class will be reserved for students to evaluate their lecturers and their courses or modules through questionnaires.

Activities

Title Hours ECTS Learning Outcomes
Type: Directed      
Lab practices 18 0.72 3, 1, 20, 2, 4, 5, 7, 6, 10, 11, 9, 12, 14, 13, 15, 23, 22, 16, 17, 18, 19, 21, 8
SEMINARS 15 0.6 1, 4, 5, 7, 6, 11, 12, 13, 8
THEORETICAL SESSIONS 15 0.6 3, 2, 5, 7, 11, 9, 8
Type: Supervised      
ASSESSMENT 8.5 0.34 20, 2, 4, 5, 7, 12, 15, 22, 17, 18, 21, 8
TUTORING 8.5 0.34 1, 20, 2, 14, 13, 15, 23, 22, 16, 17, 18, 19, 21, 8
Type: Autonomous      
OTHER ACTIVITIES (study time; practice preparation; seminar preparation, readings, etc.) 50 2 3, 1, 2, 4, 5, 7, 6, 10, 11, 9, 12, 14, 13, 15, 22, 17, 18, 21, 8

Assessment

Activity A. Examination on (individual) contents. 20% of the endnote is worth.
Activity B. Course work (group). 25% of the endnote is worth.
Activity C: Oral presentation of the work (group). Endnote 10% value
Activity D. Laboratory Practice It is 35% worth of the endnote.
Activity E. Student portfolio. Endnote 10% is worth.



To approve the subject, it is necessary to get a minimum approval note (5,0) in each of the activities.



MAKE-UP:

In the last three weeks of the course, students who have not exceeded the subject can submit to a re-evaluation synthesis test consisting of a theoretical test and a practical exercise. The compulsory condition for the assignment recovery is to have done at least 2/3 of the total course practices (A and D activities) and to have obtained an average grade equal to or greater than 3.5 (and less than 5) in all evaluation activities.

In accordance with the above criteria, if a student does not perform at least 66% of the practices of the evaluation activities, it will be regarded as non-evaluationable.

PLAGIARISM:
In the event that the student performs any irregularity that may lead to a significant variation of an evaluation act, this evaluation act will be graded with 0, regardless of the disciplinary process that could be instructed. In the event, that several irregularities occur in the evaluation acts of the same subject, the final grade for this subject will be 0. 

Assessment Activities

Title Weighting Hours ECTS Learning Outcomes
COURSE PROJECT 25 10 0.4 1, 20, 2, 12, 13, 15, 23, 22, 16, 17, 18, 19, 21, 8
EXAM 20 2 0.08 3, 2, 5, 7, 6, 10, 11, 9, 12, 14, 8
ORAL PRESENTATION 10 2 0.08 1, 20, 2, 5, 6, 11, 9, 12, 13, 15, 22, 16, 17, 18, 19, 21, 8
PRACTICAL SESSIONS 35 13 0.52 3, 1, 20, 2, 4, 5, 7, 6, 10, 11, 9, 12, 14, 13, 15, 23, 22, 16, 17, 18, 19, 21, 8
STUDENT'S PORTFOLIO 10 8 0.32 3, 1, 20, 2, 4, 5, 7, 6, 10, 11, 9, 12, 14, 13, 15, 23, 22, 16, 17, 18, 19, 21, 8

Bibliography

Basic References

 

Casas Roma, Jordi (2019) Big data: análisis de datos en entornos masivos. Barcelona: Editorial UOC.

Duran, Xavier (2019). El imperio de los datos: el big data, la privacidad y la sociedad del futuro. PUV Publicacions, Universitat de València: Càtedra de Divulgació de la Ciència, UCC+i, Unitat de Cultura Científica i de la Innovació, Universitat de València.

Dur Lahoz-Beltrá, Rafael (2019). En las entrañas del big data: una aproximación a la estadística. Emse Edapp, S.L.

Fuller A. (2012). The White Book of Big Data. The definitive guide to the revolution in business analytics. Fujitsu. https://www.fujitsu.com/rs/Images/WhiteBookofBigData.pdf  

 

Other references

 

Tascón, Mario. "Introducción: Big data. Pasado, presente y futuro" Telos: Cuadernos de comunicación e innovación 95 (2013): 47-50. https://telos.fundaciontelefonica.com/archivo/numero095/#contenido 

Mohamed, Azlinah, et al. "The state of the art and taxonomy of big data analytics: view from new big data framework" Artificial Intelligence Review 53.2 (2020): 989-1037.  

Gandomi, Amir, and Murtaza Haider. "Beyond the hype: Big data concepts, methods, and analytics" International journal of information management 35.2 (2015): 137-144. 

Aldana Montes, José Francisco (2018). Introducción al trabajo con datos. Madrid: García-Maroto Editores 

Lucivero, Federica. "Big data, big waste? A reflection on the environmental sustainability of big data initiatives." Science and engineering ethics 26.2 (2020): 1009-1030.

 

Software

This is mostly a practical vourse,thus the required software is the usual one for the performance of capture, processing and analysis tasks in different formats.

In particular, the following tools are required:

Text editing software: Word or similar

Data analysis software: Illustrator, Flourish, Tableau, Infogram, Datawrapper