Durham University
Programme and Module Handbook

Postgraduate Programme and Module Handbook 2024-2025

Module MATH42515: Data Exploration, Visualization, and Unsupervised Learning

Department: Mathematical Sciences

MATH42515: Data Exploration, Visualization, and Unsupervised Learning

Type Tied Level 4 Credits 15 Availability Available in 2024/2025 Module Cap None.
Tied to G5K823
Tied to G5K923
Tied to G5P223
Tied to G5P123
Tied to G5P323
Tied to G5P423

Prerequisites

  • None

Corequisites

  • None

Excluded Combination of Modules

  • None

Aims

  • To introduce the concepts and methods of exploratory data analysis, data visualization, and unsupervised learning.

Content

  • Advanced exploratory data analysis.
  • Density estimation and data visualization.
  • Unsupervised learning and clustering.
  • Principal component analysis (PCA) and dimension reduction.
  • Data visualization and statistical computing with R.
  • Methods for non-numerical data: e.g. categorical, spatial and temporal, text, images, networks, graphs.
  • Further topics: e.g. anomaly detection, treatment of missing values, association rules.

Learning Outcomes

Subject-specific Knowledge:
  • By the end of the module students will:
  • Have a systematic and coherent understanding of the theory, computation, and application of the topics studied;
  • Have mastered advanced exploratory data analysis, data visualization, and statistical computing with R;
  • Have acquired a coherent body of applicable knowledge on density estimation, unsupervised learning, clustering, PCA and dimension reduction.
Subject-specific Skills:
  • In addition, students will have acquired:
  • The ability to use statistical software R to conduct synthesis of data, data analysis, and date visualization;
  • Programming skills generally used in advanced methods such as clustering and unsupervised learning;
  • The ability to identify and apply appropriate unsupervised learning methods to modern real-world problems.
Key Skills:
  • Sufficient mastery of advanced data analysis, data visualization and unsupervised learning methods and ability to apply them appropriately to real-world applications.
  • The ability to clearly communicate statistical methods and relevant conclusions through writing.
  • The ability to organise prioritise, and manage time effectively.
  • The ability to advance and extend their knowledge through significant independent learning and research.

Modes of Teaching, Learning and Assessment and how these contribute to the learning outcomes of the module

  • This module will be delivered by the Department of Mathematical Sciences.
  • Teaching will be delivered primarily by workshops, lectures, and practicals.
  • Lectures demonstrate what is required to be learned and the application of the theory to practical examples.
  • Workshops describe theory and its application to concrete examples, enable students to test and develop their understanding of the material by applying it to practical problems, and provide feedback and encourage active engagement.
  • Workshops are a combination of taught material, practicals, problem classes, tutorials and guided group work.
  • Practicals consolidate the studied material, explore theoretical ideas in practice, enhance practical understanding, and develop practical data analysis skills.
  • Lectures, workshops, and computer practicas will be supported by the distribution of materials such as video content, directed reading, e-assessments, reflective activities, opportunities for self-assessment, and peer-to-peer learning within a tutor-facilitated discussion board.
  • Students will be able to obtain further help in their studies via scheduled office hours as well as by approaching their lecturers by email.
  • Students will be expected to work in between workshops and lectures, and to discuss their own work during the workshops. This work will be guided by the module leader, but will be organised by the students themselves, thereby enabling them to demonstrate their time management skills.
  • Students will undertake independent research to further their knowledge of the topic and self-directed learning to further their technical and transferable skills.
  • The workshops also provide opportunities for module leaders to monitor progress and to provide feedback and guidance on the development of ideas for the project, and for students to gauge their progress throughout the duration of the module.
  • Student performance will be assessed through two individual assignments.
  • The assignments will provide the means for students to demonstrate their acquisition of subject knowledge and the development of their problem-solving skills.

Teaching Methods and Learning Hours

Activity Number Frequency Duration Total/Hours
Lectures 16 2 times per week (Term 2, weeks 1-4, 6-9) 1 hour 16
Workshops 8 1 time per week (Term 2, weeks 1-4, 6-9) 1 hour 8
Practicals 8 1 time per week (Term 2, weeks 1-4, 6-9) 1 hour 8
Preparation, exercises, and reading 118
Total 150

Summative Assessment

Component: Assignment Component Weighting: 100%
Element Length / duration Element Weighting Resit Opportunity
Assignment 1 30% Yes
Assignment 2 70% Yes

Formative Assessment:

Workshop discussion of students' ideas and experiences; informal discussions of student progress with module leader when necessary.


Attendance at all activities marked with this symbol will be monitored. Students who fail to attend these activities, or to complete the summative or formative assessment specified above, will be subject to the procedures defined in the University's General Regulation V, and may be required to leave the University