Module Database Search



MODULE DESCRIPTOR
Module Title
Big Data Systems
Reference CM3708 Version 2
Created January 2023 SCQF Level SCQF 9
Approved May 2019 SCQF Points 30
Amended June 2023 ECTS Points 15

Aims of Module
To introduce students to the use of state-of-the-art Big Data analytics techniques and tools, including NoSQL data stores, and modern parallel computation methodologies.

Learning Outcomes for Module
On completion of this module, students are expected to be able to:
1 Compare and contrast different types of NoSQL data stores.
2 Critically analyse the suitability of a NoSQL data store for a given problem.
3 Extract actionable knowledge from big data, using the parallel computation framework.
4 Design, implement and evaluate scalable program solutions using a big data computation framework.

Indicative Module Content
NoSQL data stores (e.g., key-value, document, and graph). Case studies of NoSQL data stores. Properties of NoSQL data stores. Schema migration in NoSQL data stores. Modern parallel data processing techniques, e.g. MapReduce/Hadoop, Spark. Case studies on using parallel data processing for analysis and mining of Big Data.

Module Delivery
The module is delivered in Blended Learning mode using structured online learning materials/activities and directed study, facilitated by regular online tutor support. Workplace Mentor support and work-based learning activities will allow students to contextualise this learning to their own workplace. Face-to-face engagement occurs through annual induction sessions, employer work-site visits, and modular on-campus workshops.

Indicative Student Workload Full Time Part Time
Contact Hours 30 N/A
Non-Contact Hours 30 N/A
Placement/Work-Based Learning Experience [Notional] Hours 240 N/A
TOTAL 300 N/A
Actual Placement hours for professional, statutory or regulatory body 240  

ASSESSMENT PLAN
If a major/minor model is used and box is ticked, % weightings below are indicative only.
Component 1
Type: Coursework Weighting: 100% Outcomes Assessed: 1, 2, 3, 4
Description: This coursework will consist of a big data development exercise and analysis.

MODULE PERFORMANCE DESCRIPTOR
Explanatory Text
The calculation of the overall grade for this module is based on 100% weighting of C1. An overall minimum grade D is required to pass the module.
Module Grade Minimum Requirements to achieve Module Grade:
A The student must achieve an A in C1.
B The student must achieve a B in C1.
C The student must achieve a C in C1.
D The student must achieve a D in C1.
E The student must achieve an E in C1.
F The student must achieve an F in C1.
NS Non-submission of work by published deadline or non-attendance for examination

Module Requirements
Prerequisites for Module None, in addition to course entry requirements.
Corequisites for module None.
Precluded Modules None.

INDICATIVE BIBLIOGRAPHY
1 HARRISON, G., 2015. Next Generation Databases: NoSQL, NewSQL, and Big Data. Apress.
2 LESKOVEC, J., ANAND, R. and ULLMAN, J.D., 2015. Mining of massive datasets. Cambridge University Press.
3 BERMAN, J., 2018. Principles and practice of big data: preparing, sharing, and analyzing complex information. 2nd ed. London: Academic Press.
4 MISHRA, R., 2018. PySpark recipes: a problem-solution approach with PySpark2. United States: Apress.
5 WIKTORSKI, T., 2019. Data-intensive systems principles and fundamentals using Hadoop and Spark. Cham: Springer.


Robert Gordon University, Garthdee House, Aberdeen, AB10 7QB, Scotland, UK: a Scottish charity, registration No. SC013781