Module Database Search



MODULE DESCRIPTOR
Module Title
Big Data Systems
Reference CM3153 Version 1
Created December 2023 SCQF Level SCQF 9
Approved April 2024 SCQF Points 15
Amended ECTS Points 7.5

Aims of Module
This modules aims to introduce students to state-of-the-art tools and analytics techniques for Big Data tasks, including NoSQL data stores, and modern parallel computation methodologies.

Learning Outcomes for Module
On completion of this module, students are expected to be able to:
1 Discuss different components of the Big Data ecosystem.
2 Manipulate different types of NoSQL data stores.
3 Assess the suitability of a given data store for a given problem.
4 Demonstrate the use of a parallel computation framework to extract information from Big Data.
5 Assemble a scalable data solution using a Big Data computation framework.

Indicative Module Content
Big Data ecosystems. Three V's - volume, velocity, veracity. Key-value store, document store, graph database, relational database. Schema migration. Apache Hadoop, MapReduce, Spark. Issues in Big Data ethics, law and security.

Module Delivery
This module is based on lectures supplemented with laboratory sessions, where appropriate software is applied to varied learning tasks and practical exercises.

Indicative Student Workload Full Time Part Time
Contact Hours 30 N/A
Non-Contact Hours 120 N/A
Placement/Work-Based Learning Experience [Notional] Hours N/A N/A
TOTAL 150 N/A
Actual Placement hours for professional, statutory or regulatory body    

ASSESSMENT PLAN
If a major/minor model is used and box is ticked, % weightings below are indicative only.
Component 1
Type: Coursework Weighting: 100% Outcomes Assessed: 1, 2, 3, 4, 5
Description: This coursework will consist of a big data development exercise.

MODULE PERFORMANCE DESCRIPTOR
Explanatory Text
The calculation of the overall grade for this module is based on 100% weighting of component 1. An overall minimum grade D is required to pass the module.
Module Grade Minimum Requirements to achieve Module Grade:
A The student must achieve an A in C1.
B The student must achieve a B in C1.
C The student must achieve a C in C1.
D The student must achieve a D in C1.
E The student must achieve an E in C1.
F The student must achieve an F in C1.
NS Non-submission of work by published deadline or non-attendance for examination

Module Requirements
Prerequisites for Module None.
Corequisites for module None.
Precluded Modules None.

INDICATIVE BIBLIOGRAPHY
1 HARRISON, G., 2015. Next Generation Databases: NoSQL, NewSQL, and Big Data. Apress.
2 LESKOVEC, J., ANAND, R. and ULLMAN, J.D., 2015. Mining of massive datasets. Cambridge University Press.
3 BERMAN, J., 2018. Principles and practice of big data: preparing, sharing, and analyzing complex information. 2nd ed. London: Academic Press.
4 MISHRA, R., 2018. PySpark recipes: a problem-solution approach with PySpark2. United States: Apress.
5 WIKTORSKI, T., 2019. Data-intensive systems principles and fundamentals using Hadoop and Spark. Cham: Springer.


Robert Gordon University, Garthdee House, Aberdeen, AB10 7QB, Scotland, UK: a Scottish charity, registration No. SC013781