/Data Management and Analysis
Provided by: Open University
Course Area: All areas
Course Code: TM351
Course Type: Other undergraduate
Start date: 20211002
End date: 20220630
Subjects: Data Analytics, Data Science, Information Visualisation
Who is this module for
This module is an Open University Stage 3 (final year undergraduate) module. OU stage 3 modules build on study skills and subject knowledge acquired from previous studies at stages 1 and 2. They are only intended for students with recent experience of higher education in a related subject.
What will I learn
Some of the key concepts that are covered in this module include:
Introducing data analysis
Starting with a data file such as a spreadsheet, this unit will provide you with a brief introduction to some basic operations on simple data files. This will give you an opportunity to study an outline of the key ideas in the module and help you become familiar with the module software.
Concepts in data management
You will look at three key areas in data management: data architectures and data access (CRUD), data integrity, and transaction management (ACID). Each of these topics will be illustrated using a relational database, and one non-relational alternative. The advantages and limitations of each model are discussed.
Legal and ethical issues
Here you will consider the legal and ethical issues involved in managing data collections. You will be required to obtain and read (parts of) the Data Protection Act and the Freedom of Information Act, and demonstrate how these apply to issues in data management. You will also consider privacy, ownership, intellectual property and licensing issues in data collection, management, retrieval and reuse.
Concepts in data analytics
These sections will focus on using data to answer a real question; the focus will be on exploratory techniques (such as visualisation) and formulating a question into a form that can be answered realistically using the data that is available. Issues in processing techniques for large and real-time streamed data collections will also be addressed along with techniques and technologies (such as MapReduce) for handling them. In this part of the module you will use a statistical package such as the python scientific libraries and/or ggplot2 to visualise the data and carry out appropriate analyses.