Data Visualisation and Predictive Modelling Using R
Overview
We are not currently accepting applications for this course.
This new short course is part of an Office for Students (OfS) funded pilot project to explore flexible courses that expand digital skillsets in data science and analysis.
There are two components to this course. Both require previous knowledge of R, and the second component will require a significant statistical background which the student is either expected to possess already or be capable of acquiring independently through the strength of prior studies or aptitude.
The course “Introduction to statistics and data science with R” is a suitable prerequisite.
Component 1 - Visualisation in R
The objective of this component is to identify patterns and display information from data from several sources; to encourage statistical thinking by a series of examples of good and not-so-good visualisations and guide students in their development of creativity within a scientific framework.
This component will highlight how visualisation plays a key role in many disciplines.
Component 2 - Predictive modelling
This component will introduce the principles for the application of linear modelling and machine learning methodologies for the analysis of experimental and observational data. R is used throughout.
The first strand of this component will study regression models, variable selection and ANOVA. The second strand will introduce several machine learning techniques. The third strand of the module will study techniques for dealing with missing data.
Learning outcomes
By the end of the course, you will be able to demonstrate the following:
- Summarise and understand information on categorical and continuous variables
- Explore relationships between different variables
- Display graphical information and complex relationships in datasets using R
- Use advanced statistical packages like ggplot2 and produce statistical reports with Rmarkdown
- Create interactive plots.
- Use the module statistical methods and techniques to review, consolidate, extend and apply knowledge.
- Critically evaluate the performance of the Data analysis methods studied.
- Familiarity with a general linear model using real data;
- Familiarity with assessing fitted models and validating linear model assumptions;
- Ability to identify and conduct simple designed experiments;
- Familiarity with the construction of factorial experiments in blocks;
- Ability to employ and assess the results of discriminant analysis, multiple logistic regression, principal component, clustering and multivariate analysis of variance with real observational data.
- Communicate via data visualisation and modelling results to both specialist and non-specialist audiences.
Entry Requirements
Please note that we are not currently accepting applications for this course.
Our short courses are designed to be accessible to all. There are no specific entry requirements you need to meet, and you do not need a background in mathematics or computer science to be eligible for this course.
You may find the course Introduction to statistics and data science with R helpful, but it is not a requirement for this course.
Structure
Module Outline
The purpose of this module is to introduce:
- Predictive modelling using R
- The use of R for data visualisation.
The module will be delivered over a 13-week period. This will consist of 10 hours of online lectures for each of the two components, totalling 20 hours, which will be broken into bite size chunks.
In addition, there will be 10 hours of face-to-face labs on campus for each component, totalling 20 hours of face-to-face labs.
Participants will also need to carry out independent work such as practicing skills and reading in advance of lectures and labs.
Teaching schedule
Monday 2 January 2023
10 – 1pm and 2 – 4pm – teaching over zoom
Tuesday 3 January 2023
10 – 12pm and 1 – 4pm – IT Labs on campus
Monday 9 January 2023
10 – 1pm and 2 – 4pm – teaching over zoom
Tuesday 10 January 2023
10 – 12pm and 1 – 4pm – IT Labs on campus
Thursday 12 January 2023
10 – 1pm – teaching over zoom
Friday 13 January 2023
10 – 12pm – IT labs on campus
Monday 27 March 2023
10 – 12pm and 1 – 4pm – teaching over zoom
Tuesday 28 March 2023
10 – 1pm and 2 – 4pm – IT labs on campus
Thursday 30 March 2023
10 – 12pm – teaching over zoom
Friday 31 March 2023
10 – 1pm – IT labs on campus
Please note dates and times may be subject to change.
Assessment strategy
Assessment will consist of a report of no more than 10 pages, and a related Rmarkdown.
In addition, participants will deliver a short presentation to describe their findings.
Fees and funding
The course costs £2,310. This includes all lectures, labs and assessment costs, and access to university facilities such as the library, the Silberrad Student Centre and student support services.
Funding for this course can be applied for through Student Finance Higher Education Short Course Loans.
What's next
All applications for this course can be made online.
For further help or information, please contact ofs-shortcourses@essex.ac.uk