Code | Faculty | Department |
---|---|---|
02250196 | Faculty of Natural and Agricultural Sciences | Department: Statistics |
Credits | Duration | NQF level |
---|---|---|
Minimum duration of study: 1 year | Total credits: 180 | NQF level: 09 |
Details of compilation of curriculum are available from the Head of the Department of Statistics as well as from the departmental postgraduate brochure.
A candidate must compile his/her curriculum in consultation with the head of department or his representative. Refer to the Departmental website for further information.
Note: Additional modules may be required in order to reach the desired level of competency
The progress of all master's candidates is monitored biannually by the supervisor and the postgraduate coordinator. A candidate's study may be terminated if the progress is unsatisfactory or if the candidate is unable to finish his/her studies during the prescribed period.
Subject to exceptions approved by the Dean, on recommendation of the relevant head of department, and where applicable, a student may not enter for the master's examination in the same module more than twice.
Module content:
A compulsory bootcamp must be attended as part of this module – usually presented during the last week of January each year. Details regarding the venue and specific dates are made available by the department each year. The bootcamp will cover the basics of research to prepare students for the research component of their degree. Students can be exempt from the bootcamp if it was already attended in a previous year or for a previous degree. Each year of registration for the master’s degree will also require the attendance of three departmental seminars. Students should ensure that their attendance is recorded by the postgraduate co-ordinator present at the seminars. The department approves the seminars attended. Students are also required to present their mini-dissertation research proposal within the department or at a conference.
Module content:
Unsupervised learning: deterministic clustering, model-based clustering, latent class and behavioural analytics, dimension reduction. Natural language processing and topic modelling; recommender systems. Organisation of data, data wrangling and data structure exploration.
Module content:
This module covers the most recent literature that discusses current and contemporary research topics in advanced data analytics.
Module content:
Difference equations. Lag operators. Stationary ARMA processes. Maximum likelihood estimation. Spectral analysis. Vector processes. Non-stationary time series. Long-memory processes.
Module content:
Supervised learning and applications. Multicollinearity, ridge regression, the LASSO and the elastic net. Parametric and nonparametric logistic regression and nonlinear regression. Survival regression. Regression extensions: Random forests MARS and Conjoint analysis. Neural networks.
Module content:
Reviewing, from a statistical perspective, the cyber-infrastructure ecosystem including distributed computing, multi node and distributed file eco systems, such as Amazon Web Services. Structured and unstructured data sources, including social media data and image data. Setting up of large data structures for analysis. Algorithms and techniques for computing statistics and statistical models on distributed data. Software to be used include, Hadoop, Map reduce, SAS, SAS Data loader for Hadoop.
Copyright © University of Pretoria 2024. All rights reserved.
Get Social With Us
Download the UP Mobile App