Foundations of Data Science II
Overview
- Credit value: 15 credits at Level 5
- Convenor: Dr Paul Yoo
- Prerequisite: Foundations of Data Science I
- Assessment: an online quiz (50%) and two-hour examination (50%)
Module description
Continuing directly from Foundations of Data Science I, this module covers further fundamental aspects of data science and analytics. You will consolidate the knowledge acquired on FDS I and further develop your mathematical knowledge and skills, including basic elements of calculus, linear algebra, continuous probability theory and statistics.
We will show you how to use the popular and powerful language Python to solve computational tasks from these mathematical subjects, and acquaint you with popular Python libraries and packages for programming to solve problems arising from calculus, probability theory and statistics.
Indicative syllabus
- Differentiation
- Indefinite and definite integration
- Solving systems of polynomial equations (e.g. Newton’s methods) and basic optimisation algorithm (e.g. gradient descent)
- Continuous probability (e.g. random variables, pdf, cdf, expectation, variance and correlation)
- Common distribution families (e.g. Poisson, normal distribution)
- Probabilistic inequalities and concentration (e.g. LNT, CLT)
- Statistical testing (e.g. hypothesis testing, chi-squared testing)
- Sampling and confidence intervals
- Eigenvalues and eigenvectors
- SVD decomposition
- Tools: Python
Learning objectives
By the end of this module, you will have:
- knowledge of basic calculus, further linear algebra and matrix theory, continuous probability theory and statistics, and relevant Python libraries and packages
- satisfactory skills of programming in Python to solve computational tasks from calculus, linear algebra, continuous probability theory and statistics
- an understanding of the link between the basic knowledge acquired from the module and data science/analytics applications.