Faculties and departments | Blackbuck Graduate Program

Course Curriculum

DSC01	Mathematics for Data Scientists	Distribution of random variables, conditional probability and independence, distributions of functions of random variables, limiting distributions. Differentiation and integration of functions; basic matrix operations; linearization; linear and nonlinear optimization techniques; clustering and similarity measures, basic computational algorithms. Includes frequent illustration of concepts using mathematical computation tools.	Term 1 45 Hours
DSC02	Statistical Programming	Programming structures and algorithms for large-scale statistical data processing and visualization. Students will use commonly available data analysis software packages to apply concepts and skills to large data sets and will also develop their own code using an object¬ oriented programming language.	Term 1 60 Hours
DSC03	Data Science Fundamentals	This module teaches students about how data science is performed within academic and industry (via invited talks), research methods and how different research strategies are applied across different disciplines, and data science techniques for processing and analysing data.	Term1 20 Hours
DSC04	Introduction to Data Mining and Analytics	Overview of the field of data mining and analytics; large-scale file systems and Map-Reduce, measures of similarity, link analysis, frequent item sets, clustering, e-advertising as an application, recommendation systems. Data Mining course has a special focus on statistical model building for Data Mining for Marketing, Sales and Finance	Term1 60 Hours
DSC05	Applied Data Mining	This module will provide students with up-to-date information on current applications of data in both industry and research. The module will build on Fundamentals of Data by explaining how data is processed and applied at large-scale across a variety of different areas.	Term1 20 Hours
DSC06	Large-Scale Data Storage Systems	The design and operation of large-scale, cloud-based systems for storing data. Topics include operating system virtualization, distributed network storage; distributed computing, cloud models (IAAS, PAAS, and SAAS), and techniques for securing cloud and virtual systems.	Term1 30 Hours
DSC07	Programming for Data Science	This module aims to provide students with the necessary programming skills to statistically process and explore disparate datasets using R, to become confident in using this language to create and analyse variables in order to discover patterns and relationships through the use of visualisation, testing and modelling. It also aims to provide students with experience in using object-oriented programming concepts and principles to read in data from both local files and databases so that it can be merged together, using record-reconciliation techniques, and then output this into a single file for processing; this will be taught using the object-oriented programming language Java. The teaching of both Java and R is essential here as the former is well-suited to handling data, via the creation of bespoke data objects, while the latter is good for statistically assessing data.	Term2 120 Hours
DSC08	Forecasting	The module introduces time series and causal forecasting methods so that passing students will be able to prepare methodologically competent, understandable and concisely presented reports for clients. By the end of the course, students should be able to model causal and time series models, assess their accuracy and robustness and apply them in a real world problem domain.	Term2 60 Hours
DSC09	Data Visualization	The theory and practice of visualizing large, complicated data sets to clarify areas of emphasis. Human factors best practices will be presented. Programming with advanced visualization frameworks and practices will be demonstrated and used in group programming projects.	Term2 60 Hours
DSC10	Project	o complete the MS degree each student must undertake a project worth 60 credits. This is a project chosen by you to investigate a challenging but constrained Data Science problem. The project will integrate the subject knowledge and generic skills that you will acquire during your Masters. We offer a wide range of projects, and each student is normally allocated a different project. We take student preferences and capabilities into account when we allocate the projects. The student will also have the opportunity to propose his / her own project, subject to academic approval.	Term3&4 500 Hours

Advanced Analytics and Data Science

Course Curriculum