It's UWAweek 47

unitinfo

This page provides helpful information about many coursework units offered by Computer Science and Software Engineering in 2023.

The information here is not official - for official information please see the current UWA Handbook. Instead, it will help students to prepare for their future units, before the beginning of each semester, and before they have access to UWA's Learning Management System (LMS).

About the unit CITS4009 Computational Data Analysis (2nd semester 2023)

Unit description:

This unit answers an urgent call to harness the unprecedented amount of data now generated from every facet of our daily life by introducing data science as the discipline dealing with collecting, representing, manipulating and visualising data in contemporary society. Students taking the unit learn to write computer programs to extract, transform and integrate data from multiple heterogeneous sources, including traditional relational databases and web-based resources. Different data representation formats such as XML, JSON and HDF5, as well as storage options including SQL and NOSQL type of databases, are introduced and compared. Another core objective is the development of programming skills to enable effective and meaningful visualisation of the data. Students are given the opportunity to put the learned knowledge in data acquisition, data processing, data representation and exploratory visualisation into practice through projects that are highly relevant to real-world data analytics. The unit provides the fundamental knowledge, introduces the essential processes for exploratory data analysis and builds the specific critical programming skills required during the journey of growing a student into a capable data scientist.

Unit outcomes:

Students are able to (1) write programs to systematically collect, process and integrate data of different types and from different sources.; (2) select appropriate data visualisation options; (3) demonstrate programming abilities to build solutions for exploratory data analysis using visualisation and clustering techniques; (4) critically assess the outcomes of a data analysis; and (5) communicate effectively with stakeholders.

Unit coordinator:

Dr Du Huynh
[email protected]

Unit homepage:

Unit is offered in these majors and courses:

Indicative weekly topics:

week 1 Data Science Primer
week 2 Basic R and Visualization
week 3 Exploratory Data Analysis - Single Variables
week 4 Exploratory Data Analysis - Two or More Variables
week 5 Data Cleaning
week 6 Data Transformation
week 7 Mid-semester Test (online, open-book, multiple choice questions)
week 8 Classification Model Evaluation and Single Variable Models
week 9 Single Variable Models
week 10 Multi-Variable Models (Decision Trees, kNN)
week 11 Unsupervised Methods - Hierarchical Clustering and k-means Clustering
week 12 Linear and Logistic Regression

Indicative assessment:

Mid-semester test, 2 programming projects, and final exam

Useful prior experience and background knowledge:

Some programming background (Python, Java, or C)

Useful prior programming and software experience:

R and RStudio

Hardware required for this unit:

Students are able to undertake their laboratory exercises and projects in laboratories in the CSSE building, but most students also complete work on their own laptops.
The following hardware is required to successfully complete this unit:


Standard laptop

Operating system(s) used in this unit:

Different units will use different operating systems for their teaching - for in-class examples, laboratory exercises, and programming projects.
If an operating system is REQUIRED, it will be used when marking assessments.


ANY reasonable platform




This information last updated 6:48pm Thu 20th Apr 2023

The University of Western Australia

Computer Science and Software Engineering

CRICOS Code: 00126G
Written by [email protected]
Powered by history
Feedback always welcome - it makes our software better!
Last modified  8:32AM Jul 16 2023
Privacy policy