It's UWAweek 47
|
unitinfo
This page provides helpful information about many coursework units offered by
Computer Science and Software Engineering
in 2023.
The information here is not official -
for official information please see the
current UWA Handbook.
Instead, it will help students to prepare for their future units,
before the beginning of each semester,
and before they have access to
UWA's
Learning Management System (LMS).
|
About the unit CITS4009 Computational Data Analysis (2nd semester 2023)
Unit description:
This unit answers an urgent call to harness the unprecedented amount of data now generated from every facet of our daily life by introducing data science as the discipline dealing with collecting, representing, manipulating and visualising data in contemporary society. Students taking the unit learn to write computer programs to extract, transform and integrate data from multiple heterogeneous sources, including traditional relational databases and web-based resources. Different data representation formats such as XML, JSON and HDF5, as well as storage options including SQL and NOSQL type of databases, are introduced and compared. Another core objective is the development of programming skills to enable effective and meaningful visualisation of the data. Students are given the opportunity to put the learned knowledge in data acquisition, data processing, data representation and exploratory visualisation into practice through projects that are highly relevant to real-world data analytics. The unit provides the fundamental knowledge, introduces the essential processes for exploratory data analysis and builds the specific critical programming skills required during the journey of growing a student into a capable data scientist.
Unit outcomes:
Students are able to (1) write programs to systematically collect, process and integrate data of different types and from different sources.; (2) select appropriate data visualisation options; (3) demonstrate programming abilities to build solutions for exploratory data analysis using visualisation and clustering techniques; (4) critically assess the outcomes of a data analysis; and (5) communicate effectively with stakeholders.
Unit coordinator:
Unit homepage:
|
|
Unit is offered in these majors and courses:
Indicative weekly topics:
week 1 |
Data Science Primer |
week 2 |
Basic R and Visualization |
week 3 |
Exploratory Data Analysis - Single Variables |
week 4 |
Exploratory Data Analysis - Two or More Variables |
week 5 |
Data Cleaning |
week 6 |
Data Transformation |
week 7 |
Mid-semester Test (online, open-book, multiple choice questions) |
week 8 |
Classification Model Evaluation and Single Variable Models |
week 9 |
Single Variable Models |
week 10 |
Multi-Variable Models (Decision Trees, kNN) |
week 11 |
Unsupervised Methods - Hierarchical Clustering and k-means Clustering |
week 12 |
Linear and Logistic Regression |
Indicative assessment:
Mid-semester test, 2 programming projects, and final exam
Useful prior experience and background knowledge:
Some programming background (
Python, Java, or
C)
Useful prior programming and software experience:
Hardware required for this unit:
Students are able to undertake their laboratory exercises and projects in laboratories in the CSSE building, but most students also complete work on their own laptops.
The following hardware is required to successfully complete this unit:Standard laptop
Operating system(s) used in this unit:
Different units will use different operating systems for their teaching - for in-class examples, laboratory exercises, and programming projects.
If an operating system is REQUIRED, it will be used when marking assessments.
ANY reasonable platform
This information last updated 6:48pm Thu 20th Apr 2023