Perform new course search

Search Filter

Data Capture and Preparations G (11520.1)

Level: Graduate Level
Credit Points: 3
HECS Bands:

Band 2 2013-2020 (Expires 31 Dec 2020) Band 2 2021 (Commenced After 1 Jan 2021) Band 3 2021 (Commenced Before 1 Jan 2021)

Faculty: Faculty of Science and Technology
Discipline: Academic Program Area - Technology


Possible changes to your unit's learning activities and assessment items

As a result of the Australian Government's and or the ACT Government’s directives requiring physical distancing and restrictions on movement because of the COVID-19 pandemic, you may find that learning activities and/or assessment items in some units you are studying have changed. These changes will not be updated in the published Unit Outline but will be communicated to you via your unit’s UCLearn(Canvas) teaching site. The new learning activities and/or assessment items will continue to meet the unit's learning outcomes, as described in the Unit Outline.

New learning activities and/or assessment items are available on your unit's UCLearn(Canvas) teaching site. Please contact your Unit Convener with any questions.

Unit Outlines

To view your Unit Outline, click View to log in to MyUC and access this information, or visit your unit's online teaching site.

  • Semester 1, 2021, ON-CAMPUS, BRUCE (200473) - View
  • Semester 1, 2020, ON-CAMPUS, BRUCE (192330) - View

If a link to your Unit Outline is not displayed, please check back later. Unit Outlines are generally published by Week One of the relevant teaching period.


A score skill of a data scientist is to capture, extract and clean data. Real world data often come from various data sources, in various formats and are unorganized. This unit introduces students to the concepts and techniques a data scientist employs in the early stages of data analysis process. This unit will provide hands-on experience in capturing data from sensors, collecting data from public information as well as working with existing data sets using real-world examples. Such data may be temporal or spatial, ordinal or categorical, embedded in documents or files. Students will learn how to import and clean the data, which usually involves multiple, often complicated, steps to convert data from its raw format to a clean format that greatly facilitates the later stages of the data analysis. This is known as data wrangling.

Learning Outcomes

After successful completion of this unit, students will be able to:

1. Work with sensors for capturing data;

2. Choose and apply appropriate techniques for capturing data from existing sources;

3. Import data into R;

4. Convert data from one format to another one in R;

5. Employ suitable techniques for tidying data; and

6. Develop a sound understanding of text mining methods in R.

Assessment Items

Contact Hours

Four hours of problem-based learning activities, interactive workshops & practical work in laboratory classes on campus per week.





Assumed Knowledge

Working knowledge of discrete mathematics, algebra and numerical analysis.

Incompatible Units


Equivalent Units


Print this page
Why choose UC

Perform new course search

Search Filter

Frequently Asked Questions