It's UWAweek 26 (mid-year break)


This forum is provided to promote discussion amongst students enrolled in Open Source Tools and Scripting.

Please consider offering answers and suggestions to help other students! And if you fix a problem by following a suggestion here, it would be great if other interested students could see a short "Great, fixed it!"  followup message.

How do I ask a good question?
Displaying the 2 articles in this topic
Showing 2 of 439 articles.
Currently 6 other people reading this forum.

 UWA week 20 (1st semester, week 11) ↓
SVG not supported

Login to reply

5:27pm Thu 16th May, Jinyoung L.

I would like to check what I understand from the guideline is correct. I made a sample data sheet to explain. * This sheet is not in real and is finished by my_cantril_data_cleaning. <sample data sheet> <Country> <Code> <Year> <GDP> <Population> <Homicide> <Life> <Cantril> Australia AUS 2011 1 1 1 0 1 Australia AUS 2012 2 2 2 2 Australia AUS 2013 3 3 3 3 3 Australia AUS 2014 4 4 4 4 Q1. AS for "Also based on the header line, report any lines that do not have the same number of cells. (Cells are allowed be empty)", even if there is a empty value like AUS-2012-population, does this line go to stdout of my_cantril_data_cleaning, which in the same cell numbers of the header? Q2. In using best_predictor, the line of 2014 will not go to join to be calculated because the cantril of 2011 is empty, right? Q3. In using best_perdictor, I believe that the line of Life-2011, 0, goes to be calculated, but the line of Life-2012, empty, will not go to be calculated, right? Q4. If the three questions above are right, the number of the pair of Life-Cantril, 2, is not sufficient with the guideline. I will not calculate the information of AUS for Life-Cantril correlation. But, Do I have to calculate for the GDP-Cantril of AUS because the number of GDP-Cantril of AUS is more than 3? Q5. If the four questions above are right, the quantity of information, n, for calculation correlations would be different for each countries and in a case, the each quantities of correlations to calculate a mean correlation of Homicide, GDP, Population, and Life would be different. Is this right?

SVG not supported

Login to reply

11:46am Sat 18th May, ANONYMOUS

Some unofficial answers here, but these are exactly how I worked on my own assignment.


According to the clarification email from the professor:

All rows with a country code, between the years 2011 and 2021 are to be reported.

So an empty value like population does not affect whether a line should be outputted or not.






Again, according to the clarification email from the professor:

That implies that, for a given country and predictor, each Cantril data point corresponds to a a predictor data point, and there are at least 3 of those. This also implies that the number of correlations may be slightly different for each predictor.

So Life-Cantril correlation does not need to be calculated in this case.

But you still need to calculate correlation for GDP-Cantril.


Yes. For each predictor, n can be different for each entity, for the reason we talked about in Q4.

The University of Western Australia

Computer Science and Software Engineering

CRICOS Code: 00126G
Written by [email protected]
Powered by history
Feedback always welcome - it makes our software better!
Last modified  5:07AM Sep 06 2023
Privacy policy