Please consider offering answers and suggestions to help other students!
And if you fix a problem by following a suggestion here,
it would be great if other interested students could see a short
"Great, fixed it!" followup message.
Hello Professor,
Hope you are doing well.
I am not able to find any feedback in my cssubmit dashboard. Can you kindly confirm if it was uploaded?
Thank you for your time.
Hi professor,
I see that one country is not present in one of the files (the year is different). For the data cleaning process, are we supposed to retain that row as well, or only keep the rows that are common to all 3 files (same country and year)?
Hello Professor, I seem to have a problem with the file join in the data cleaning part. The standard output seems to simply merge 3 tsv files together without deleting duplicates. What should I do to solve this problem? Thank you so much.
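For anyone comparing the two readings raised above, here is a minimal Python sketch of merging three tables on a (country, year) key so that each key appears exactly once. It is only an illustration, not the assignment's required method: the function name merge_by_key, the "how" parameter, and the sample countries and values are all made up. "outer" keeps every key and leaves missing cells empty; "inner" keeps only keys present in all three tables.

```python
def merge_by_key(tables, how="outer"):
    """Merge dicts of {(entity, year): value} into one row per key.

    how="outer": keep every key seen in any table; missing cells become "".
    how="inner": keep only keys that appear in every table.
    """
    key_sets = [set(table) for table in tables]
    if how == "outer":
        keys = set.union(*key_sets)
    else:
        keys = set.intersection(*key_sets)
    # One output row per key, so the same country/year is never duplicated.
    return [(*key, *(table.get(key, "") for table in tables))
            for key in sorted(keys)]


# Hypothetical sample data: "Bland" has no homicide figure for 2019.
gdp = {("Aland", 2019): "100", ("Bland", 2019): "200"}
homicide = {("Aland", 2019): "1.5"}
life = {("Aland", 2019): "80", ("Bland", 2019): "70"}

outer_rows = merge_by_key([gdp, homicide, life], how="outer")
inner_rows = merge_by_key([gdp, homicide, life], how="inner")
print(outer_rows)
print(inner_rows)
```

Which of the two behaviours is wanted is exactly the question being asked in this thread, so check it against the assignment spec before relying on either.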
I guess he was wondering how we should handle a country that lacks data in some column. For example, some rows have data on GDP but not homicide. Should we include the valid column in our correlation, or just ignore the entire line?
Apologies. I tried to attach the tsv and it failed. Hope this works
Entity | Code | Year | GDP per capita, PPP (constant 2017 international $) | Population (historical estimates) | Homicide rate per 100,000 population - Both sexes - All ages | Life expectancy...
Hi Michael
Could I do a sanity check?
I believe I have filtered the TSV data per the requirements and am about to exclude rows that have insufficient Cantril-value pairs. This will be done on the basis that
- Cantril data may be absent in certain years wi...
Dear Professor,
Thank you for your response.
I only see the zip file I submitted in cssubmit. Is there any chance that the feedback was not given or not uploaded?
Thank you again for your time.
Hi Zeke,
This is not an error. The data is valid; the Cantril values just happen to be the same. (Perhaps they asked the same person each year??) In any case, from the point of view of data cleaning, it is clean data, and for this analysis, all ...
ANONYMOUS wrote
Hi
When output goes to the screen, you cannot tell whether it came from stdout or stderr. The only way is to redirect the streams:
program_tested > from_stdout 2> from_stderr
Cheers
MichaelW
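The same separation of streams can also be checked from Python. This is only a sketch: the child program here is a made-up one-liner standing in for whatever program you are testing, and the variable names are my own.

```python
import subprocess
import sys

# Stand-in child program: writes one line to each stream.
child_code = "import sys; print('to stdout'); print('to stderr', file=sys.stderr)"

# capture_output=True collects stdout and stderr separately instead of
# letting both appear interleaved on the screen.
result = subprocess.run(
    [sys.executable, "-c", child_code],
    capture_output=True,
    text=True,
)

print("stdout:", result.stdout.strip())
print("stderr:", result.stderr.strip())
```

Captured this way, it is clear which stream each line was written to, which is the point made in the post above.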
Hi,
Given that you posted anonymously, I can't comment on your case, but most of the marks feedback has been posted. Just a small number are left: those which had been set aside (groan) and then the auto-testing failed, so they have to be tested by hand. I'm down to the...
Hi,
That is not how I understand things, which are illustrated in the sample1.xlsx worked example based on sample1.tsv. For each country there are 3 correlations to be done: GDP, Homicide Rate, and Life Expectancy, each versus Cantril, based on predicto...
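As a rough illustration of "three correlations per country, each versus Cantril", here is a Python sketch. It is not the sample1.xlsx method itself: I am assuming plain Pearson correlation, assuming that a pair is used only when both the predictor and the Cantril value are present, and the dictionary keys and numbers are invented for one hypothetical country.

```python
import math


def pearson(pairs):
    """Pearson correlation of (x, y) pairs; None if it cannot be computed."""
    n = len(pairs)
    if n < 2:
        return None
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    if sxx == 0 or syy == 0:
        return None  # e.g. identical Cantril values every year
    return sxy / math.sqrt(sxx * syy)


def correlations_vs_cantril(rows, predictors=("gdp", "homicide", "life")):
    """One correlation per predictor, using only complete pairs (assumption)."""
    result = {}
    for name in predictors:
        pairs = [(row[name], row["cantril"]) for row in rows
                 if row.get(name) is not None and row.get("cantril") is not None]
        result[name] = pearson(pairs)
    return result


# Invented rows for a single country; the homicide value is missing in one year.
rows = [
    {"cantril": 5.0, "gdp": 10.0, "homicide": 3.0, "life": 70.0},
    {"cantril": 6.0, "gdp": 20.0, "homicide": None, "life": 71.0},
    {"cantril": 7.0, "gdp": 30.0, "homicide": 1.0, "life": 72.0},
]
corr = correlations_vs_cantril(rows)
print(corr)
```

With this choice, a missing homicide value only removes that year from the homicide correlation; the GDP and life expectancy correlations still use all three years. Whether that matches the spec is worth confirming against the worked example.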
Hi Kai,
Empty cells are fine anywhere for the data cleaning program, so long as the rows are within the range of years of interest. Re double quotes, you only need these in CSV format, because a comma can occur naturally within a string, so the whole s...
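The point about quoting can be seen with Python's standard csv module, using a made-up row whose first field contains a comma. This is just a sketch to show the difference between the two formats, not part of the assignment.

```python
import csv
import io

# Made-up row: the first field contains a comma.
row = ["Korea, Rep.", "2019", "42.5"]

# Comma-separated: the writer must quote the field containing a comma.
csv_buffer = io.StringIO()
csv.writer(csv_buffer).writerow(row)
csv_line = csv_buffer.getvalue()

# Tab-separated: the comma is ordinary text, so no quotes are added.
tsv_buffer = io.StringIO()
csv.writer(tsv_buffer, delimiter="\t").writerow(row)
tsv_line = tsv_buffer.getvalue()

print(repr(csv_line))
print(repr(tsv_line))
```

The writer's default quoting mode only adds quotes where a field would otherwise be ambiguous, which is why the tab-separated line comes out unquoted.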
ANONYMOUS wrote
Hi, as far as I know, the marks are on csmarks, but the feedback files are associated with cssubmit. I'm down to the last few, which had to be marked by hand, for a variety of reasons. Speaking of which, I really should get back to m...
ANONYMOUS wrote
Hi
I certainly believe so. Don't forget, you have all the TSV source files, so you can work forward and see what should be there. Have I made a mistake?
Cheers
MichaelW