Please consider offering answers and suggestions to help other students!
And if you fix a problem by following a suggestion here,
it would be great if other interested students could see a short
"Great, fixed it!" followup message.
Hi, professor
I have a question about the standard of combination.
You mention that : "HINT: for each of the 3 input files, create a temporary version with an extra column that combines country code and year. You can then use join to join two of the 3 files, and then the third with the new combination."
I notice that there are different ways, such as inner, Left, right and outer join, which will influence the output. I have no idea which one should I choose, as the sample2.tsv just has few data.
In the output file, should it include all the "code + year" from three files, even if some files don't have this match key's data? or just base on one file to match, making sure this code year has data?
thank you.
In the output file, should it include all the "code + year" from three files, even if some files don't have this match key's data?
As I understand it, yes.
Because sample2 is just a sample output format, not the actual merged result of the three files, and the requirements mention that cells can be empty.