It's UWAweek 48


This forum is provided to promote discussion amongst students enrolled in Open Source Tools and Scripting.

Please consider offering answers and suggestions to help other students! And if you fix a problem by following a suggestion here, it would be great if other interested students could see a short "Great, fixed it!"  followup message.

How do I ask a good question?
Displaying selected article
Showing 1 of 564 articles.
Currently no other people reading this forum.

 UWA week 16 (1st semester, non-teaching week) ↓
SVG not supported

Login to reply

1:10am Fri 22nd Apr, ANONYMOUS

"Michael Wise" <mi*h*e*.*i*[email protected]*a*e*u*a*> wrote:
> "Hanlin Zhang" <22*4*4*[email protected]*u*e*t*u*a*e*u*a*> wrote: > > > Could u plz clarify this issue: > > > > If user enter 'Vietnam', does shell need to return the result of 'Viet nam' (Vietnam show as Viet nam in the data set)? Or shell should say something like 'country not found in the data set' if user enter 'Vietnam' instead of 'Viet nam'? > > Hi Hanlin, > Greetings from Kalgoorlie. > Clearly, Viet Nam and Vietnam are the same place, and it would be incorrect and unhelpful to say that the data is > not found in the dataset. Clearly, your program will need to do some data cleaning up front on the country data, and then also on the user queries to make sure that the things that are in fact the same, end up looking the same. > > Does that make sense? > > Cheers > MichaelW
Hi Michael, Sorry to harp on this issue, but I am still confused on how to deal with the different variants of a country's name. I took your earlier reply "Seeing that is the format that the data as downloaded uses, that is the format I'll be using for testing." to mean that you will test our programs using the spelling of a country's name as found in the dataset, meaning that we need not worry about cleaning "Viet Nam" into "Vietnam" such that a user's input of "Viet Nam" and "Vietnam" will yield the same results. However, your later reply seems to indicate that you expect us to account for the different possible variations of a country's name, which would be impossible to implement aside from hard-coding every possibility (Laos and Lao's People Democratic Republic, U.K and United Kingdom, Siam and Thailand, Myanmar and Burma). Do you just mean that our code must account for easily implementable cases like Vietnam and Viet Nam (we can easily remove spaces between words)? Based on the reasoning "Clearly, Viet Nam and Vietnam are the same place, and it would be incorrect and unhelpful to say that the data is
> not found in the dataset.", can we instead, if our program does not recognise a user input, return a list of country names found in the dataset (along with an appropriate error message)?

The University of Western Australia

Computer Science and Software Engineering

CRICOS Code: 00126G
Written by [email protected]
Powered by history
Feedback always welcome - it makes our software better!
Last modified  1:17AM Sep 14 2022
Privacy policy