PLEASE NOTE: the upgrades to this server, secure.csse.uwa.edu.au, have not yet been completed.
Hopefully the changes will be completed on THURSDAY 8th December.
Web-based programs, such as csmarks, cssubmit, and the help fora, will be unavailable at some time on Thursday 8th.
  It's UWAweek 49

help2003/help4407

This forum is provided to promote discussion amongst students enrolled in Open Source Tools and Scripting.

Please consider offering answers and suggestions to help other students! And if you fix a problem by following a suggestion here, it would be great if other interested students could see a short "Great, fixed it!"  followup message.

How do I ask a good question?
Displaying the 6 articles in this topic
Showing 6 of 564 articles.
Currently 2 other people reading this forum.


 UWA week 15 (1st semester, week 7) ↓
SVG not supported

Login to reply

👍?
helpful
4:48pm Thu 14th Apr, ANONYMOUS

Hello, I want to know if a word followed by a punctuation e.g. "hello," should be considered as a different word from "hello"? Since it was mentioned that lower-case letters differ from their upper-case counterparts in the assignment, so should my case be considered as well? Thanks


SVG not supported

Login to reply

👍x1
helpful
10:50am Fri 15th Apr, Ryan B.

ANONYMOUS wrote:
> Hello, I want to know if a word followed by a punctuation e.g. "hello," should be considered as a different word from "hello"? Since it was mentioned that lower-case letters differ from their upper-case counterparts in the assignment, so should my case be considered as well? Thanks
Hullo, This would probably make things easier for you (as I'm sure you've already run a simple text file and confirmed what `uniq` gives you for the results here). I'd encourage you to think about why one might be necessary, and the other unnecessary for the purposes of this assignment (as other edge cases are likely to come up and reason about). To get you started; ask yourself "why it may be prudent to count hello/Hello differently, and exclude the differences between "hello" and "hello?"?". Lower case and upper case are different because they are still words, but can potentially have different meanings (e.g. you could be referring to the "river", or (actor) "River" Pheonix; the common noun vs proper noun lead to different meanings, and therefore if we were applying some commonality ranking analysis it would make sense to differentiate). Counting punctuation might make sense if we were curious to investigate the rankings of words that occur in parts of the sentence, e.g. what is the most common end of sentence word ("!", ".", "?"), or compound words ("-"), but we are not asking you do to that; the only thing we care about is how common that word is. Hope this helps! Ryan (lab guy).


 UWA week 16 (1st semester, non-teaching week) ↓
SVG not supported

Login to reply

👍?
helpful
6:14pm Mon 18th Apr, Peter M.

"Ryan Bunney" <ry*n*b*n*e*@*e*e*r*h*u*a*e*u*a*> wrote:
> ANONYMOUS wrote: > > > Hello, I want to know if a word followed by a punctuation e.g. "hello," should be considered as a different word from "hello"? Since it was mentioned that lower-case letters differ from their upper-case counterparts in the assignment, so should my case be considered as well? Thanks > > Hullo, > > This would probably make things easier for you (as I'm sure you've already run a simple text file and confirmed what `uniq` gives you for the results here). I'd encourage you to think about why one might be necessary, and the other unnecessary for the purposes of this assignment (as other edge cases are likely to come up and reason about). > > To get you started; ask yourself "why it may be prudent to count hello/Hello differently, and exclude the differences between "hello" and "hello?"?". Lower case and upper case are different because they are still words, but can potentially have different meanings (e.g. you could be referring to the "river", or (actor) "River" Pheonix; the common noun vs proper noun lead to different meanings, and therefore if we were applying some commonality ranking analysis it would make sense to differentiate). > > Counting punctuation might make sense if we were curious to investigate the rankings of words that occur in parts of the sentence, e.g. what is the most common end of sentence word ("!", ".", "?"), or compound words ("-"), but we are not asking you do to that; the only thing we care about is how common that word is. > > Hope this helps! > > Ryan (lab guy).
Hi Ryan, I just want to add to this thread by throwing out the question: what is the strict definition of a 'word' for this assignment? What about the plural form e.g. bird and birds? With reference to the preceding discussion, if we ignore punctuation, the possessive form "bird's" with apostrophe removed ends up as "birds" which then clashes with the plural form "birds"(!). - Peter


SVG not supported

Login to reply

👍x1
helpful
9:28pm Mon 18th Apr, Ryan B.

"Peter Millitz" <23*8*2*[email protected]*u*e*t*u*a*e*u*a*> wrote:
> "Ryan Bunney" <ry*n*b*n*e*@*e*e*r*h*u*a*e*u*a*> wrote: > > > ANONYMOUS wrote: > > > > > Hello, I want to know if a word followed by a punctuation e.g. "hello," should be considered as a different word from "hello"? Since it was mentioned that lower-case letters differ from their upper-case counterparts in the assignment, so should my case be considered as well? Thanks > > > > Hullo, > > > > This would probably make things easier for you (as I'm sure you've already run a simple text file and confirmed what `uniq` gives you for the results here). I'd encourage you to think about why one might be necessary, and the other unnecessary for the purposes of this assignment (as other edge cases are likely to come up and reason about). > > > > To get you started; ask yourself "why it may be prudent to count hello/Hello differently, and exclude the differences between "hello" and "hello?"?". Lower case and upper case are different because they are still words, but can potentially have different meanings (e.g. you could be referring to the "river", or (actor) "River" Pheonix; the common noun vs proper noun lead to different meanings, and therefore if we were applying some commonality ranking analysis it would make sense to differentiate). > > > > Counting punctuation might make sense if we were curious to investigate the rankings of words that occur in parts of the sentence, e.g. what is the most common end of sentence word ("!", ".", "?"), or compound words ("-"), but we are not asking you do to that; the only thing we care about is how common that word is. > > > > Hope this helps! > > > > Ryan (lab guy). > > Hi Ryan, > > I just want to add to this thread by throwing out the question: what is the strict definition of a 'word' for this assignment? > > What about the plural form e.g. bird and birds? With reference to the preceding discussion, if we ignore punctuation, the possessive form "bird's" with apostrophe removed ends up as "birds" which then clashes with the plural form "birds"(!). > > - Peter
Hi Peter, It's a good question - is "bird's" a separate word to bird, i.e. if we refer to the "bird's lunch", is this just another instance of the word 'bird' in a possessive context? I would be treat anything ending with an "'s" as "a special case" and count accordingly (as 'it is' will be different to "it's", and we wouldn't expect you to expand the latter to count each of the former). Hope this helps, Ryan.


SVG not supported

Login to reply

👍?
helpful
11:24pm Mon 18th Apr, ANONYMOUS

To add to the previous question, what about words that are linked with "-" such as "well-known"? Should this case be counted as 2 distinct words or as one? I think it would be more helpful to give the whole specification on what to be expected in the output of each word in each file for the purpose of counting the number of occurrences. Cheers!


SVG not supported

Login to reply

👍?
helpful
11:34pm Mon 18th Apr, ANONYMOUS

And also, I don't get how to preserve the 's when I try to output each word from the file, apparently the ' is recognised as a special character that prompts new inputs i.e. > *new input*

The University of Western Australia

Computer Science and Software Engineering

CRICOS Code: 00126G
Written by [email protected]
Powered by history
Feedback always welcome - it makes our software better!
Last modified  1:17AM Sep 14 2022
Privacy policy