

 UWA week 19 (1st semester, week 10)
3:49pm Sun 15th May, ANONYMOUS

Hi Micheal,

Could you please explain how the distance is calculated? The file "Lawrence_Lawson.txt" shows "because 0":

also            2.338
although        0
and             337.7
as              52.82
because         0
before          2.113
but             4.272
comma           102.9
compound_word   9.461
contraction     0.8221
for             87.13
if              0.4747
nor             0
of              131.3
or              12.93
semi_colon      8.451
sentence        46.97
since           2.113
that            10.85
though          2.113
until           0
when            0
whenever        0
whereas         0
which           1.696
while           0
yet             2.113

The Euclidian Distance between the two texts is: 28.609

In Lawrence_sample.txt, "Because" appears once, but I cannot find any "because" in Lawson_sample.txt, so I am really confused about why the score for "because" is 0.

Also, for this case the assignment says: "What I have also chosen to report is the squares of the style-metric differences, which suggests that the big contributors to the final distance are use of comma and "and" (at least for those two samples)." How do we decide which are "the big contributors"? The word "of" has 131.3 and comma has 102.9, so why are the big contributors comma and "and", rather than "of" and "and"?
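For reference, this is a minimal sketch of how I understand the final figure to follow from the numbers above, assuming each listed value is the squared difference for that key and the distance is the square root of their sum. The names squared_diffs, euclidean_distance and top_contributors are my own, not from the assignment.

```python
import math

# Squared style-metric differences, copied from the output quoted above.
squared_diffs = {
    "also": 2.338, "although": 0, "and": 337.7, "as": 52.82,
    "because": 0, "before": 2.113, "but": 4.272, "comma": 102.9,
    "compound_word": 9.461, "contraction": 0.8221, "for": 87.13,
    "if": 0.4747, "nor": 0, "of": 131.3, "or": 12.93,
    "semi_colon": 8.451, "sentence": 46.97, "since": 2.113,
    "that": 10.85, "though": 2.113, "until": 0, "when": 0,
    "whenever": 0, "whereas": 0, "which": 1.696, "while": 0,
    "yet": 2.113,
}


def euclidean_distance(sq_diffs):
    """Square root of the sum of the squared differences (assumed formula)."""
    return math.sqrt(sum(sq_diffs.values()))


def top_contributors(sq_diffs, n=3):
    """Keys with the n largest squared differences, largest first."""
    return sorted(sq_diffs, key=sq_diffs.get, reverse=True)[:n]


distance = euclidean_distance(squared_diffs)
print(f"distance: {distance:.2f}")
print("largest contributors:", top_contributors(squared_diffs))
```

Under that assumption the squared differences sum to about 818.57, whose square root is about 28.61, which agrees with the reported 28.609 up to the rounding of the printed values. Ranked by size, the three largest entries are "and", "of" and comma, which is why I am asking about "of".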

