https://secure.csse.uwa.edu.au/run/help3401?p=np&rss=y
help3401 - Lecture 4: 60,000 possible dimension table rows help3401 - Lecture 4: 60,000 possible dimension table rows help3401https://blogs.law.harvard.edu/tech/rssThe University of Western AustraliaFri, 29 May 2020 18:48:43 +0800Fri, 29 May 2020 18:48:43 +0800Re: Lecture 4: 60,000 possible dimension table rows
https://secure.csse.uwa.edu.au/run/help3401?p=np&a=31
https://secure.csse.uwa.edu.au/run/help3401?p=np&a=31Sun, 29 Mar 2020 20:35:54 +0800"Zeyi Wen" <zeyi.wen@uwa.edu.au>Good answers. They are extreme/simplified examples. Based on your answers, I believe you have got the idea :)
Re: Lecture 4: 60,000 possible dimension table rows
https://secure.csse.uwa.edu.au/run/help3401?p=np&a=30
https://secure.csse.uwa.edu.au/run/help3401?p=np&a=30Sun, 29 Mar 2020 11:44:51 +0800"Edward Atkinson" <22487668@student.uwa.edu.au>
To answer my own question in the hope it might help someone else. The 60,000 unique values for each dimension limitation is a product of the fact that a 2 byte key is used to index rows in the dimension tables. This means that 2^16 - 1 rows can be stored, 65536 rows, therefore giving us our 60,000 unique value limit.
In the context of scenario A the assumption that there are only 60,000 unique values for each dimension also still makes sense as the likelihood of two rows in a dimension table of 100 million rows having the same values for 100 attributes with 60,000 possible levels is practically 0.
Lecture 4: 60,000 possible dimension table rows
https://secure.csse.uwa.edu.au/run/help3401?p=np&a=17
https://secure.csse.uwa.edu.au/run/help3401?p=np&a=17Tue, 24 Mar 2020 21:57:00 +0800"Edward Atkinson" <22487668@student.uwa.edu.au>Hi Zeyi ,
I was just wondering how you calculated that each dimension table in scenario C for the example you ran through in lecture 4 would have 60,000 possible rows?
Wouldn't it be possible for a dimension to have 100 million unique values if there are 100 million facts in the fact table? How can you be certain there would be only 60,000 possible values for each dimension, or is that simply an assumption?