Explanation : (a) What is the maximum number of cells possible in the base cuboid?
pn.
This is the maximum number of distinct tuples that you can form with p distinct values
per dimensions.
(b) What is the minimum number of cells possible in the base cuboid?
p.
You need at least p tuples to contain p distinct values per dimension. In this case no
tuple shares any value on any dimension.
(c) What is the minimum number of cells possible in the data cube, C?
(2n-1)×p+1.
The minimum number of cells is when each cuboid contains only p cells, except for the
apex, which contains a single cell.
(d) What is the maximum number of cells possible (including both base cells and
aggregate cells) in the data cube, C?
(p+1)n.
The argument is similar to that of part (a), but now we have p+1 because in addition to
the p distinct values of each dimension we can also choose ∗.
Explanation : The computation is based on the simplified Bayes’ formula.
P{B|A} = (P{A|B}·P{B) / P{A}.
P{M|S} = probability that a person had meningitis, conditioned by the existence of stiff neck.
P{S|M} = probability that a person complains about stiff neck, conditioned by the existence of meningitis. = 50%=1/2
P{S} = proportion of people who complain about stiff neck. = 1/20
P{M} = proportion of people who had meningitis. = 1/50,000
Then:
P{M|S} = (P{S|M}·P{M}) / P{S} =( 1 / 2 x 1 / 50,000 ) / 1 / 20 = 0.0002 = 0.02%
OLAP(Online application program) system is market oriented and is used for data analysis by knowledge workers including Managers, Executives and Analysts.
OLTP (online transaction processing) is a class of software programs capable of supporting transaction-oriented applications on the Internet.
Data system is a term used to refer to an organized collection of symbols and processes that may be used to operate on such symbols
A market system is the network of buyers, sellers and other actors that come together to trade in a given product or service
Explanation : h(k, i) = (h1(k) + i * h2(k)) mod m is used in double hashing is of the form. Rest other option does not used in double hashing. So, option (C) is correct.