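I'm a complete beginner. I'm taking a data mining course this semester and reading "The Elements of Statistical Learning", but my linear algebra is weak and there's a lot I don't follow, so I'd appreciate some help: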
Section 2.3.3, "From Least Squares to Nearest Neighbors":
"First we generated 10 means m_k from a bivariate Gaussian distribution N((1, 0)^T, I) and labeled this class BLUE. Similarly, 10 more were drawn from N((0, 1)^T, I) and labeled class ORANGE. Then for each class we generated 100 observations as follows: for each observation, we picked an m_k at random with probability 1/10, and then generated a N(m_k, I/5), thus leading to a mixture of Gaussian clusters for each class."
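In the distribution N(m_k, I/5) that the observations are generated from, where does the I/5 come from? I've thought about it for ages and can't work it out T-T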
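To make the question concrete, here is a rough Python sketch of how I currently read the quoted procedure. It assumes that the second argument of N(., .) is the covariance matrix, so I/5 is the 2x2 identity scaled by 1/5 (variance 1/5 in each coordinate, no correlation). The function name `sample_class`, its parameters, and the seed are my own and not from the book.

```python
import math
import random

def sample_class(center, n_means=10, n_obs=100, rng=None):
    """Draw points for one class following the quoted two-step procedure."""
    rng = rng or random.Random()
    # Step 1: cluster centres m_k ~ N(center, I).
    # Covariance I means independent coordinates with variance 1 each.
    means = [(rng.gauss(center[0], 1.0), rng.gauss(center[1], 1.0))
             for _ in range(n_means)]
    # Step 2: observations ~ N(m_k, I/5).
    # Assumption: I/5 is the covariance, so each coordinate has
    # variance 1/5, i.e. standard deviation sqrt(1/5).
    sd = math.sqrt(1.0 / 5.0)
    points = []
    for _ in range(n_obs):
        m = rng.choice(means)  # each m_k is picked with probability 1/n_means
        points.append((rng.gauss(m[0], sd), rng.gauss(m[1], sd)))
    return means, points

rng = random.Random(0)  # fixed seed only so the run is repeatable
blue_means, blue = sample_class((1.0, 0.0), rng=rng)
orange_means, orange = sample_class((0.0, 1.0), rng=rng)
print(len(blue_means), len(blue), len(orange))
```

Because the covariance is diagonal in both steps, the two coordinates can be drawn independently with `random.gauss`, so no matrix library is needed here. If this reading is right, the 1/5 only controls how tightly the points scatter around each m_k.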