reduce the number of features in SPSS
I have a dataset with more than 200 feature开发者_StackOverflow社区s and I would like to reduce the number in order not to overestimate the prediction of the outcome.
Does anyone know whether there is any option in SPSS to calculate mutual information between the target value (Y) and the independent variables (X) or any other method to check which variables are relevant and which are irrelevant?
Thank you!
I've not seen the term "features" used in this context, but I think you are in need of Principal Component Analysis.
However, doing statistics without knowing what you are doing is a good way to make meaningless numbers; I suggest you consult a statistician.
精彩评论