Linear / Logistic Regression in R: Dealing With Unknown Factor Levels in Test Data
Let’s say you have data containing a categorical variable with 50 levels. When you split the data into train and test sets, chances are that not all 50 levels will appear in your training set. This often happens when you divide the data set into train and test sets according ...
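To make the problem concrete, here is a minimal sketch in R. It is an illustration under stated assumptions, not necessarily the approach the full post takes. The toy data frame `dat`, the names `levels_all` and `test_idx`, and the deliberately lopsided split are all hypothetical. The sketch fits `lm()` on a training set that is missing three levels, shows that `predict()` stops with an error on the test set, and then applies one possible workaround: recoding the unseen levels as `NA` so that those rows get an `NA` prediction instead of aborting the call.

```r
# Hypothetical toy data: a factor with 50 levels, 4 rows per level
set.seed(1)
levels_all <- paste0("lvl", 1:50)
dat <- data.frame(
  category = factor(rep(levels_all, each = 4), levels = levels_all),
  x = rnorm(200)
)
dat$y <- 2 * dat$x + rnorm(200)

# Hypothetical split: every row of the last 3 levels goes to the test set,
# plus one row from each of the other 47 levels, so the training set is
# guaranteed to miss lvl48, lvl49 and lvl50.
test_idx <- c(which(dat$category %in% levels_all[48:50]), seq(1, 188, by = 4))
train <- droplevels(dat[-test_idx, ])
test  <- dat[test_idx, ]

fit <- lm(y ~ x + category, data = train)

# predict() refuses factor levels the model never saw during fitting
err <- tryCatch(predict(fit, newdata = test), error = function(e) e)
print(conditionMessage(err))

# One possible workaround (an assumption, not the only option):
# find the levels missing from the fitted model and recode them as NA.
unseen <- setdiff(unique(as.character(test$category)), fit$xlevels$category)
test_known <- test
test_known$category[test_known$category %in% unseen] <- NA

# Rows with an unseen level now yield NA; all other rows are predicted
pred <- predict(fit, newdata = test_known)
print(unseen)
print(sum(is.na(pred)))
```

The same check works for `glm()` fits, since they also store the training levels in `fit$xlevels`. Whether `NA` predictions are acceptable depends on the application; alternatives include a stratified split that keeps every level in the training set, or lumping rare levels into an "other" category before fitting.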