-
-
Notifications
You must be signed in to change notification settings - Fork 8.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feature numbering starting from 0 #3638
Comments
It looks like |
I think the solution could be to include
in the agaricus demo featmap.txt So the whole file will look like this:
I have tested it on other datasets, it works this way, for me at least! 👍 |
Target is not a feature, so I don't think we should include in the feature map. Let me submit a pull request to produce LIBSVM file with 0-based indexing, so that feature index starts with 0. |
…0-based indexing (dmlc#3652)
…0-based indexing (dmlc#3652)
featmap.txt has features numbered 0 to 125:
0 cap-shape=bell i
125 habitat=woods i
but the train and test data has features numbered up to 126
For example row 3207 in agaricus.txt.train
0 4:1 7:1 14:1 21:1 29:1 34:1 36:1 39:1 49:1 54:1 55:1 65:1 69:1 75:1 82:1 88:1 92:1 95:1 102:1 106:1 119:1 126:1
This seems to result incorrectly name the features when performing task=dump of an otherwise correctly performing model.
The text was updated successfully, but these errors were encountered: