Gunjan Dewan
2016-02-12 13:04:42 UTC
Hi all,
I am using the following dataset from Kaggle (train.csv):
https://www.kaggle.com/c/lshtc/data
The dataset is in libSVM format.
However, when I try to load it with load_svmlight_file, I get the
following error:
File "_svmlight_format.pyx", line 72, in
sklearn.datasets._svmlight_format._load_svmlight_file
(sklearn\datasets\_svmlight_format.c:2120)
ValueError: could not convert string to float: b'Data'
I then removed the header line, but I still get the same ValueError.
Can anyone please help me out with this?
I would also like to know whether there is another way to convert the
libSVM format into two matrices.
Note: I have just started out with sklearn and machine learning.
Thanks,
Gunjan
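
One possible way to do the conversion by hand, given as a minimal sketch rather than a definitive fix. It assumes each data row looks like "label[, label...] index:value index:value ...", that labels are integers, and that a non-numeric first token (such as the "Data" header in the traceback above) marks a line to skip. The function name parse_libsvm and the sample rows are made up for illustration; the real train.csv may differ. The sketch uses only the standard library and returns the labels plus the three arrays of a CSR sparse matrix.

```python
def _is_int(token):
    """Return True if the token can be read as an integer label."""
    try:
        int(token)
        return True
    except ValueError:
        return False


def parse_libsvm(lines):
    """Parse libSVM-style lines into labels and CSR arrays.

    Returns (labels, data, indices, indptr), where labels holds one
    list of integer labels per row and the other three lists describe
    the feature matrix in CSR layout.
    """
    labels = []
    data, indices, indptr = [], [], [0]
    for line in lines:
        # Treat commas as separators so "1,2" and "1, 2" both work.
        tokens = line.replace(",", " ").split()
        if not tokens:
            continue  # blank line
        first = tokens[0]
        if ":" not in first and not _is_int(first):
            continue  # assumed header line, e.g. "Data"
        row_labels = []
        for token in tokens:
            if ":" in token:
                index, value = token.split(":", 1)
                indices.append(int(index))   # feature index, kept as-is
                data.append(float(value))
            else:
                row_labels.append(int(token))
        labels.append(row_labels)
        indptr.append(len(indices))          # end of this row
    return labels, data, indices, indptr


# Made-up sample in the assumed layout, not real rows from train.csv.
sample = ["Data", "1, 2 5:1 9:2", "3 7:3"]
labels, data, indices, indptr = parse_libsvm(sample)
print(labels, data, indices, indptr)
```

If SciPy is installed, the three arrays can be passed to scipy.sparse.csr_matrix((data, indices, indptr)) to obtain the feature matrix. For a real file, open it and pass the file object, for example parse_libsvm(open("train.csv")). Alternatively, load_svmlight_file accepts multilabel=True for comma-separated labels, though whether that alone resolves the error above depends on the exact layout of the file.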