- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hi
I'm using a StringDataSource for importing categorical data (text labels) into NumericTables for SVM modelling. The idea being to import the data, let the DataSource work out the category labels, create the model then save the DataSourceDictionary and the model for later prediction.
I can serialize/deserialize the DataSourceDictionary ok, but the contained DataSourceFeatures don't seem to serialize the CategoricalFeatureDictionary which means I've lost the data labels. Should this work or have I missed something?
Kind Regards
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Thank you, Harvey for your reprort.
That is an issue indeed, we will fix that at the nearest release opportunity.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Ok thanks, as a temporary workaround I'll probably tokenize it myself and build up the NumericTable by hand. I see that the StringDataSource also build up the stats for the columns, can you tell me if this is necessary for a NumericTable bound for SVM training?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
No, SVM does not use stats from NumericTable
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Good. Thanks for your help.
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page