E:\McNair\Projects\Accelerators\Summer 2018\Industry Classifier update\BuildTestData.sql
Since this dataset has different and more classifications than the venture capital data previously used, we need to rebuild a coding system for the classifier. The new version that I am editing on is: E:\McNair\Projects\Accelerators\Summer 2018\Industry Classifier update\IndustryClassifierCOPY.pySmall training and testing data is called: 2018traindata.txt NewTestData2018.txt Larger training and testing data is called: bigtrain2018.txt bigtest2018.txt This file modifies the Classifier.pkl file which stores the components of the model. Eventually, we should be able to run this through FinalIndustryClassifier.py.
=New Notes=