Difference between revisions of "Deep Text Classifier"
(Created page with "=Deep Text Classifier= ==Problem Description== ==Approach in General== ==Package Dependences== ==How to Run the Code== ==How to Modify the Code to Solve your own problems...") |
|||
Line 3: | Line 3: | ||
==Problem Description== | ==Problem Description== | ||
− | ==Approach | + | We want to build a classifier for the text input. For example, we may want to classify a company's industry area based on its description. Or we may want to classify a company's IPO status based on its description. |
+ | |||
+ | ==General Approach== | ||
+ | |||
+ | We will build a deep neural network to uniformly solve this problem. The traditional way of doing this is to hire a task specific expert to manually design some useful features, say to check if the text contains words "Internet" and "High-tech" at the same time, and to classify based on the observed features. Our way, by using the deep neural network, can automatically extract the features and most importantly achieve very high testing accuracy. However, the features that are used by the deep neural network are not human interpretable. | ||
==Package Dependences== | ==Package Dependences== |
Revision as of 14:34, 10 October 2017
Contents
Deep Text Classifier
Problem Description
We want to build a classifier for the text input. For example, we may want to classify a company's industry area based on its description. Or we may want to classify a company's IPO status based on its description.
General Approach
We will build a deep neural network to uniformly solve this problem. The traditional way of doing this is to hire a task specific expert to manually design some useful features, say to check if the text contains words "Internet" and "High-tech" at the same time, and to classify based on the observed features. Our way, by using the deep neural network, can automatically extract the features and most importantly achieve very high testing accuracy. However, the features that are used by the deep neural network are not human interpretable.