Changes

*Current dataset: <code>The File to Rule Them All</code>, which contains information on 160 accelerators (homepage URL, cohort URL where found, etc.)
** We will use the data for the 121 accelerators whose cohort URLs were found to train and test our CNN algorithm
** After applying the Site Map Generator described above to those 121 accelerators, we will use 75% of the resulting data to train our model. The remaining 25% will be used as test data
*Types of input for training the CNN model:
#Image: a screenshot of the web page (generated by the screenshot tool described above)
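
The 75%/25% split described above could be sketched as follows. This is only a minimal illustration, not the project's actual code: the function name <code>train_test_split</code>, the fixed seed, and the placeholder page records are all assumptions, and the sketch splits at the level of individual pages rather than whole accelerators.

```python
import random


def train_test_split(records, train_fraction=0.75, seed=0):
    """Shuffle a copy of records and split it into (train, test) lists."""
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    shuffled = list(records)               # copy, so the caller's list is untouched
    random.Random(seed).shuffle(shuffled)  # seeded, so the split is reproducible
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical placeholder records standing in for the crawled pages.
pages = [f"page_{i}" for i in range(200)]
train, test = train_test_split(pages)
print(len(train), len(test))  # 150 50
```

Fixing the seed keeps the test set stable between runs, so results from different training attempts stay comparable. If pages from the same accelerator should never appear in both sets, the same function can be applied to the list of accelerators instead of the list of pages.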