*[[Google Crawler]]
*[[Yi Ma]]'s work assembling [[US Incubators]], state-by-state
*ClusterMapping
*Wharton entrepreneurship club
The [[Google Crawler]] was added instead of a structured source: with the exceptions of [[Crunchbase Database|Crunchbase]] and AngelList, the structured sources are all small.
==Assembling the data==
The data is assembled in the dbase '''incubators''' from the following national sources, all copied in E:\projects\Kauffman Incubator Project\Incubator Data Assembly:
*456 in CrunchbaseIncubators.txt, see [[Crunchbase_Database#Incubators_in_Crunchbase]]
*415 in INBIA_data.txt, see [[INBIA#Retrieve_Data_from_URLs_Generated]]
*1474 (self-declared as incubators but actually many different things) in angelList_companyInfo.txt, see [[AngelList_Database#Parsing_Saved_AngelList_Pages]]
*292 in ClusterMapping.txt
*21 in Wharton.txt
*361 in Gaebler.txt

Note that the AngelList data also has angelList_employees.txt and angelList_portfolio.txt as associated files.
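
Before loading the files into the dbase, it may be worth checking that each one still holds the number of records listed above. The following is a minimal sketch, not the script used on this project. It assumes that each file is tab-delimited with one record per line and a single header row, which this page does not confirm. The names count_records and check_sources are hypothetical helpers introduced for illustration.

```python
import csv
from pathlib import Path

# Record counts as listed above, keyed by source file name.
EXPECTED_COUNTS = {
    "CrunchbaseIncubators.txt": 456,
    "INBIA_data.txt": 415,
    "angelList_companyInfo.txt": 1474,
    "ClusterMapping.txt": 292,
    "Wharton.txt": 21,
    "Gaebler.txt": 361,
}


def count_records(path, has_header=True):
    """Count non-empty rows in a file.

    ASSUMPTION: the file is tab-delimited, one record per line,
    with an optional single header row.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        rows = [row for row in reader if any(cell.strip() for cell in row)]
    if has_header and rows:
        rows = rows[1:]  # drop the header row
    return len(rows)


def check_sources(directory, expected=EXPECTED_COUNTS, has_header=True):
    """Return {filename: (expected, found)} for each file that is
    missing (found is None) or whose record count differs."""
    mismatches = {}
    for name, want in expected.items():
        path = Path(directory) / name
        found = count_records(path, has_header) if path.exists() else None
        if found != want:
            mismatches[name] = (want, found)
    return mismatches
```

Calling check_sources on the Incubator Data Assembly folder would return an empty dictionary if every file matches its listed count. A mismatch may only mean that the header or delimiter assumption is wrong for that file, so check the file layout before treating it as a data problem.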