09/27/2016 14:00 - 17:00: ===Fall 2017===*Set up personal and work log pages, accessed Remote Desktop. <onlyinclude>*Compiled list of accelerators from Wiki09/29/2016 14:00 - 16:15; 16:45 - 17:30:*Created new project: [[Accelerator Seed List (Data)Shrey Agarwal]] and worked with Dr. Egan to create schematic for data entry.*Evaluated 3 sources and logged data. Sources were taken from [[List of AcceleratorsWork Logs]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.10/11/2016 14:00 - 17:30;*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. Shrey Agarwal (All work done in [[Accelerator Seed List Work Log)|(Datalog page)]])109/1319/2016 1417 15:00 - 17:00;*All work done in [[Accelerator Seed List (Data)]]*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from Became reacclimatized with the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables*Used google searches to identify more sources, and evaluated three databases spoke with Ed about the direction for the help rest of TextPad*Began working on more generic google searches. Was able to go through "Location+accelerator"-type searches today. Will continue next time.the semester109/1820/2016 1417 15:00 - 17:30;00*Work continued in [[Accelerator Seed List (Data)]]*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.*Brought Matthew up to speed Worked on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish setting up extracting cohort informationa new pull for the updated SDC data109/2021/16 1417 15:30 00 - 17:30:*Work continued in [[Accelerator Seed List (Data)]]00*Finished up the list of instructions for finding pull and sorted the cohort. Continued compiling data from the updated accelerator list of variables for each of the accelerators within the sample size.*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.109/2522/16 1417 15:00 - 17:00*Consulted Ed Tried to set up the matcher with next step for project.*Began listing the E-R diagram onto Matthew; ran into some difficulties on Power Shell, returning a blank file in the accelerator database page where entities were potential categories and each entity had its associated attributesoutput109/2726/16 1417 15:00 - 17:00*Continued working with Matthew Finished the match and created pivot tables to identify elements in count the E-R diagram for pulling information on accelerators. *Found sources to obtain/cross-reference information number of repetitions (ie. Angel Listcompanies going through more than one accelerator)119/0827/16 1417 15:00 - 1817:00*Identified possible keywords Discussed with Matthew the best way to filter results through for accelerators*Began compiling a comprehensive list of accelerators based on collect the VC data we have already sifted through.*Learned how to use regular expressions from Ben to sort names individually and alphabeticallythe repetitions.11/10/16 14:00 - 18:00*Began sorting We tried different matches through accelerator list and removing duplicates, as well as identifying more places to pull names from.*Worked with Peter our SDC data to create a crawl for f6s because the website does not return only accelerators.no avail119/1528/17 16 14:00 - 18:00*Took a break from f6s to locate more lists based on individual google searches such as "city+accelerator+list"*Put Seed DB information into an excel file on the remote desktop11/17/16 14:00 - 16:00*Continued filling out information for attempting to match with SDC the random Google Searches*Organized TextPad files on different columns. Didn't work without separating the RDP data into coherent excel spreadsheets with proper headers on the table*Noticed problem with f6s: it seems although all of the html coding was protected by individual files, a captcha so the crawler did not actually extract any information; it was all blockedvery tedious process.119/2229/16 1417 15:00 - 17:00*Worked to fix f6s crawler Spoke with Peter*Finished and compiled master list of accelerators12/01/16 14:00 - 18:00*Caught up on Ed about incubators project with Ed and Carlin*Took 20 accelerators (241-260) from , will begin as soon as we can time the list and filled out textaccelerator startup investments.html files for them; finished Ed is expecting us to begin sometime in the 2012/05/16 13:00 - 16:00*After finishing first 20 acceleratorsnext two months, continued working down the list, beginning at 321using a similar process as we did for incubators. The process should be handled by a new worker.*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP12/06</16 14:00 - 18:00onlyinclude>*Continued "Accelerating" down the list in [[Accelerator Seed List (Data)]], finished up until 34012/08/16 14:00 - 17:00===Spring 2017===*Continued working on accelerator list on the same page.
01/17/17 14:00 - 16:00
*Finished up "accelerating" from [[Accelerator Seed List (Data)]], numbers 341-351
4/11/17 14:00 - 16:00
*Finished compiling the accelerator and cohort information for the few we found from SARP, will consult Ed to figure out how to approach the missing accelerators and what to do for the preliminary report
9===Fall 2016=== 09/1927/2016 14:00 - 17 :00: *Set up personal and work log pages, accessed Remote Desktop. *Compiled list of accelerators from Wiki09/29/2016 14:00 - 16:15; 16:00 45 - 17:0030:*Became reacclimatized Created new project: [[Accelerator Seed List (Data)]] and worked with the Dr. Egan to create schematic for data entry.*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project, spoke with Ed about the direction page and identified categories that would be suitable for web crawling sometime in the rest future.10/11/2016 14:00 - 17:30;*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the semestersix sources from initial list. (All work done in [[Accelerator Seed List (Data)]])910/2013/17 152016 14:00 - 17:00;*Worked All work done in [[Accelerator Seed List (Data)]]*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on setting up my project page and identify a new pull reliable method for obtaining cohort information, as well as other variables*Used google searches to identify more sources, and evaluated three databases with the updated SDC datahelp of TextPad*Began working on more generic google searches. Was able to go through "Location+accelerator"-type searches today. Will continue next time.910/2118/17 152016 14:00 - 17:0030;*Work continued in [[Accelerator Seed List (Data)]]*Finished the pull Took a sample size of 10 accelerators and sorted the data detailed how to extract cohort information, as well as what other information is readily available from the updated accelerator listURLs.*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information910/2220/17 1516 14:00 30 - 17:0030:*Work continued in [[Accelerator Seed List (Data)]]*Tried to set Finished up the matcher list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.*Consulted Peter on prospects of creating a web crawler with Matthew; ran into some difficulties on Power Shellthe information we currently have compiled. Determined it was possible, returning a blank file in although beyond the outputscope of Peter's knowledge.910/2625/17 1516 14:00 - 17:00*Finished Consulted Ed with next step for project.*Began listing the match and created pivot tables to count E-R diagram onto the number of repetitions (companies going through more than one accelerator)database page where entities were potential categories and each entity had its associated attributes910/27/17 1516 14:00 - 17:00*Discussed Continued working with Matthew to identify elements in the best way E-R diagram for pulling information on accelerators. *Found sources to obtain/cross-reference information (ie. Angel List)11/08/16 14:00 - 18:00*Identified possible keywords to collect filter results through for accelerators*Began compiling a comprehensive list of accelerators based on the VC data we have already sifted through.*Learned how to use regular expressions from the repetitionsBen to sort names individually and alphabetically. We tried different matches 11/10/16 14:00 - 18:00*Began sorting through our SDC data accelerator list and removing duplicates, as well as identifying more places to pull names from.*Worked with Peter to no availcreate a crawl for f6s because the website does not return only accelerators.911/2815/16 14:00 - 18:00*Took a break from f6s to locate more lists based on individual google searches such as "city+accelerator+list"*Put Seed DB information into an excel file on the remote desktop11/17 /1614:00 - 1716:00*Continued attempting to match filling out information for the random Google Searches*Organized TextPad files on the RDP into coherent excel spreadsheets with SDC proper headers on the different columns. Didn't work without separating table*Noticed problem with f6s: it seems although all of the data into individual files, html coding was protected by a very tedious processcaptcha so the crawler did not actually extract any information; it was all blocked.911/2922/16 14:00 - 17 15:00*Worked to fix f6s crawler with Peter*Finished and compiled master list of accelerators12/01/16 14:00 - 1718:00*Spoke Caught up on project with Ed about incubators projectand Carlin*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 2012/05/16 13:00 - 16:00*After finishing first 20 accelerators, will begin as soon as we can time continued working down the accelerator startup investments. Ed is expecting us to begin sometime list, beginning at 321*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP12/06/16 14:00 - 18:00*Continued "Accelerating" down the next two monthslist in [[Accelerator Seed List (Data)]], using a similar process as we did for incubators. The process should be handled by a new workerfinished up until 34012/08/16 14:00 - 17:00*Continued working on accelerator list on the same page.
[[Category:Work Log]]