NSF Data

From edegan.com
Revision as of 12:20, 20 September 2020 by Ed (talk | contribs)
Jump to navigation Jump to search


Each folder by year has around 10,000 awards. While the total number of awards seems to be less than NIH, the downloads appear to have been done properly. The categories in the grant award by institution include: Award ID, Institution Name, Institution City Name, Institution Zipcode, Institution Phone, instituion street address, instituion country name, Instiution State Name, and Institution state code. Currently this data is available on the RDP but has not been put in a data base. Catherine has used primarily the zipcodes in the file for her project.

Project
NSF Data
Project logo 02.png
Project Information
Has title NSF Data
Has owner Catherine Kirby
Has start date 03/31/17
Has deadline date
Has keywords NSF
Has project status Subsume
Subsumed by: Federal Grant Data
Has sponsor McNair Center
Copyright © 2019 edegan.com. All Rights Reserved.

NSF Grant Data

The NSF grant data are in folders by year, each grant is an xml file. The format of a typical file is shown below. There is data available from 1959 onwards, and full records after 1976. In 2016 there were ~10,000 awards.

Each year can be downloaded from: https://www.nsf.gov/awardsearch/download.jsp

Data is located in:

E:\McNair\Projects\Federal Grant Data\NSF

The three data files are:

grant_award_general_info.txt - 394,610 rows
grant_award_institution_info.txt
grant_award_investigator_info.txt

grant_award_general_info.txt includes:

  • Award ID
  • AwardTitle
  • AwardEffectiveDate
  • AwardExpirationDate
  • AwardAmount
  • AwardInstrument
  • Organization Code
  • Organization Directorate
  • Organization Division
  • ProgramOfficer
  • AbstractNarration


grant_award_institution_info.txt includes the following:

  • Award ID
  • Institution Name
  • Institution CityName
  • Institution ZipCode
  • Institution Phone#
  • Institution Street Address
  • Institution Country Name
  • Institution State Name
  • Institution State Code


grant_award_investigator_info includes the following:

  • Award ID
  • Investigator First Name
  • Investigator Last Name
  • Investigator Email Address
  • StartDate
  • Investigator Role Code


Data is currently loaded into dbase biotech.

The load script and some processing is in:

Z:\Biotech\biotech.sql

Note that only grant_award_institution_info appears to be loaded. Data related to grant_award_general is probably loaded in table nsfgeneral and data related to grants_awards_investigator is in table nsfinvestigator. Coverage appears good 1968-2017.

Sample XML

Sample XML schema

 <rootTag>
 <Award>
 <AwardTitle>
 Collaborative Research: Hybrid Organic-Inorganic Thermoelectric Materials
 </AwardTitle>
 <AwardEffectiveDate>08/31/2016</AwardEffectiveDate>
 <AwardExpirationDate>06/30/2017</AwardExpirationDate>
 <AwardAmount>95461</AwardAmount>
 <AwardInstrument>
 <Value>Standard Grant</Value>
 </AwardInstrument>
 <Organization>
 07030000
 <Directorate>
 <LongName>Directorate For Engineering</LongName>
 </Directorate>
 <Division>
 <LongName>Div Of Civil, Mechanical, & Manufact Inn</LongName>
 </Division>
 </Organization>
 <ProgramOfficer>
 <SignBlockName>Thomas F. Kuech</SignBlockName>
 </ProgramOfficer>
 <AbstractNarration>
 Thermoelectric materials are materials which can be used to convert thermal energy directly to electricity. The performance of a thermoelectric material is measured by the "figure of merit", termed ZT. There has been much research into increasing thermoelectric materials, figure of merit, however, progress in this area has been slow and most of the researched thermoelectric materials up to now are suffering from either high fabrication cost, usage of rare earth or toxic elements, or poor mechanical properties. Organic thermoelectric materials (OTEs) have recently attracted attention for low temperature applications (< 300K), especially cooling purposes, as they are flexible, low-cost and abundant, and low-cost fabrication methods for synthesizing them exist... A large class of semiconducting nanostructures (e.g. Si, CdTe, Bi, and PbTe nanowires and holely structures) combined with conjugated polymers (e.g., chemically-modified PEDOT and low bandgap polymers) and organic molecules (specifically charged chemical species attached to molecules such as CF3- substituted styrene molecules) will be simulated, synthesized and optimized to identify new hybrid materials with a potentially high ZT.
 </AbstractNarration>
 <MinAmdLetterDate>12/22/2016</MinAmdLetterDate>
 <MaxAmdLetterDate>12/22/2016</MaxAmdLetterDate>
 <ARRAAmount/>
 <AwardID>1723353</AwardID>
 <Investigator>
 <FirstName>Mona</FirstName>
 <LastName>Zebarjadi</LastName>
 <EmailAddress>mz6g@virginia.edu</EmailAddress>
 <StartDate>12/22/2016</StartDate>
 <EndDate/>
 <RoleCode>Principal Investigator</RoleCode>
 </Investigator>
 <Institution>
 <Name>University of Virginia Main Campus</Name>
 <CityName>CHARLOTTESVILLE</CityName>
 <ZipCode>229044195</ZipCode>
 <PhoneNumber>4349244270</PhoneNumber>
 <StreetAddress>P.O. BOX 400195</StreetAddress>
 <CountryName>United States</CountryName>
 <StateName>Virginia</StateName>
 <StateCode>VA</StateCode>
 </Institution>
 <ProgramElement>
 8092
 <Text>Materials Eng. & Processing</Text>
 </ProgramElement>
 <ProgramReference>
 024E
 <Text>MATERIALS DESIGN</Text>
 </ProgramReference>
 <ProgramReference>
 036E
 <Text>CIVIL INFRASTRUCTURE</Text>
 </ProgramReference>
 </Award>
 </rootTag>

Survey of Federal Funds for Research and Development Fiscal Years 2014–16

Excel spreadsheets with data regarding Federal Agency budgets and spending on R&D [1]

 E:\McNair\Projects\Federal Grant Data\NSF\NSF Data Tables