::*By now, we only focus on cleaning American patents.
:'''2. Postcode and State (U.S.)'''
::U.S. post code follows the pattern [five digits - four digits]. In this way, U.S. patents can be extracted by searching for post code with regular expression.
::The state and extracted post code records are always together, separated by a space. So we can also extract state information with regular expression toostored in table ptoassigneend_missus_final.
::SQL code are in:
E:/McNair/Projects/PatentAddress/RxPostcode.sql
::The extracted records are stored in table ptoassigneend_missus'''3.State (U.S.)'''
:'''3:The state and post code are always together, separated by a space. City (USo we can also extract state information with regular expression too.S.)'''
::The issue is that both abbreviation and state name exist.
::The extracted state records are stored in table ptoassigneend_missus_final.
::SQL code are in: E:/McNair/Projects/PatentAddress/RxPostcode.sql :'''4. City (U.S.)''' :'''5. Issues'''::* The ultimate post code and zip regex for other countries besides U.S. can be found here:
http://stackoverflow.com/questions/578406/what-is-the-ultimate-postal-code-and-zip-regex