--1128247
==Restructure Address Information(First Stage)==
Note: This section was worked on by [[Sonia Zhang]]
*''' Both state name and its abbreviation exist.'''
==Restructure Address Information (Second Stage)==
Note: This section was worked on by [[Sonia Zhang]]
The dbase is '''patent'''.
The table is '''ptoassigneend'''.
SQL code and other things are in:
E:/McNair/Projects/PatentAddress
IO files are on the dbase server in:
Z:/PatentAddress
====To do====
In no particular order:
*Remove city, state, zip, country from addrline1 & addrline2 to get clean addrlines.
*Maybe concatenate addrline1 and addrline to make addrline
*Identify clean data (e.g. City that is a city, zip that is a zip, state that is a state)
**By pattern, length, match to list
*Try some more patterns, perhaps with a slightly higher false positive rate, on the remaining uncleaned data
**Iterate!
==== ====