Difference between revisions of "Matching VentureOne (Data)"
Revision as of 14:30, 15 June 2016
Matching VentureOne (Data) | |
---|---|
Project Information | |
Project Title | |
Start Date | |
Deadline | |
Primary Billing | |
Notes | |
Has project status | |
Copyright © 2016 edegan.com. All Rights Reserved. |
Data
- Get the source file for the VentureOne data
E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx
Original data source
- Clean it up
E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt
Extraneous symbols and words removed
- Match it against itself
E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx
- Get the patent data
  - Draw the distinct assignees
    Z:\allpatentsprocessed\DistinctAssignees2.txt
  - Match them against themselves
    Z:\allpatentsprocessed\DistinctAssignees2matched.txt
- Match venture data to patent data
Z:\allpatentsprocessed\Venture Patent Matched.txt
- Join patent data to assignee data, creating firstjoin_cleaned
- Join firstjoin_cleaned data to matchassignee data, creating secondjoin_cleaned
- Join secondjoin_cleaned data to venturepatentmatched data, creating fourthjoin_cleaned
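
The three joins listed above can be sketched as a chain of inner joins. This is only an illustration under stated assumptions: the column names (`patent_id`, `assignee_id`, `assignee_name`, `matched_name`, `venture_name`), the sample rows, and the `inner_join` helper are all hypothetical, and the real tables may well have been joined in a database rather than in Python. Only the result names `firstjoin_cleaned`, `secondjoin_cleaned`, and `fourthjoin_cleaned` come from the steps above.

```python
# Hypothetical miniature tables; all column names and values are assumptions
# made for illustration, not the real schema.
patents = [
    {"patent_id": "P1", "assignee_id": "A1"},
    {"patent_id": "P2", "assignee_id": "A2"},
    {"patent_id": "P3", "assignee_id": "A3"},
]
assignees = [
    {"assignee_id": "A1", "assignee_name": "ACME CORP"},
    {"assignee_id": "A2", "assignee_name": "ACME CORPORATION"},
    {"assignee_id": "A3", "assignee_name": "GLOBEX INC"},
]
# Assumed shape of the assignee self-match output: raw name -> canonical name.
match_assignee = [
    {"assignee_name": "ACME CORP", "matched_name": "ACME"},
    {"assignee_name": "ACME CORPORATION", "matched_name": "ACME"},
    {"assignee_name": "GLOBEX INC", "matched_name": "GLOBEX"},
]
# Assumed shape of the venture-to-patent match output.
venture_patent_matched = [
    {"matched_name": "ACME", "venture_name": "Acme Ventures"},
]


def inner_join(left, right, key):
    """Inner join two lists of dicts on a shared key column."""
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    # Keep only left rows that have at least one partner on the right.
    return [{**l, **r} for l in left for r in index.get(l[key], [])]


# Patent data joined to assignee data.
firstjoin_cleaned = inner_join(patents, assignees, "assignee_id")
# Result joined to the assignee self-match table.
secondjoin_cleaned = inner_join(firstjoin_cleaned, match_assignee, "assignee_name")
# Result joined to the venture/patent match table.
fourthjoin_cleaned = inner_join(
    secondjoin_cleaned, venture_patent_matched, "matched_name"
)

for row in fourthjoin_cleaned:
    print(row["patent_id"], row["venture_name"])
# prints:
# P1 Acme Ventures
# P2 Acme Ventures
```

In this toy data, two spellings of the same assignee collapse to one canonical name in the second join, so both of their patents end up attached to the same venture-backed firm in the final table, while the unmatched assignee drops out.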