The ParserSpliter.pl script first splits a large Patent Data XML file into smaller XML files, one per patent. It then parses and processes each of the smaller Patent Data XML files.
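The splitting step can be illustrated with a short sketch. This is Python written for illustration, not the Perl code of ParserSpliter.pl. It assumes that the large file is a concatenation of complete XML documents, each starting with its own `<?xml` declaration at the beginning of a line; the page does not confirm this layout. The function names and the `patent_00001.xml` naming scheme are hypothetical.

```python
from pathlib import Path


def split_concatenated_xml(text):
    """Split text into one string per XML document.

    Assumption (not confirmed by the page): every record starts with its
    own '<?xml' declaration at the beginning of a line.
    """
    documents = []
    current = []
    for line in text.splitlines(keepends=True):
        # A new declaration closes the record collected so far, if any.
        if line.startswith("<?xml") and current:
            documents.append("".join(current))
            current = []
        current.append(line)
    if current:
        documents.append("".join(current))
    return documents


def write_documents(documents, out_dir):
    """Write each record to out_dir using a hypothetical naming scheme."""
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    written = []
    for index, document in enumerate(documents, start=1):
        target = out_path / f"patent_{index:05d}.xml"
        target.write_text(document, encoding="utf-8")
        written.append(target)
    return written
```

Splitting on the declaration line avoids parsing the large file as a whole, which would fail anyway if it really contains several root elements.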
Some of the files are malformed in a way that has not been identified, and the script moves them to the ./failed_files directory. Adding a single character anywhere in one of these files appears to make the script process it without error, although the reason is not known.
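The pattern of parsing each file and setting aside the ones that fail can be sketched as follows. Again this is an illustrative Python sketch, not the script's Perl code: it uses the standard library's `xml.etree.ElementTree` in place of XML::Simple and a plain `try`/`except` in place of Try::Tiny. The function name `process_directory` is hypothetical, and the sketch does not try to reproduce or explain the specific failures described above.

```python
import shutil
import xml.etree.ElementTree as ET
from pathlib import Path


def process_directory(in_dir, failed_dir="./failed_files"):
    """Parse every *.xml file in in_dir; move unparseable ones to failed_dir.

    Returns two lists of file names: (parsed, failed).
    """
    parsed, failed = [], []
    for xml_file in sorted(Path(in_dir).glob("*.xml")):
        try:
            # Real processing of the parsed tree would go here.
            ET.parse(str(xml_file))
        except ET.ParseError:
            # Set the file aside so one bad record does not stop the run.
            Path(failed_dir).mkdir(parents=True, exist_ok=True)
            shutil.move(str(xml_file), str(Path(failed_dir) / xml_file.name))
            failed.append(xml_file.name)
        else:
            parsed.append(xml_file.name)
    return parsed, failed
```

Moving failed files instead of deleting them keeps them available for later inspection or a retry.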
To use this script, you need to have the Perl modules XML::Simple and Try::Tiny installed.