Extracting Features from Surnames
Revision as of 18:22, 20 July 2009
<tt> perl SurnameFeatures.pl -i=SourceFile.txt -r=Culture-EganClassification.txt -rcol=6 -rkey=0 -rno=2 -ncol=0 -dcol=5 -rsup=1 -sp=1 -gram=2 -minfq=1 -diag=0 -two=1 </tt>
Here, <tt>-rsup</tt> suppresses records that have no match in the reference file, and <tt>-rkey</tt> and <tt>-rno</tt> specify the key and class-number columns in the reference file (here <tt>Culture-EganClassification.txt</tt>). For simplicity, we recommend standardizing country names to the UN standard and then using them as reference keys.
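The reference lookup and suppression described above can be illustrated with a minimal Python sketch. This is not the code of <tt>SurnameFeatures.pl</tt>; it only mirrors the idea under stated assumptions: the reference file is tab-delimited, column numbers are zero-based, and the source column holding the country name is passed in explicitly. The function names, the sample data, and the <tt>source_key_col</tt> parameter are all hypothetical.

```python
import csv
import io


def load_reference(ref_text, key_col=0, class_col=2, delimiter="\t"):
    """Map each reference key to its class number.

    key_col and class_col play the roles of -rkey and -rno;
    zero-based indexing and tab delimiting are assumptions.
    """
    lookup = {}
    for row in csv.reader(io.StringIO(ref_text), delimiter=delimiter):
        if len(row) > max(key_col, class_col):
            lookup[row[key_col].strip()] = row[class_col].strip()
    return lookup


def classify(records, lookup, source_key_col, suppress_unmatched=True):
    """Attach a class number to each record.

    With suppress_unmatched=True, records whose key is missing from
    the reference are dropped, which is the behavior attributed to -rsup.
    """
    out = []
    for rec in records:
        cls = lookup.get(rec[source_key_col].strip())
        if cls is None and suppress_unmatched:
            continue
        out.append((rec, cls))
    return out


# Hypothetical reference rows: key, label, class number.
reference = "France\tWestern Europe\t3\nJapan\tEast Asia\t7\n"
# Hypothetical source records: surname, country.
records = [["Dupont", "France"], ["Sato", "Japan"], ["Smith", "Atlantis"]]

lookup = load_reference(reference)
kept = classify(records, lookup, source_key_col=1)
```

Because the lookup is an exact string match on the key, a spelling variant of a country name in the source file would leave its records unmatched, which is one reason to standardize the names before using them as keys.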