Changes
Jump to navigation
Jump to search
Extracting Features from Surnames (view source)
Revision as of 18:31, 10 July 2009
, 18:31, 10 July 2009no edit summary
An example command line is:
<tt> perl SurnameFeatures.pl -i=sourcefile.txt -ncol=0 -dcol=5 -sp=1 -gram=2 -minfq=1 -diag=0<\/tt>
Where <tt>sp=1<\/tt> forces the inclusion of spaces in the character set (which is otherwise a-z), as well as before and after the string, <tt>minfq<\/tt> sets to minimum global frequency of occurance of an n-gram for it to be included in the output, and <tt>diag=1<\/tt> produces an additional frequency of occurance diagnostic file.