Changes

Extracting Features from Surnames (view source)

Revision as of 16:59, 16 July 2009

244 bytes added , 16:59, 16 July 2009

no edit summary

Where <tt>sp=1</tt> forces the inclusion of spaces in the character set (which is otherwise a-z), as well as before and after the string, <tt>minfq</tt> sets to minimum global frequency of occurance of an n-gram for it to be included in the output, and <tt>diag=1</tt> produces an additional frequency of occurance diagnostic file.

The script has several other useful options, including <tt>-two</tt> which generates two files, one of the index, the class (if specified through '-classnocol') and the gram variables, and another containing the index and all other variables.

Anonymous user

imported>Ed

Changes

Extracting Features from Surnames (view source)

Revision as of 16:59, 16 July 2009

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Sites

Sections

Organizations

Help

Tools