==NLTK==
NLTK is the Natural Language Toolkit, a Python library for natural language processing.
*NLTK Information
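**Text may need to be converted to ASCII first. I had issues with my PDF texts and had to convert them.
**The sent_tokenize() function splits a document into sentences, which is easier than using regular expressions.
**Use pos_tag() to tag the words of each sentence with their parts of speech (it takes a list of tokens, not a raw string). The tags can then be used to extract proper nouns.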
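The three notes above can be combined into one small pipeline. The following is a minimal sketch, not a definitive recipe. The function names to_ascii, proper_nouns, and extract_proper_nouns are made up for illustration, and the ASCII step uses Python's standard unicodedata module as one possible approach (the notes do not say how the conversion was originally done). It also assumes the default English tagger, which emits Penn Treebank tags, where NNP and NNPS mark proper nouns.

```python
import unicodedata


def to_ascii(text):
    """Return an ASCII-only copy of text (hypothetical helper).

    NFKD normalization splits accented letters and ligatures into base
    characters plus marks; anything still non-ASCII is then dropped.
    """
    normalized = unicodedata.normalize("NFKD", text)
    return normalized.encode("ascii", "ignore").decode("ascii")


def proper_nouns(tagged_tokens):
    """Keep tokens whose Penn Treebank tag is NNP or NNPS (proper nouns)."""
    return [word for word, tag in tagged_tokens if tag in ("NNP", "NNPS")]


def extract_proper_nouns(text):
    """Split text into sentences, tag each one, and collect proper nouns."""
    # Imported here so the helpers above work without NLTK installed.
    import nltk

    found = []
    for sentence in nltk.sent_tokenize(to_ascii(text)):
        tokens = nltk.word_tokenize(sentence)  # pos_tag expects a token list
        found.extend(proper_nouns(nltk.pos_tag(tokens)))
    return found
```

Calling extract_proper_nouns() requires NLTK to be installed along with its tokenizer and tagger data, which is fetched with nltk.download(); the exact resource names depend on the NLTK version. Note that to_ascii() silently drops characters that have no ASCII decomposition, such as curly quotes, so it may be worth replacing those explicitly before converting.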