Difference between revisions of "Python Libraries"

From edegan.com
Jump to navigation Jump to search
 
(7 intermediate revisions by 3 users not shown)
Line 1: Line 1:
{{McNair Projects
+
{{Project
 +
|Has project output=
 +
|Has sponsor=McNair Center
 
|Has title=Python Libraries
 
|Has title=Python Libraries
 
|Has owner=Peter Jalbert, Harrison Brown, Christy Warden, Jeemin Sim,
 
|Has owner=Peter Jalbert, Harrison Brown, Christy Warden, Jeemin Sim,
Line 7: Line 9:
 
This page is dedicated to documenting all Python libraries, working or not. Please include a description of what the library is for, whether or not it is functional, and how to import and use it.
 
This page is dedicated to documenting all Python libraries, working or not. Please include a description of what the library is for, whether or not it is functional, and how to import and use it.
  
==NLP Libraries==
+
==Geocoding Libraries==
 +
 
 +
=NLP Libraries=
 +
==NLTK==
 +
NLTK is the Natural Language Toolkit
 +
*NLTK Information
 +
**Need to convert text to ascii. Had issues with my PDF texts and had to convert
 +
**Can use sent_tokenize() function to split document into sentences, easier that regular expressions
 +
**Use pos_tag() to tag the sentences. This can be used to extract proper noun
 +
**there are several packages that need to be downloaded, to do this:
 +
***open up python in the shell
 +
****run nltk.download()
 +
****download all packages

Latest revision as of 12:34, 21 September 2020


Project
Python Libraries
Project logo 02.png
Project Information
Has title Python Libraries
Has owner Peter Jalbert, Harrison Brown, Christy Warden, Jeemin Sim
Has start date
Has deadline date
Has keywords Python, Libraries
Has project status
Has sponsor McNair Center
Copyright © 2019 edegan.com. All Rights Reserved.

This page is dedicated to documenting all Python libraries, working or not. Please include a description of what the library is for, whether or not it is functional, and how to import and use it.

Geocoding Libraries

NLP Libraries

NLTK

NLTK is the Natural Language Toolkit

  • NLTK Information
    • Need to convert text to ascii. Had issues with my PDF texts and had to convert
    • Can use sent_tokenize() function to split document into sentences, easier that regular expressions
    • Use pos_tag() to tag the sentences. This can be used to extract proper noun
    • there are several packages that need to be downloaded, to do this:
      • open up python in the shell
        • run nltk.download()
        • download all packages