Difference between revisions of "Listing Page Plugin Spec"
Line 4: | Line 4: | ||
|Has project status=Active | |Has project status=Active | ||
}} | }} | ||
− | '' | + | |
+ | ==Plugin Overview== | ||
+ | Faced with the problem of no standardization across incubator and accelerator websites, there is a design feasibility question concerning automating the extraction of information. A browser plugin with user guidance could serve as a fundamental first step towards total mechanization of the process. See [[LP_Extractor_Protocol]] for a comprehensive introduction to potential methods. | ||
+ | |||
+ | The focus of this design is to create a tool which allows for the quick identification of HTML markings on a webpage and subsequent reduction to a DSL for useful data extraction. Multiple options will be considered, including allowing the user to visually 'draw' a grid, either via dragging or marking vertices, and mouse-over. Attention will be given to potentially viable technical resources as well as usability. | ||
+ | |||
+ | Current List of sites to examine: | ||
+ | [[Media:File:Accelerator List.xlsx]] | ||
+ | |||
+ | |||
+ | '''Sample Webpage:''' | ||
+ | [[File:Kobeexampless.PNG|thumb|center|upright=3|Image taken from 500kobe.com]] | ||
+ | |||
+ | |||
+ | ==Technical Specifications== | ||
+ | |||
+ | ===User Input Styles=== | ||
+ | * Drag + Drop | ||
+ | * Marking Vertices | ||
+ | * Mouse-Over | ||
+ | |||
+ | ===Browser Considerations=== | ||
+ | |||
+ | ===Language Considerations=== | ||
+ | |||
+ | ===Current Problems=== | ||
+ | * "Infinite Scroll" webpages: Potentially impossible to account for incubator websites which display company lists in an infinite scroll style. Would require multiple instances of user input. |
Revision as of 09:50, 29 March 2019
Listing Page Plugin Spec | |
---|---|
Project Information | |
Has title | Listing Page Plugin Spec |
Has owner | Rex Bone |
Has start date | |
Has deadline date | |
Has project status | Active |
Copyright © 2019 edegan.com. All Rights Reserved. |
Contents
Plugin Overview
Faced with the problem of no standardization across incubator and accelerator websites, there is a design feasibility question concerning automating the extraction of information. A browser plugin with user guidance could serve as a fundamental first step towards total mechanization of the process. See LP_Extractor_Protocol for a comprehensive introduction to potential methods.
The focus of this design is to create a tool which allows for the quick identification of HTML markings on a webpage and subsequent reduction to a DSL for useful data extraction. Multiple options will be considered, including allowing the user to visually 'draw' a grid, either via dragging or marking vertices, and mouse-over. Attention will be given to potentially viable technical resources as well as usability.
Current List of sites to examine: Media:File:Accelerator List.xlsx
Sample Webpage:
Technical Specifications
User Input Styles
- Drag + Drop
- Marking Vertices
- Mouse-Over
Browser Considerations
Language Considerations
Current Problems
- "Infinite Scroll" webpages: Potentially impossible to account for incubator websites which display company lists in an infinite scroll style. Would require multiple instances of user input.