Difference between revisions of "Listing Page Plugin Spec"
Line 11: | Line 11: | ||
Current List of sites to examine: | Current List of sites to examine: | ||
− | [[ | + | [[File:Accelerator List.xlsx | Accelerator List ]] |
+ | |||
+ | (E:\projects\Kauffman Incubator Project\02 Identify the client listing page\Listing Page Classifier) | ||
Revision as of 09:53, 29 March 2019
Listing Page Plugin Spec | |
---|---|
Project Information | |
Has title | Listing Page Plugin Spec |
Has owner | Rex Bone |
Has start date | |
Has deadline date | |
Has project status | Active |
Copyright © 2019 edegan.com. All Rights Reserved. |
Contents
Plugin Overview
Faced with the problem of no standardization across incubator and accelerator websites, there is a design feasibility question concerning automating the extraction of information. A browser plugin with user guidance could serve as a fundamental first step towards total mechanization of the process. See LP_Extractor_Protocol for a comprehensive introduction to potential methods.
The focus of this design is to create a tool which allows for the quick identification of HTML markings on a webpage and subsequent reduction to a DSL for useful data extraction. Multiple options will be considered, including allowing the user to visually 'draw' a grid, either via dragging or marking vertices, and mouse-over. Attention will be given to potentially viable technical resources as well as usability.
Current List of sites to examine: Accelerator List
(E:\projects\Kauffman Incubator Project\02 Identify the client listing page\Listing Page Classifier)
Sample Webpage:
Technical Specifications
User Input Styles
- Drag + Drop
- Marking Vertices
- Mouse-Over
Browser Considerations
- Firefox
- Chrome
- Version Control
Language Considerations
Current Problems
- "Infinite Scroll" webpages: Potentially impossible to account for incubator websites which display company lists in an infinite scroll style. Would require multiple instances of user input.