Difference between revisions of "Listing Page Plugin Spec"

From edegan.com
Jump to navigation Jump to search
Line 4: Line 4:
 
|Has project status=Active
 
|Has project status=Active
 
}}
 
}}
''in progress''
+
 
 +
==Plugin Overview==
 +
Faced with the problem of no standardization across incubator and accelerator websites, there is a design feasibility question concerning automating the extraction of information. A browser plugin with user guidance could serve as a fundamental first step towards total mechanization of the process. See [[LP_Extractor_Protocol]] for a comprehensive introduction to potential methods.
 +
 
 +
The focus of this design is to create a tool which allows for the quick identification of HTML markings on a webpage and subsequent reduction to a DSL for useful data extraction. Multiple options will be considered, including allowing the user to visually 'draw' a grid, either via dragging or marking vertices, and mouse-over. Attention will be given to potentially viable technical resources as well as usability.
 +
 
 +
Current List of sites to examine:
 +
[[Media:File:Accelerator List.xlsx]]
 +
 
 +
 
 +
'''Sample Webpage:'''
 +
[[File:Kobeexampless.PNG|thumb|center|upright=3|Image taken from 500kobe.com]]
 +
 
 +
 
 +
==Technical Specifications==
 +
 
 +
===User Input Styles===
 +
* Drag + Drop
 +
* Marking Vertices
 +
* Mouse-Over
 +
 
 +
===Browser Considerations===
 +
 
 +
===Language Considerations===
 +
 
 +
===Current Problems===
 +
* "Infinite Scroll" webpages: Potentially impossible to account for incubator websites which display company lists in an infinite scroll style. Would require multiple instances of user input.

Revision as of 09:50, 29 March 2019


Project
Listing Page Plugin Spec
Project logo 02.png
Project Information
Has title Listing Page Plugin Spec
Has owner Rex Bone
Has start date
Has deadline date
Has project status Active
Copyright © 2019 edegan.com. All Rights Reserved.


Plugin Overview

Faced with the problem of no standardization across incubator and accelerator websites, there is a design feasibility question concerning automating the extraction of information. A browser plugin with user guidance could serve as a fundamental first step towards total mechanization of the process. See LP_Extractor_Protocol for a comprehensive introduction to potential methods.

The focus of this design is to create a tool which allows for the quick identification of HTML markings on a webpage and subsequent reduction to a DSL for useful data extraction. Multiple options will be considered, including allowing the user to visually 'draw' a grid, either via dragging or marking vertices, and mouse-over. Attention will be given to potentially viable technical resources as well as usability.

Current List of sites to examine: Media:File:Accelerator List.xlsx


Sample Webpage:

Image taken from 500kobe.com


Technical Specifications

User Input Styles

  • Drag + Drop
  • Marking Vertices
  • Mouse-Over

Browser Considerations

Language Considerations

Current Problems

  • "Infinite Scroll" webpages: Potentially impossible to account for incubator websites which display company lists in an infinite scroll style. Would require multiple instances of user input.