E:\projects\listing page identifier\generate_dataset.py
'''''Generate and Label Image Data: ''''' feed <code>train.txt </code> and <code>text.txt that are generated by the generate_dataset tool </code> into Screenshot Tool to get our image data
*Results are split into two folders: train and test
** also separated into sub-folders: cohort and not_cohort[[File:autoName.png|250px]]