A sample training code lives in
E:\projects\embedding\Web_extractor_model\train_sample.py
In the '''utils.py''' file, there are a few hyperparameters to remember.
max_len: the length of each training point
step: the number of steps we want to move to generate the next training point
num_units: LSTM units, a safe choice is 128
len_unique_chars: total number of unique tokens in all training data