- My reimplementation of the paper:
  J. Chung, S. Ahn, and Y. Bengio, “Hierarchical multiscale recurrent neural networks,” arXiv preprint arXiv:1609.01704, 2016.
disclaimer
- This work was done during a seminar/project in a bioinformatics + AI master's program.
- I do not claim that my results, findings, or implementation of the HMLSTM architecture are correct.
- documents - contains all my written work (seminar paper, presentation slides, results)
- hmlstm - contains the implementation; if you just want to use it, focus on network.py and the HMLSTMNetwork class
- lstm - can be ignored; it was only used as a baseline for comparison
- projects - contains the dataset; the implementation was used/tested on a character modeling task (predicting the next character), roughly as sketched after this list
- environment.yml - specifies the conda environment
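For context, "predict the next character" just means the target sequence is the input shifted one position ahead. A minimal, hypothetical sketch of such a data preparation (the names `make_char_dataset` and `seq_len` are my own, not the repo's actual loader):

```python
import numpy as np

def make_char_dataset(text, seq_len=10):
    """Turn raw text into (input, target) index sequences for
    next-character prediction. Hypothetical helper; the repo's
    actual data pipeline may differ."""
    chars = sorted(set(text))
    char_to_idx = {c: i for i, c in enumerate(chars)}
    encoded = np.array([char_to_idx[c] for c in text], dtype=np.int64)
    inputs, targets = [], []
    for start in range(len(encoded) - seq_len):
        inputs.append(encoded[start:start + seq_len])
        # the target sequence is the input shifted one character ahead
        targets.append(encoded[start + 1:start + seq_len + 1])
    return np.stack(inputs), np.stack(targets), char_to_idx

X, y, vocab = make_char_dataset("hello world, hello hmlstm")
print(X.shape, y.shape, len(vocab))  # (15, 10) (15, 10) 12
```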
- The architecture is very interesting; if you want to learn about it, focus on the seminar paper in the documents folder. I spent quite a while on the visualizations.
- It is basically a stacked LSTM that learns to mask out information as it flows from the lower to the higher layers of the stack.
- This mask/boundary detector can be used for visualization (showing which boundaries were detected).
- The boundary detector uses a non-differentiable function (a round/step function) whose gradient is approximated during backpropagation (the paper's straight-through estimator); a sketch follows below.
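A minimal sketch of that gradient trick, assuming PyTorch (an assumption on my part; the repo itself may use a different framework): the forward pass applies the hard round, while the backward pass treats it as the identity.

```python
import torch

def straight_through_round(x):
    """Round x in the forward pass, but let gradients pass
    through unchanged, as if rounding were the identity."""
    hard = torch.round(x)           # non-differentiable forward
    return x + (hard - x).detach()  # identity gradient in backward

# toy boundary pre-activations for 8 time steps
pre = torch.randn(8, requires_grad=True)
z_tilde = torch.sigmoid(pre)          # soft boundary probabilities
z = straight_through_round(z_tilde)   # hard 0/1 boundary indicators
z.sum().backward()
print(z)         # hard values, e.g. tensor([0., 1., 1., 0., ...])
print(pre.grad)  # non-zero despite the hard round in the forward pass
```

The paper additionally anneals the slope of the sigmoid during training (the slope annealing trick), which this sketch omits.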
- My findings showed that it does detect boundaries, but most of the time those boundaries could not easily be interpreted (e.g., as ends/beginnings of words).
- I also tried a metric-based analysis: I marked the expected boundaries in a text (e.g., word starts/ends) and measured the differences between expected and detected boundaries (one possible setup is sketched after this list); the results were not very promising.
- Maybe in a different (non-textual) setting the architecture would be more beneficial, or my implementation was just wrong ;)
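One way such a comparison can be set up (a hypothetical sketch; `boundary_f1` and its `tolerance` parameter are my own names, not necessarily the metric used in my results): treat expected and detected boundary positions as index sets and compute precision, recall, and F1.

```python
def boundary_f1(expected, detected, tolerance=0):
    """Score detected boundary positions against expected ones.
    A detected boundary counts as a hit if it lies within
    `tolerance` characters of some expected boundary."""
    expected, detected = set(expected), set(detected)
    if not expected or not detected:
        return 0.0
    hits = sum(any(abs(d - e) <= tolerance for e in expected) for d in detected)
    recovered = sum(any(abs(d - e) <= tolerance for d in detected) for e in expected)
    precision = hits / len(detected)
    recall = recovered / len(expected)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# word boundaries in "the cat sat" fall at indices 3 and 7
print(boundary_f1(expected=[3, 7], detected=[3, 8], tolerance=1))  # 1.0
```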
- None; but if you really happen to use some of the code/documents/visualizations, it would be nice if you linked the repo ;)