Collected molecules will appear here. Add from search or explore.
Implementation of AWD-LSTM (Average-SGD Weight-Dropped LSTM), introducing DropConnect for recurrent weights and non-monotonically triggered averaged SGD for sequence modeling.
citations
0
co_authors
3
While the original paper (AWD-LSTM) was a major milestone in RNN research, this specific repository lacks community traction (0 stars). Furthermore, LSTMs have been almost entirely replaced by Transformers and more recently State Space Models (SSMs) in frontier-lab research. The techniques (Weight-dropping, NT-AvSGD) are now standard knowledge and easily reproducible, offering no modern competitive moat.
TECH STACK
INTEGRATION
reference_implementation
READINESS