The Pointer Sentinel-LSTM mixture model is a type of recurrent neural network that combines the advantages of standard softmax classifiers with those of a pointer component for effective and efficient language modeling. Rather than relying on the RNN hidden state to decide when to use the pointer, the model allows the pointer component itself to decide when to use the softmax vocabulary through a sentinel.
Source: Pointer Sentinel Mixture ModelsPaper | Code | Results | Date | Stars |
---|
Component | Type |
|
---|---|---|
LSTM
|
Recurrent Neural Networks | |
Pointer Network
|
Sequence To Sequence Models | |
Softmax
|
Output Functions |