Subword Segmentation
Natural Language Processing
• 4 methods
Methods
Method | Introduced in | Year | Papers
BPE | Neural Machine Translation of Rare Words with Subword Units | 2015 | 12910
WordPiece | Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation | 2016 | 5356
Gradient-Based Subword Tokenization | Charformer: Fast Character Transformers via Gradient-based Subword Tokenization | 2021 | 5
Unigram Segmentation | Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates | 2018 | 1