BytePair embeddings are a really cool idea. They can be seen as a lightweight variant of FastText. They need less memory because they are more selective about which subword tokens they remember. That selectivity is also useful in certain scenarios, because it means they can ignore some subwords entirely. Pretrained versions are available in 275 languages!
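To make the "more selective" point concrete, here is a minimal sketch of byte-pair merge learning in plain Python, next to a FastText-style character n-gram extractor for comparison. This is a toy illustration, not the BPEmb or FastText implementation: the function names (`learn_bpe`, `merge_pair`, `segment`, `char_ngrams`), the tiny corpus, and the tie-breaking rule are all assumptions made up for this example.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merges by repeatedly joining the most frequent adjacent pair."""
    vocab = Counter(tuple(w) for w in words)  # each word starts as a tuple of characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Ties are broken by the pair itself so the result is deterministic (an arbitrary choice).
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


def segment(word, merges):
    """Split a word into subwords by replaying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


def char_ngrams(word, n_min=3, n_max=6):
    """FastText-style character n-grams of a word padded with boundary markers."""
    padded = f"<{word}>"
    return {
        padded[i:i + n]
        for n in range(n_min, n_max + 1)
        for i in range(len(padded) - n + 1)
    }


corpus = ["low", "low", "lower", "lowest", "newer", "newest", "wider"]
merges = learn_bpe(corpus, num_merges=5)

# The byte-pair side only ever remembers the merged subwords it learned...
bpe_subwords = {a + b for a, b in merges}
# ...while the n-gram side produces many overlapping pieces for every single word.
ngram_subwords = set().union(*(char_ngrams(w) for w in corpus))

print("learned merges:", merges)
print("segmentation of 'lowest':", segment("lowest", merges))
print("BPE subwords:", len(bpe_subwords), "| character n-grams:", len(ngram_subwords))
```

The number of merges acts as a budget: the byte-pair vocabulary never grows beyond it, which is roughly where the memory savings come from. Real FastText keeps memory bounded differently, by hashing n-grams into a fixed number of buckets, so treat the count comparison here as an intuition rather than a benchmark.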
If you want to see the Rasa NLU examples repo, go here:
[ Link ]
If you want to see the Whatlies repo for these embeddings, go here:
[ Link ]
If you want to see the BPEmb repo, go here:
[ Link ]
![](https://i.ytimg.com/vi/-0IjF-7OB3s/maxresdefault.jpg)