HIGHLIGHTS
- who: Eatedal Alabdulkreem from the Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, POBox, Riyadh, Saudi Arabia have published the research work: Language-Independent Text Tokenization Using Unsupervised Deep Learning, in the Journal: (JOURNAL)
- what: This research proposes a cross-language text tokenization model using a Transformer technique. The authors compare the model with other related models in Section 4.4.
- how: The authors compared the proposed Sub-Word Byte-Pair tokenization technique (SBPT) to the state-of-the-art models ADAN and ArabADAN .
If you want to have access to all the content you need to log in!
Thanks :)
If you don't have an account, you can create one here.