Language-independent text tokenization using unsupervised deep learning

HIGHLIGHTS

  • who: Eatedal Alabdulkreem, from the Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, POBox, Riyadh, Saudi Arabia, has published the research work "Language-Independent Text Tokenization Using Unsupervised Deep Learning" in the Journal: (JOURNAL)
  • what: This research proposes a cross-language text tokenization model using a Transformer technique. The authors compare the model with other related models in Section 4.4.
  • how: The authors compared the proposed Sub-Word Byte-Pair tokenization technique (SBPT) to the state-of-the-art models ADAN and ArabADAN.
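
For readers unfamiliar with byte-pair tokenization, the general idea behind sub-word byte-pair methods can be sketched as follows. This is a minimal illustration of generic byte-pair encoding (BPE), not the paper's SBPT model, whose details are not given in this summary. The function names (learn_bpe, merge_pair, tokenize) and the toy corpus are hypothetical and chosen only for illustration.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of words (generic BPE)."""
    # Each word starts as a tuple of single characters; Counter stores word frequencies.
    vocab = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        # Pick the most frequent pair; ties are broken by the pair itself
        # so the result is deterministic (an assumption of this sketch).
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


def tokenize(word, merges):
    """Split a word into sub-word units by applying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = merge_pair(symbols, pair)
    return list(symbols)


if __name__ == "__main__":
    corpus = ["low", "low", "lower", "lowest"]  # toy corpus, illustration only
    rules = learn_bpe(corpus, num_merges=2)
    print(rules)
    print(tokenize("lowest", rules))
```

Because the procedure operates only on character statistics, it needs no language-specific rules, which is the property that makes sub-word approaches attractive for cross-language tokenization.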
 
