[PDF][PDF] A succinct n-gram language model

T Watanabe, H Tsukada, H Isozaki�- Proceedings of the ACL�…, 2009 - aclanthology.org
N-gram is a key issue even for lossy approaches. Raj and Whittaker (2003) showed a general
Ngram language model … a succinct way to represent the N-gram language model structure …

N-gram language modeling using recurrent neural network estimation

C Chelba, M Norouzi, S Bengio�- arXiv preprint arXiv:1703.10724, 2017 - arxiv.org
… practical situations: the state in a n-gram LM can be succinctly represented with (n − 1) ∗
4 … of n-gram contexts can be processed in parallel. On the downside, the n-gram context …

Bugram: bug detection with n-gram language models

S Wang, D Chollak, D Movshovitz-Attias…�- Proceedings of the 31st�…, 2016 - dl.acm.org
… Bugram selects high-level tokens that represent the structure and context of the code
using a succinct semantic representation. Take the loop “for (int i=0; i<n; i++) { foo(i); }” as an …

Internet news headlines classification method based on the n-gram language model

X Liu, G Rujia, S Liufu�- 2012 International Conference on�…, 2012 - ieeexplore.ieee.org
… According to the characteristics of the internet news headlines, short and concise and less
… puts forward N-Gram language model, by constructing two Yuan classification model, to …

Federated learning of n-gram language models

M Chen, AT Suresh, R Mathews, A Wong…�- arXiv preprint arXiv�…, 2019 - arxiv.org
… The n-gram language models trained with federated … two languages, American English and
Brazilian Portuguese. This work demonstrates that high-quality n-gram language models can …

Transformers Can Represent -gram Language Models

A Svete, R Cotterell�- arXiv preprint arXiv:2404.14994, 2024 - arxiv.org
… LMs and $n$-gram LMs, a simple and historically relevant class of language models. We …
attention mechanisms can exactly represent any $n$-gram LM, giving us a concrete lower …

URL-based web page classification: With n-gram language models

TA Abdallah, B de La Iglesia�- …�Joint Conference, IC3K 2014, Rome, Italy�…, 2015 - Springer
… The URL is very concise, and may be composed of … We have introduced an n-gram language
model for this task as a … the n-gram language model and test their impact on the models

Using suffix arrays as language models: Scaling the n-gram

H Stehouwer, M van Zaanen�- The 22nd Benelux Conference on�…, 2010 - pure.mpg.de
… In this article, we propose the use of suffix arrays to implement n-gram language models with
practically unlimited size n. These unbounded n-grams are called ∞-grams. This approach …

[PDF][PDF] Subword language modeling with neural networks

T Mikolov, I Sutskever, A Deoras, HS Le…�- preprint (http://www�…, 2012 - datascienceassn.org
… network based language models can be order of magnitude smaller than compressed n-gram
models, … n-gram models by counting 3 bytes for each n-gram entry. Nonetheless, the 54M …

N-gram language models for massively parallel devices

N Bogoychev, A Lopez�- 54th Annual Meeting of the Association�…, 2016 - research.ed.ac.uk
… We observe that N-gram order affects throughput of the GPU language model much more
than the CPU one. This is likely due to effects of backoff queries, which are more optimized in …