All Posts tagged as "data science"

R Package For Word Prediction

Wordpredictor is a R package for developing and evaluating n-gram models. It provides methods for data analysis, data cleaning, data sampling, converting text to tokens, generating n-gram transition probabilities, generating n-gram models and evaluating n-gram model performance.


R6 class for processing large text files

In this article I will describe a R6 class that I had developed for processing large text files.


Using N-gram models for Word Prediction

N-gram models have applications in many areas such as spell checking, biological sequence analysis, speech recognition, word prediction and more. In this article I will provide an overview of n-gram models.