data science

data science

R Package For Word Prediction

Wordpredictor is a R package for developing and evaluating n-gram models. It provides methods for data analysis, data cleaning, data sampling, converting text to tokens, generating n-gram transition probabilities, generating n-gram models and evaluating n-gram model performance.

Read
data science

R6 class for processing large text files

In this article I will describe a R6 class that I had developed for processing large text files.

Read
data science

Using N-gram models for Word Prediction

N-gram models have applications in many areas such as spell checking, biological sequence analysis, speech recognition, word prediction and more. In this article I will provide an overview of n-gram models.

Read