data science

R Package For Word Prediction

Wordpredictor is a R package for developing and evaluating n-gram models. It provides methods for data analysis, data cleaning, data sampling, converting text to tokens, generating n-gram transition probabilities, generating n-gram models and evaluating n-gram model performance.

Read
data science

R6 class for processing large text files

In this article I will describe a R6 class that I had developed for processing large text files.

Read
data science

Using N-gram models for Word Prediction

N-gram models have applications in many areas such as spell checking, biological sequence analysis, speech recognition, word prediction and more. In this article I will provide an overview of n-gram models.

Read

Expressing numbers as a sum of fives and sevens

Recursion is one method for proving statements mathematically. It is a powerful concept that has important applications in Computer Science. In this article I will describe how to use the technique of Recursion to prove a statement mathematically.

Read