Data and tools to compile word frequencies, trigrams and more for use with NLP, spelling correction etc.