Pontifications The R textclean package looks to be an excellent way to clean text including removing HTML and whitespace Leave a comment on github