Literary and Linguistic Computing 2007 22(2):151-165. Processing Internet-derived Text--Creating a Corpus of Usenet Messages. Sebastian Hoffmann. http://llc.oxfordjournals.org/cgi/content/abstract/22/2/151
Suzana