Got a lot of digitized text? Not sure what to do with it? Try text mining!
I’d like to hold a Text Mining BootCamp for those interested in using computers to extract information from raw text. At one level above messy OCR, I will first introduce the teriminology and possibilities – when we talk about the “information in text” what do we mean? What kinds of things has computational linguistics made it possible to extract from words, sentences, and document collections? To make it concrete, I will work with example scholarly questions from real humanists, and show how to they are translated into computational terms.
Then the tools: I will introduce and demonstrate the text mining toolkits accessible to scholars with no programming experience, and touch upon other tools, suitable for more experienced programmers.test Filed under BootCamp | Comments (4)