0
kicks
Using Tika in .NET for extracting text out of documents
Tika is an open source Java-based tool for extracting information out of various different document formats. It can be used with Lucene for indexing and searching documents, among other things. In this post, Kevin Miller talks about how use Tika in .NET via the IKVM.net utility.