Indexing Office and PDF Files With Sphinx and .NET

added by devenv_exe
2/7/2013 3:34:26 AM

The Sphinx full-text search engine cannot index PDF, DOC, XLS, and similar files directly. This article describes an easy way to extract the text from these document types so that it can be stored in a Sphinx full-text index.