I am building a site where we will need to search PDF documents and possibly Word and Rich Text documents as well. I have done this in the past with Java libraries on a different web platform. Are there any extensions available, or any technical documents or samples out there?
You will have to look into Solr. The search engine needs to parse and index the PDF documents, which the Magento search does not do: its only source of search material is a fulltext index it creates from fields contained in the database.
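To give a rough idea of what the Solr side could look like, here is a minimal Python sketch that posts a PDF to Solr's extracting request handler (`/update/extract`, also known as Solr Cell). It assumes that handler is enabled in your Solr configuration; the host, the core name `documents`, and the helper names `build_extract_url` and `index_pdf` are placeholders I made up for illustration, and none of this is part of Magento.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_extract_url(base_url, core, doc_id, commit=True):
    """Build the URL for Solr's extracting request handler.

    base_url and core are assumptions about your setup, for example
    "http://localhost:8983/solr" and "documents".
    """
    # literal.id sets the unique key of the indexed document.
    params = {"literal.id": doc_id}
    if commit:
        # Commit right away so the document becomes searchable.
        params["commit"] = "true"
    return f"{base_url.rstrip('/')}/{core}/update/extract?{urlencode(params)}"


def index_pdf(path, doc_id, base_url="http://localhost:8983/solr", core="documents"):
    """Send the raw PDF bytes to Solr, which extracts and indexes the text."""
    with open(path, "rb") as fh:
        body = fh.read()
    request = Request(
        build_extract_url(base_url, core, doc_id),
        data=body,
        headers={"Content-Type": "application/pdf"},
        method="POST",
    )
    with urlopen(request) as response:
        return response.status
```

Calling something like `index_pdf("manual.pdf", "manual-42")` against a running Solr instance would submit the file for text extraction. The search results would still need to be wired into the storefront separately.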
I did look into Solr a bit and it looks like Solr is the most viable solution. Thank you for validating.