Forum Discussion
Wavel
Mar 10, 2021 | Copper Contributor
Word search through thousands of PDFs?
Is this the appropriate product to use if I want to create a word search index for thousands of PDF files and then query it from my ASP.NET application?
Luis Cabrera-Cordon
Mar 10, 2021 | Former Employee
Wavel, I am not sure what your budget is, but here is an idea: use S1 (where the limit will be 4 MB of text, about 1300 pages per document). That will only cost $250 per month.
For the bigger documents, it may not be worth paying 4X (I imagine you probably have a few outliers that are bigger than 1300 pages). In that case, maybe just take the first 1000 pages of content or so. That may be the best bang for the buck given your need...
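For anyone who wants to try the "keep only the first part" idea, here is a minimal sketch of cutting extracted text down to a byte budget. It assumes the 4 MB figure above means 4 * 1024 * 1024 bytes of UTF-8 text and that the PDF text has already been extracted into a string; `truncate_to_bytes` and `LIMIT_BYTES` are hypothetical names, not part of any search SDK.

```python
# Assumption: the per-document text limit is 4 MB, counted as UTF-8 bytes.
LIMIT_BYTES = 4 * 1024 * 1024


def truncate_to_bytes(text: str, limit: int = LIMIT_BYTES) -> str:
    """Return the longest prefix of text whose UTF-8 encoding fits in limit bytes."""
    encoded = text.encode("utf-8")
    if len(encoded) <= limit:
        return text
    # Slicing bytes can cut a multi-byte character in half;
    # errors="ignore" drops that trailing partial character.
    return encoded[:limit].decode("utf-8", errors="ignore")
```

Anything past the cut-off is simply not searchable, so this only fits cases where missing the tail of a document is acceptable.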
Wavel
Mar 10, 2021 | Copper Contributor
You are correct, there are only a few outliers. However, I have to index the entire document. We can't miss any matches when our subscribers do a search.
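One possible way to keep every page searchable without moving to a bigger tier is to split each oversized document into several pieces that each fit under the limit. The sketch below is only an illustration of that idea, not something the thread or the product confirms: it assumes the limit applies per indexed document in UTF-8 bytes, and `split_into_chunks` is a hypothetical helper.

```python
def split_into_chunks(text: str, limit: int = 4 * 1024 * 1024) -> list[str]:
    """Split text into consecutive pieces of at most limit UTF-8 bytes each."""
    if limit < 4:
        # A single UTF-8 character can take up to 4 bytes.
        raise ValueError("limit must be at least 4 bytes")
    data = text.encode("utf-8")
    chunks = []
    start = 0
    while start < len(data):
        end = min(start + limit, len(data))
        # Continuation bytes look like 0b10xxxxxx; back up so the cut
        # never lands in the middle of a multi-byte character.
        while end < len(data) and (data[end] & 0xC0) == 0x80:
            end -= 1
        chunks.append(data[start:end].decode("utf-8"))
        start = end
    return chunks


# Small limit used here only to make the behaviour visible.
pieces = split_into_chunks("héllo", limit=4)
print(pieces)  # ['hél', 'lo']
```

Each piece would then be uploaded as its own entry that also records which original PDF it came from, so a hit can be mapped back to the file. Note that this sketch can cut a word in half at a boundary; backing up to the previous whitespace instead would avoid that.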
My suggestion is to rethink the pricing structure. Base it on the total number of bytes being indexed, not the individual document size. Indexing 100 5 MB files shouldn't cost so much more than 5000 2 KB files (or whatever math makes my argument work 😉).
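For reference, the totals behind that example can be worked out quickly. The snippet assumes binary units (1 KB = 1024 bytes, 1 MB = 1024 KB); `total_bytes` is just an illustrative helper.

```python
KB = 1024
MB = 1024 * KB


def total_bytes(file_count: int, bytes_per_file: int) -> int:
    """Total size of file_count files that are bytes_per_file each."""
    return file_count * bytes_per_file


large_total = total_bytes(100, 5 * MB)   # 100 files of 5 MB each
small_total = total_bytes(5000, 2 * KB)  # 5000 files of 2 KB each

print(large_total / MB)           # 500.0 (MB)
print(small_total / MB)           # 9.765625 (MB)
print(large_total / small_total)  # 51.2
```

So with these particular numbers the 100 large files hold roughly 50 times more bytes than the 5000 small ones; a pricing model based on total bytes would charge them very differently, which is why the example would need different figures to compare like with like.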