Forum Discussion
Searchable PDFs not searchable within SharePoint document library
Hi, are you talking about SharePoint Online?
Just to be sure: when you are in a library, could you type the following in the search box?
content:<a word present in your pdf> filetype:pdf
This looks specifically for that word in the content of all PDF files. If you see results, your PDFs are searchable and the issue is somewhere else.
Are you using Information Rights Management (IRM)? I think that in that case the content is not searchable yet (it is on the roadmap).
- alex_k60, Oct 03, 2021, Copper Contributor
Thanks for answering. We're using SharePoint Online, and we're not using Information Rights Management. What else could it be? I've even saved a Word document as a PDF and still can't search its contents.
Thanks
- Vertebre85, Oct 03, 2021, Iron Contributor
Could you go to the main page of SharePoint Online (https://<yourdomain>.sharepoint.com) and try the search there with the keyword "content:..." to see whether it's a domain-wide issue or specific to one site?
If it has never worked, I would advise contacting Microsoft support. On SharePoint Server, this is often due to an issue with the search crawl.
If you have PnP PowerShell, you can look at the crawl log: https://www.sharepointdiary.com/2019/07/get-search-crawl-log-in-sharepoint-online-using-powershell.html
Officially, it's not possible to start or restart a crawl manually in SharePoint Online. I've seen some "hacks" but never tested them.
- alex_k60, Oct 04, 2021, Copper Contributor
I've done the above and see a few items in the log file, but nothing for the document library in particular at https://companyname.sharepoint.com/Finance/company1_invoices
However, there is the entry below. Is that the scan for the whole of Finance, or does it only cover the root directory and not the document library "finance/company1_invoices"?
"Url : https://companyname.sharepoint.com/Finance
CrawlTime : 03/10/2021 15:02:38
ItemTime : 01/01/0001 00:00:00
LogLevel : Success
Status :
ItemId : 11257
ContentSourceId : 1"
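
If the crawl log is saved as text in the "Key : Value" layout shown above, a small script can check whether any entry's Url sits at or below the library URL. This is only a rough sketch: the function names `parse_crawl_entries` and `entries_under` are made up for illustration, and it assumes entries are separated by blank lines. It only compares URLs as strings; it does not say whether the crawl of a parent site also covered the library.

```python
def parse_crawl_entries(text):
    """Parse blocks of 'Key : Value' lines (separated by blank lines) into dicts."""
    entries, current = [], {}
    for raw in text.splitlines():
        line = raw.strip().strip('"')  # drop surrounding whitespace and stray quotes
        if not line:
            if current:
                entries.append(current)
                current = {}
            continue
        # Split on the first colon only, so URLs and timestamps stay intact.
        key, sep, value = line.partition(":")
        if sep:
            current[key.strip()] = value.strip()
    if current:
        entries.append(current)
    return entries


def entries_under(entries, library_url):
    """Return entries whose Url equals library_url or lies beneath it."""
    prefix = library_url.rstrip("/").lower()
    matches = []
    for entry in entries:
        url = entry.get("Url", "").lower()
        if url == prefix or url.startswith(prefix + "/"):
            matches.append(entry)
    return matches


# The single entry quoted in the post above.
sample = """
Url : https://companyname.sharepoint.com/Finance
CrawlTime : 03/10/2021 15:02:38
ItemTime : 01/01/0001 00:00:00
LogLevel : Success
Status :
ItemId : 11257
ContentSourceId : 1
"""

entries = parse_crawl_entries(sample)
library = "https://companyname.sharepoint.com/Finance/company1_invoices"
print(len(entries_under(entries, library)))  # prints 0
```

With only the entry quoted above, the count is 0, which matches the observation that the log has nothing logged specifically for the library URL.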