Quantcast
Viewing all articles
Browse latest Browse all 26374

SharePoint 2013 - PDF crawl issue while crawling content of a public site

Hi,

Scenario:

I have 3 content sources on my SharePoint search administration.

Content Source 1:

I have a SharePoint site with PDF as well as other type of content. I have created a content source for this sharepoint site. The content including the PDF files are crawled with this content source without any issue.

Content Source 2:

I have created another content source for a public site. This public site is not a SharePoint site. The crawl log shows below mentioned warning for all the PDF files. The other files are getting crawled without any issue.

"The filtering process could not load the item. This is possibly caused by an unrecognized item format or item corruption."

Just to make sure there is no issue with the file on the public site, I downloaded a file with such an issue, uploaded it on SharePoint site and crawled the content. It is crawled with the content source of the sharepoint site without any issue.

Content Source 3:

I have created third content source for a different public site. This public site is not a SharePoint site.The content including the PDF files are crawled with this content source without any issue.

So the crawling process does not crawl PDF contents from one public site while it crawls PDF content of the other public site.

Please let me know if anyone has encountered such an issue and has solution for it.

Thanks,

Hemil



Viewing all articles
Browse latest Browse all 26374

Trending Articles