Hi All,
We have been experiencing a major issue with our SharePoint Portal Server
2003 content index crawls over the past 18 months.
Whenever a crawl (both full and incremental) is carried out, the size of the
index will increase exponentially and the crawl never completes as it slows
down to a grinding halt and may only index one document per every hour.
To cut a (very) long story short, we spent a huge amount of time trying to
identify the cause of the problem.
After many escalations to Microsoft and months of troubleshooting, our only
course of action was to build a new server and change some of the underlying
structure of how the documents were stored.
This wasn't a major issue as the server was old and in need of an upgrade.
After the server upgrade and migration everything appeared to work fine.
The content index crawls and size were perfect and it appeared our problem
had been resolved.
That was until I installed the Adobe PDF iFilter 6.0.
As soon as the PDF iFilter was installed, the same problems we had on the
old server appeared again.
The size of one index shot up from 500Mb to about 1.5GB and had only found
about 10 new documents within a 24 hour period.
As soon as I uninstalled the iFilter and ran another full update the size
went back down to 500Mb and took only 5 hours to complete.
My question is, has anybody experienced the same issue and can it be
resolved?
I have tried installing the full version of Acrobat Writer on both front end
and backend servers, which didn't make a difference.
I believe that the problem is more related to the actual PDF files than the
iFilter, as I can't seem to find much information about this problem.
Some of the PDF files are quite old and large and contain a lot of images.
I would appreciate any help/suggestions you could give me.
Kind regards,
Nick