Web collaboration consultant, public speaker and Microsoft Press Author. @resing on twitter

Whither Goest the SharePoint Gatherer?

Is the Gatherer a part of SharePoint 2007 or a relic of the past?

The SharePoint log known as the ULS (Unified Logging Service) shows some errors regarding the Gatherer process, yet the documentation on SharePoint 2007 reveals no such beast to me. That raises the question: what in the world is the Gatherer, and how is it related to search?

The answer appears to lie in some legacy code that goes at least as far back as SharePoint Portal Server 2001.

In my research, I compared the following two MSDN articles:
1.    Content Crawling and Search Overview, in the SharePoint Portal Server 2001 SDK
2.    Enterprise Search Architecture, in the SharePoint Server 2007 SDK
The second appears to be a rewrite of the first, but notice that the numerous references to the Gatherer in the 2001 docs have been removed from the 2007 docs.

Is this a component that has not entirely gone away? Or did the technical writers simply change how they describe the crawl process?

It’s not gone completely, and at least one SharePoint author didn’t get the memo about the change in how search is described. This article, Searching in MOSS 2007, is excerpted from Chapter 8, "Advanced Configurations," of the book Beginning SharePoint 2007 Administration: Windows SharePoint Services 3.0 and Microsoft Office SharePoint Server 2007.

Further, check out the following two excerpts from MSDN and guess which one refers to 2001:

[image: first MSDN excerpt]

[image: second MSDN excerpt]

If you see the following error in your Event Viewer, my best guess is that it occurs during the crawl, in a process similar to the one described in the 2001 docs. What do you think?

 

Event Source: Windows SharePoint Services 3 Search
Event Category: Gatherer
Event ID: 2436
Computer: SERVERNAME
Description:
The start address <sts3://*******/contentdbid={GUID}> cannot be crawled.
Context: Application ‘Search index file on the search server’, Catalog ‘Search’
Details:
Access is denied. Check that the Default Content Access Account has access to this content, or add a crawl rule to crawl this content. (0x80041205)
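
If you collect a lot of these events, it can help to pull out the pieces that matter: the category, the event ID, the start address and the error code. Here is a minimal Python sketch that does this for the event text above. The helper name parse_gatherer_event and the regular expressions are my own assumptions based on this one sample, not anything SharePoint provides, and other events may be laid out differently.

```python
import re

# Event text copied from the sample above; the masked host and the
# {GUID} placeholder are left exactly as they appear in the post.
EVENT_TEXT = """\
Event Source: Windows SharePoint Services 3 Search
Event Category: Gatherer
Event ID: 2436
Computer: SERVERNAME
Description:
The start address <sts3://*******/contentdbid={GUID}> cannot be crawled.
Context: Application ‘Search index file on the search server’, Catalog ‘Search’
Details:
Access is denied. Check that the Default Content Access Account has access to this content, or add a crawl rule to crawl this content. (0x80041205)
"""

# Header lines of the form "Name: value" that we want to keep.
HEADER_RE = re.compile(r"^(Event Source|Event Category|Event ID|Computer):\s*(.+)$")


def parse_gatherer_event(text):
    """Hypothetical helper: extract header fields, start address and error code."""
    fields = {}
    for line in text.splitlines():
        match = HEADER_RE.match(line)
        if match:
            fields[match.group(1)] = match.group(2).strip()

    # The start address sits between angle brackets in the description.
    addr = re.search(r"start address <([^>]+)>", text)
    # The error code is an 8-digit hex value in parentheses.
    code = re.search(r"\((0x[0-9A-Fa-f]{8})\)", text)

    fields["start_address"] = addr.group(1) if addr else None
    fields["error_code"] = code.group(1) if code else None
    return fields


event = parse_gatherer_event(EVENT_TEXT)
print(event["Event Category"], event["Event ID"], event["error_code"])
# prints: Gatherer 2436 0x80041205
```

Filtering on the "Gatherer" category this way makes it easy to see which start addresses fail with the same access-denied code.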
