Crawling connectors - Apache ManifoldCF
The crawling phase contacts the remote data repositories, retrieves the data, pre-processes it, and hands it over to the indexing engine. This phase relies on Apache ManifoldCF, which handles full or partial data retrieval, can connect to many different types of data repositories, and handles the security aspects of indexing and searching. The full documentation for the crawl administrator is located on the Apache ManifoldCF website, more precisely on the ManifoldCF documentation page.
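To make the flow concrete, here is a minimal conceptual sketch of a crawl pass: contact a repository, skip documents that have not changed (partial retrieval), pre-process the rest, and hand them to an index together with their access rights. This is not ManifoldCF code and does not use its API; every name in it (Document, InMemoryRepository, InMemoryIndex, preprocess, crawl) is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Document:
    doc_id: str
    content: str
    version: int
    # Users allowed to see the document (stand-in for repository security).
    allowed_users: Set[str] = field(default_factory=set)


class InMemoryRepository:
    """Hypothetical stand-in for a remote data repository."""

    def __init__(self, docs: List[Document]) -> None:
        self._docs = {d.doc_id: d for d in docs}

    def list_documents(self) -> List[Document]:
        return list(self._docs.values())


class InMemoryIndex:
    """Hypothetical stand-in for the indexing engine."""

    def __init__(self) -> None:
        self.entries: Dict[str, Document] = {}

    def add(self, doc: Document) -> None:
        self.entries[doc.doc_id] = doc

    def search(self, term: str, user: str) -> List[str]:
        # Only return documents the user is allowed to see.
        return sorted(
            d.doc_id
            for d in self.entries.values()
            if term.lower() in d.content and user in d.allowed_users
        )


def preprocess(doc: Document) -> Document:
    # Assumed pre-processing: collapse whitespace and lowercase the text.
    cleaned = " ".join(doc.content.split()).lower()
    return Document(doc.doc_id, cleaned, doc.version, set(doc.allowed_users))


def crawl(repo: InMemoryRepository, index: InMemoryIndex,
          seen_versions: Dict[str, int]) -> int:
    """Run one crawl pass and return how many documents were sent to the index."""
    sent = 0
    for doc in repo.list_documents():
        # Partial retrieval: skip documents whose version is already indexed.
        if seen_versions.get(doc.doc_id) == doc.version:
            continue
        index.add(preprocess(doc))
        seen_versions[doc.doc_id] = doc.version
        sent += 1
    return sent
```

Running crawl a first time with an empty seen_versions dictionary corresponds to a full retrieval; running it again with the same dictionary only picks up documents whose version has changed.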