/
Crawling connectors - Apache ManifoldCF

Crawling connectors - Apache ManifoldCF

The crawling phase focuses on contacting the remote data repositories, retrieving the data, pre-processing it, and handing it over to the indexing engine. Using Apache ManifoldCF, it handles full or partial data retrieval, can connect to many different types of data repositories, and handles the security phase of indexing and searching. The full documentation for the crawl administrator is naturally located on the apache manifoldCF website, the updated pointer should be  more precisely on the ManifoldCF documentation info page.

 

The connectors that are currently supported into Datafari are (valid from Datafari 6)

  • Databases :

    • Oracle

    • Postgresql

    • MySQL

    • MariaDB

    • Microsoft SQL server

    • Sybase

  • Web

  • CSV

  • Drupal

  • Alfresco

  • Solr

  • SMB / Fileshare

  • Office 365

  • Sharepoint 2010-2019

  • Sharepoint Online

  • Tuleap

  • XWiki

  • Jamespot

  • Typo3

  • Jira

  • Confluence

  • Google drive

  • Dropbox

  • Email

Related content