Crawling connectors - Apache ManifoldCF
The crawling phase consists of contacting the remote data repositories, retrieving the data, pre-processing it, and handing it over to the indexing engine. Datafari relies on Apache ManifoldCF for this phase: it handles full or partial data retrieval, can connect to many different types of data repositories, and handles the security phase of indexing and searching. The full documentation for crawl administrators is available on the Apache ManifoldCF website; the most up-to-date pointer is the ManifoldCF documentation page.
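The four steps of the crawling phase can be pictured with a minimal sketch. It is purely illustrative and does not use the ManifoldCF API: `Document`, `InMemoryRepository`, `preprocess`, and `crawl` are hypothetical names, the "repository" is an in-memory list, and the "indexing engine" is a plain dictionary. The `allowed` field is an assumption about how access information might travel with each document so that search results can later be filtered per user.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Optional


@dataclass
class Document:
    """A crawled item plus the access tokens that travel with it (hypothetical)."""
    doc_id: str
    content: str
    version: int
    allowed: List[str] = field(default_factory=list)


class InMemoryRepository:
    """Hypothetical stand-in for a remote repository (file share, database, ...)."""

    def __init__(self, documents: Iterable[Document]):
        self._documents = list(documents)

    def fetch(self, since_version: Optional[int] = None) -> List[Document]:
        # Full retrieval when since_version is None, partial retrieval otherwise.
        if since_version is None:
            return list(self._documents)
        return [d for d in self._documents if d.version > since_version]


def preprocess(doc: Document) -> Document:
    # Toy pre-processing step: collapse runs of whitespace into single spaces.
    cleaned = " ".join(doc.content.split())
    return Document(doc.doc_id, cleaned, doc.version, doc.allowed)


def crawl(repo: InMemoryRepository,
          index: Dict[str, Document],
          since_version: Optional[int] = None) -> int:
    """Retrieve, pre-process, and hand documents to the index; return the count."""
    count = 0
    for doc in repo.fetch(since_version):
        index[doc.doc_id] = preprocess(doc)
        count += 1
    return count
```

Calling `crawl(repo, index)` performs a full retrieval, while `crawl(repo, index, since_version=n)` only processes documents whose version is greater than `n`, which mirrors the full versus partial retrieval distinction in a simplified way.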
The connectors currently supported in Datafari (valid from Datafari 6) are:
Databases:
  Oracle
  PostgreSQL
  MySQL
  MariaDB
  Microsoft SQL Server
  Sybase
Web
CSV
Drupal
Alfresco
Solr
SMB / Fileshare
Office 365
SharePoint 2010-2019
SharePoint Online
Tuleap
XWiki
Jamespot
TYPO3
Jira
Confluence
Google Drive
Dropbox
Email