Spaces
Teams
Apps
Templates
Create
Datafari Documentation
All content
Space settings
Content
Results will update as you type.
Download
•
Foreword
•
Introduction
•
Release notes - Community Edition
•
Architecture of Datafari
•
Quick Start Guide
•
Differences between the Community and Enterprise Editions of Datafari
Requirements
Installation
System configuration
Datafari Configuration
Datafari Roles and Users
Crawling connectors - Apache ManifoldCF
•
Concepts of our crawling framework
Indexing pipeline (available transfo connectors)
•
Content Limiter transformation connector Configuration
Data Extraction Server (Tika) Configuration
•
Data Extraction Server Configuration
•
Tika Server - Easy creation & configuration
•
Doc Filter Connector
•
Metadata Cleaner Connector
•
Emptier Connector
•
Regex Entity Connector
Spacy Named Entity Recognition (NER)
Optical Character Recognition (OCR) Configuration
[DEPRECATED] Indexing pipeline
DB connector
Web connector
•
MCF Simplified UI configuration
•
Crawl jobs configuration best practices
•
Force a MCF job to reindex all documents without deleting them from the current index
•
Simple history retention time
•
CSV Connector
•
Alfresco connector configuration
•
Atomic Update Management
•
Force a MCF job to reindex all documents AND deleting them from the current index
•
Apache ManifoldCF Connectors documentation
[DEPRECATED] Local filesystem connector
Analytics
•
Favorites Configuration
•
Highlighting configuration
•
Opening Files with Browsers Configuration
•
Promolinks Configuration
•
Search index Configuration
Search Relevancy
•
SearchAggregator Configuration
Security Configuration
•
Audit and Privacy
•
Help and Privacy policy pages
•
Datafari Help Page administration
Semantic Configuration
Solr
User Interface Configuration and Customization
•
New Language Configuration
•
Hierarchical facet configuration
•
Detect duplicates configuration
•
OpenSearch
•
Alerts management - Mail Configuration
•
MCF Backup and Restore Configuration
•
System Configuration Manager (Zookeeper)
•
Adding entity autocomplete: from indexation to autocomplete
•
Add Yellow pages feature into Datafari
•
Add Direct Links feature into Datafari
[DEPRECATED] Datafari Configuration
Searching with Datafari
Exploitation
Development
Misc
Use Cases
•
Older versions of this guide
Datafari Documentation
/
/
Indexing pipeline (available transfo connectors)
/
Data Extraction Server (Tika) Configuration
Summarize
Data Extraction Server (Tika) Configuration
Cedric
Julien Massiera
Owned by
Cedric
Last updated:
13 Mar, 2024
by
Cedric
Version comment
1 min read
Loading data...
This documentations will help you to configure a Tika Server for various cases
{"serverDuration": 19, "requestCorrelationId": "59ab5dc8397745ba8a5b926f648b5981"}