Pre-requisites
- Have a JDK installed on your PC (Java 8)
- Have an IDE installed on your PC (in the guide we use Eclipse) and be able to launch it as root
- Have git installed on your PC
- Have maven and ant installed on your PC
Clone repository
Open a terminal and navigate to the folder where you want to have Datafari source code to be checked out (usually your workspace folder).
Perform a
git clone https://github.com/francelabs/datafari.git
to checkout the code. The root directory name is datafari.
Build Datafari for the development environment
Navigate to the folder datafari. We now call it DATAFARI_SRC.
Run a : 'mvn install' in DATAFARI_SRC.
Run a : 'ant all' in DATAFARI_SRC/debian7
Install Datafari
Run a 'dpkg -i DATAFARI_SRC/debian7/installer/dist/datafari.deb and type an admin password.
Open the project in Eclipse
In Eclipse,go to File -> Import... , type maven and « Existing Maven project ».
Select the DATAFARI_SRC folder as root directory and click Finish.
Change the permissions of the following folders :
chmod -R 777 /opt/datafari
chmod -R 700 /opt/datafari/pgsql
To add access rights to any user on datafari installation folder. Be careful : this should be done only for the development environment and should be avoided for a production deployment!
Setup the Tomcat server in Eclipse
Go to Window -> Preferences -> Server -> Runtime Environments -> Add…
and select Apache Tomcat V.9.0.
Select DATAFARI_SRC/tomcat-dev for the Tomcat Installation directory and Finish.
If Eclipse does not accept the Tomcat version, stop Eclipse, and copy the patch you can download here https://bugs.eclipse.org/bugs/attachment.cgi?id=262418&action=edit in the plugins directory of Eclipse installation directory.
In a Servers window (for example in Debug Perspectives), click on "add a server", select a "tomcat 9.0 server". Click on next and add Datafari webapp :
Click Finish.
Run Datafari
Run Datafari in normal mode. Go to /opt/datafari/bin and run ./start-datafari.sh.
Start Tomcat server with a right click → Start on the server:
Go to the Datafari URL (http://localhost:9080/Datafari)
Update Solr configuration :
If you change the solr configuration in DATAFARI_SRC/datafari-solr/solr_home, you have to run the script updateSolrConfig.sh in DATAFARI_SRC/datafari-solr to update the config in Solr Cloud.
Configure connectors
Run Datafari tomcat. Go to the Datafari URL (http://localhost:9080/Datafari). Login in administrator (admin/admin).
In the left menu, click on Connectors; the Apache ManifoldCF login page shows up. Enter admin/admin to login (click on Login button, as the Enter key may not work).
Click on « List Output Connections » of Output section, and add a new « Output Connection ».
In Name tab, set the name to « DatafariSolr ».
In Type tab, set the connection type to Solr and then click on Continue button.
In the Server tab, set port to « 8983 », set web application name to « solr » and « Core/Collection name » to FileShare.
You can now save:
Your development environment is now ready.
For more information about ManifoldCF configuration, please have a look here: Crawling