/
Setup development environment with Eclipse (Linux) for Datafari 4.x

Setup development environment with Eclipse (Linux) for Datafari 4.x

Pre-requisites

  • Have a JDK installed on your PC (Java 8)

  • Have an IDE installed on your PC (in the guide we use Eclipse) and be able to launch it as root

  • Have git installed on your PC

  • Have maven and ant installed on your PC

Clone repository

Open a terminal and navigate to the folder where you want to have Datafari source code to be checked out (usually your workspace folder).

Perform a

git clone https://github.com/francelabs/datafari.git

to checkout the code. The root directory name is datafari.

Build Datafari for the development environment

Navigate to the folder datafari. We now call it DATAFARI_SRC.

Run a : 'mvn install' in DATAFARI_SRC.

Run a : 'ant all' in DATAFARI_SRC/debian7

Install Datafari

Get the dependencies :

apt-get install curl debconf unzip sudo libc6-dev jq lsof

Run a 'dpkg -i DATAFARI_SRC/debian7/installer/dist/datafari.deb and type an admin password.

Open the project in Eclipse

In Eclipse,go to File -> Import... , type maven and « Existing Maven project ».

Select the DATAFARI_SRC folder as root directory and click Finish. 

Change the permissions of the following folders :

chmod -R 777 /opt/datafari  chmod -R 700 /opt/datafari/pgsql

To add access rights to any user on datafari installation folder. Be careful : this should be done only for the development environment and should be avoided for a production deployment!

Setup the Tomcat server in Eclipse

Go to Window -> Preferences -> Server -> Runtime Environments -> Add…

and select Apache Tomcat V.9.0.

Select DATAFARI_SRC/tomcat-dev for the Tomcat Installation directory and Finish.

If Eclipse does not accept the Tomcat version, stop Eclipse, and copy the patch you can download here https://bugs.eclipse.org/bugs/attachment.cgi?id=262418&action=edit in the plugins directory of Eclipse installation directory.

In a Servers window (for example in Debug Perspectives), click on "add a server", select a "tomcat 9.0 server". Click on next and add Datafari webapp :

Click Finish.

Run Datafari

Run Datafari in normal mode. Go to /opt/datafari/bin and run ./start-datafari.sh.

Start Tomcat server with a right click → Start on the server:

Go to the Datafari URL (http://localhost:9080/Datafari)

Update Solr configuration :

If you change the solr configuration in DATAFARI_SRC/datafari-solr/solr_home, you have to run the script updateSolrConfig.sh in DATAFARI_SRC/datafari-solr to update the config in Solr Cloud.

Configure connectors

Run Datafari tomcat. Go to the Datafari URL (http://localhost:9080/Datafari). Login in administrator (admin/admin).

In the left menu, click on Connectors; the Apache ManifoldCF login page shows up. Enter admin/admin to login (click on Login button, as the Enter key may not work).

Click on « List Output Connections » of Output section, and add a new « Output Connection ».
In Name tab, set the name to « DatafariSolr ».
In Type tab, set the connection type to Solr and then click on Continue button.
In the Server tab, set port to « 8983 », set web application name to « solr » and « Core/Collection name » to FileShare.

You can now save:

Your development environment is now ready.

For more information about ManifoldCF configuration, please have a look here: Crawling

Note that if after some time, you do a git pull and the build is not working anymore, it could be due to the Maven dependencies. Delete your local Maven repository in this case : the folder is : /your-user/.m2/



Related content

Development environment with Docker
Development environment with Docker
More like this
Architecture - Community Edition
Architecture - Community Edition
Read with this
[DEPRECATED] Setup development environment with Eclipse (Linux) for Datafari 3.1
[DEPRECATED] Setup development environment with Eclipse (Linux) for Datafari 3.1
More like this
Extract Files Content to Raw Text Files Using Datafari
Extract Files Content to Raw Text Files Using Datafari
Read with this
Software requirements
Software requirements
More like this
Solr ingester crawler connector
Solr ingester crawler connector
Read with this