Force a MCF job to reindex all documents AND deleting them from the current index

Valid from 3.0

The documentation below is valid from Datafari v3.0.0 upwards

 In this case, we assume that you indexed some data with a MCF job and you want to delete the indexed data before restarting the MCF job. If you intend to keep the current index, you should refer to https://datafari.atlassian.net/wiki/spaces/DATAFARI/pages/2826371073

In order to do it, the steps are :

  • Make sure that your job is in "Done" mode : 

Go to Jobs -> Status and Job management

  • Copy your existing job :

Go to Jobs -> List all Jobs

Then click on Copy next to the job name 

In the next screen, change the job name, here we add "2" at the end of the name then click on the Save button.

  • Go back to the List jobs menu

So click on Jobs -> List all jobs. 

Now we have the 2 jobs present on the List all jobs menu. We now can delete the original job.

  • Delete the original job 

Click on the Delete button front of the "old" job, so here Enron files. Confirm your choice on the popup.

  • Go to the Status and job management

Click on Jobs -> Status and Job management. You will see that the Enron files job status is cleaning up. It means that MCF gives the order to Solr to delete all the documents crawled per this job.

Wait a little and click periodically on the resfresh button until the cleaning process ends. At the end, only the new job is present.

  • Start the new job

Finally you can launch your new job. Just click on the start button next to it and the data will be crawled and indexed into Solr.