Valid from Datafari v6.0
The CSV connector lets you index each line of a CSV file as a Solr document.
Create the repository connector
Go to the MCF Admin Page and, in the “Main Navigation”, choose “List Repository Connections” and add a new connection:
Choose a name, then select CSV in the “Type” tab:
That's it: your repository connector is created with the appropriate default values:
Create the CSV Job with this minimal configuration:
In the connection pipeline, you do not need to add a Tika connector, since a CSV file is a plain text format.
For the CSV file paths parameter, you must specify the files to be used. Paths use local file syntax, so if you need to access remote files, first mount the directory containing your CSV files locally, for example with Samba.
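Because the paths must resolve as local files, it can help to confirm they are readable on the server before entering them in the job. The following is a minimal sketch, not part of Datafari or MCF: the helper name find_unreadable_paths is hypothetical, and the example paths in the usage note are placeholders for your own mount point.

```python
import os


def find_unreadable_paths(paths):
    """Return the paths that are not readable regular files.

    Hypothetical pre-check helper: it only tests local visibility,
    which is what a locally mounted share (e.g. Samba) must provide.
    """
    return [
        p for p in paths
        if not (os.path.isfile(p) and os.access(p, os.R_OK))
    ]
```

For example, find_unreadable_paths(["/mnt/csv_share/products.csv"]) returns an empty list when the (placeholder) file is visible and readable, and returns the path itself otherwise. Run the check as the same user that runs the crawl, since permissions may differ between accounts.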
To add metadata, name the columns of your CSV file after Solr fields. Columns whose names do not correspond to any Solr field are ignored.
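The mapping described above can be illustrated with a short sketch. This is not the connector's actual code, only a model of the documented behavior: each data line becomes one document, and columns with unknown names are dropped. The field names in KNOWN_SOLR_FIELDS and the comma delimiter are assumptions made for the example; the real set of fields depends on your Solr schema.

```python
import csv
import io

# Hypothetical field names for illustration; check your own Solr schema.
KNOWN_SOLR_FIELDS = {"id", "title", "author"}


def rows_to_documents(csv_text, known_fields=KNOWN_SOLR_FIELDS):
    """Turn each CSV data line into a dict, keeping only known columns."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {name: value for name, value in row.items() if name in known_fields}
        for row in reader
    ]


# The header row names the fields; "internal_note" is not a known field.
sample = (
    "id,title,internal_note\n"
    "1,First doc,skip me\n"
    "2,Second doc,skip me too\n"
)
docs = rows_to_documents(sample)
```

Here docs contains two entries, one per data line, each with only the id and title keys; the internal_note column is left out because its name matches no field in the assumed set.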