Valid from Datafari v6.0
The CSV connector lets you index each lines of a CSV file as a Solr document.
Create the repository connector
Go the MCF Admin Page and in the “Main Navigation”, choose “List Repository Connections” and add a new connection:
Choose a name then select CSV in “Type“ tab:
And that's it, your Repository connector is created with the appropriate default values:
Create the CSV Job with this minimal configuration:
In the connection Pipeline, you don't have to add a Tika connector, as a csv file is a simple texte format.
For the CSV file paths parameter, it is necessary to specify the file names to be used. Use SAMBA, for example, to access remote files.
Use Solr fields in your CSV file to add metadata. Column names that do not correspond to any Solr field will not be used.