Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Facets panel: by default, the following facets have been configured:
    • Last modifications: allows to filter based on time windows compared to the current date. Less than 1 month old documents, Less than 1 year old documents, Less than 5 years old documents.
    • Type: allows to filter based on document types (pdf, doc, docx, xls, html ...)
    • Source: allows to filter based on the source repository configured in Apache ManifoldCF
    • Language: based on Solr capability to autodetect languages, allows to filter based on the documents language.
  • Results list: by default, each result in this list is composed as follows:
    • Graphic icon symbolising the document type
    • Document file name in bold font / Title from the HTML header title in case of web page
    • Text snippet, 3 lines maximum, surrounding the query terms found in the document. These query terms are highlighted in bold fonts.
    • Document path in the source repository (or URL for web pages)

Facets have a standard way of working: if you click on a facet value, it will filter out all the results that don't satisfy the facet value condition. In the illustration below, selecting the pdf value in the Type facet only displays pdf documents in the result panel.

Image Added

Datafari also proposes the spellchecker functionnality of Apache Solr. In the illustration below, we enter the query term "comminuque" instead of "communique". The spellchecker automatically proposes a correct word. By default, Datafari does not do a search on this suggestion, but it can do it with slight modifications.

Image Added

In the search bar, you can type in several query terms, and you can put operators to fine tune your search, such as AND and OR. Check the Apache Solr reference documentation for the full list of operators.

Image Added

Right below the search bar, three buttons allow to modify the search behaviour:

  • All words: forces the search engine to return only the documents that contain ALL the terms present in the query term
  • At least one word: that's the standard behavior. The search engine will favor documents that contain all the terms, but it not all terms are present, it will display the documents that contain a subset of the query terms
  • Exact expression: forces the search engine to return only the documents that contain the exact expression in the query term. In the illustration below, only documents containing the exact expression "communique presse" will be return. So a document containing "presse communique" won't be returned.

Image Added