Info

Valid from Datafari 5.3 up to 5.5

This documentation is only valid for Datafari 5.3 up to 5.5.

Note

Before you start using Apache Zeppelin notebooks, be sure to read section 3, “Apache Zeppelin Notebooks trap”.

Datafari 5.3 replaced the Open Distro stack with Apache Zeppelin, which is much less demanding in terms of resource consumption.

1. How does it work?

Where Open Distro required the Datafari analytics data to be indexed into Elasticsearch, that data is now indexed in dedicated Solr indexes:

...

The Kibana dashboards have been migrated to Apache Zeppelin, where notebooks replace the dashboards.

2. How to access and use Apache Zeppelin in Datafari

By default, Apache Zeppelin is started and stopped automatically along with Datafari. This is not the case if you disabled it during installation by answering “no” to the question “Do you want to enable analytic stack (yes/no) [yes] ?”, or if you disabled it in the configuration file [DATAFARI_HOME]/tomcat/conf/datafari.properties by setting the parameter “AnalyticsActivation” to false.
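To check quickly whether the analytics stack is switched off in the configuration file, the property can be read with a few lines of Python. This is only a minimal sketch: the helper names are hypothetical, the parser handles plain key=value lines only (not the full Java properties syntax), and treating a missing “AnalyticsActivation” key as “enabled” is an assumption, not documented Datafari behavior.

```python
from pathlib import Path


def read_properties(text):
    """Parse simple key=value lines, skipping blank lines and # or ! comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line[0] in "#!":
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def analytics_enabled(props):
    """Return False only when AnalyticsActivation is explicitly set to false."""
    # Assumption: a missing key is treated as enabled (the installer default).
    return props.get("AnalyticsActivation", "true").lower() != "false"


def analytics_enabled_in_file(path):
    """Read a datafari.properties file from the given path and check the flag."""
    return analytics_enabled(read_properties(Path(path).read_text(encoding="utf-8")))
```

Call analytics_enabled_in_file with the full path to datafari.properties, replacing [DATAFARI_HOME] with your actual installation directory.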

...

Once you are connected to one of the Apache Zeppelin notebooks, you can navigate between the available notebooks using the “Notebook” header menu:

...

3. Apache Zeppelin Notebooks trap

Unlike Kibana, Apache Zeppelin does not automatically refresh notebook data. In particular, when a user opens a notebook for the very first time, no data is displayed. To refresh the data of a notebook, you MUST do it manually by clicking the “Run all paragraph” button, located next to the notebook name at the top of the notebook:

...

You can also refresh a single paragraph (a visualization is called a paragraph in Apache Zeppelin) by clicking the “Run this paragraph” button in the top-right corner of that paragraph:

...
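If you would rather trigger a refresh from a script than click the buttons, Apache Zeppelin also provides a notebook REST API. The sketch below builds the corresponding request without sending it. The endpoint path comes from the Zeppelin REST API documentation and should be checked against the Zeppelin version bundled with your Datafari release. The function name, the base URL and the note ID are hypothetical, and any authentication your deployment requires is not handled here.

```python
import urllib.request
from typing import Optional


def build_run_request(base_url: str, note_id: str,
                      paragraph_id: Optional[str] = None) -> urllib.request.Request:
    """Build a POST request that asks Zeppelin to run a note or one paragraph.

    Assumption: base_url points directly at the Zeppelin server; a Datafari
    deployment may expose it under a different host, port or path.
    """
    url = f"{base_url.rstrip('/')}/api/notebook/job/{note_id}"
    if paragraph_id is not None:
        # Run a single paragraph instead of every paragraph in the note.
        url += f"/{paragraph_id}"
    return urllib.request.Request(url, data=b"", method="POST")
```

Passing the returned object to urllib.request.urlopen sends the request. Without a paragraph ID, the call corresponds to the “run all” button. With one, it corresponds to running that single paragraph.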

4. Filtering the data of a paragraph

Unlike Kibana, a notebook does not let you filter data simply by clicking on a value. Instead, you need to edit the paragraph's query directly; the query is displayed above the paragraph:

...