The default job configuration of Datafari limits the amount of text that can be extracted from a file. It also discards compressed container files (zip, tar.gz, …) and large files from the pipeline, both for stability and to avoid saturating the disks. The following pages list the limitations in place; consult them and decide for yourself whether to remove or modify those limitations. You can also reach out to us through our users mailing list (https://groups.google.com/g/datafari?pli=1) or our GitHub discussions page (Discussions · francelabs/datafari · GitHub).

Doc Filter Connector

Content Limiter transformation connector Configuration

Emptier Connector

Metadata Cleaner Connector