Data set exports

Introduction

Datasets from DataBlocks can be exported to the various Connectors (remote servers or databases) of the selected Project or of Public Projects, or downloaded to your workstation (via the Local Connector).

An export can be started:

  • from within a DataBlock, using the left-hand menu

  • from the list of DataBlocks, by clicking the export (upload) icon in the Actions column of the relevant row

In all cases, it is possible to load a pre-configured export template.

Options available depend on the Connector and output format selected.

Export data from a DataBlock

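Available export formats and options

The types of export available depend on the type of Connector chosen.

Available export formats

| Connector type              | CSV | XML | JSON | PARQUET | EXCEL | DATA WRITING |
|-----------------------------|-----|-----|------|---------|-------|--------------|
| Local (on your workstation) | YES | YES | YES  | YES     | YES   | -            |
| Server (HDFS, S3 and SFTP)  | YES | YES | YES  | YES     | YES   | -            |
| Database (BDD)              | -   | -   | -    | -       | -     | YES          |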

Example: Setting up a CSV export using a Local connector from an export template

CSV export type from a template

  • Template: Search for an export template to load for this export.
    Information: You can view and select only the templates linked to connectors to which you have access.

  • Connector: Select the connector to use for the export. The available connectors are those created and configured in the Connectors module of GenericsData.

Information: The Local Connector present in DataChain allows data to be exported locally.
Most of the connectors set up in the GenericsData Connectors module can be used by the export function.
The list of available connectors takes user rights into account: only connectors authorised for the current user are visible.

  • Format: Defines the export format.

Information: This option is only available for Connectors that allow a file-type export (CSV, Parquet, JSON, XML, Excel).

Once you have made your settings, click on "Export".

The duration of the export varies according to the parameters and the volume of data.

Export settings vary depending on the output format of the file selected.

Exporting with the Local DataChain Connector

The Local DataChain connector can be used to export files. It does not allow you to specify a path: once the file has been generated, you download it, and it is stored according to your browser settings.

Export settings vary depending on the output format of the file selected.

DataBlocks exports 1. Export options in Local mode

| Fields                   | Detail                                                                                                        | CSV | XML | JSON | PARQUET | EXCEL |
|--------------------------|---------------------------------------------------------------------------------------------------------------|-----|-----|------|---------|-------|
| Keep original labels     | Check to retain DataBlock column labels                                                                       | YES | YES | YES  | YES     | YES   |
| Compress                 | Check to compress the file (ZIP) to reduce the final volume                                                   | YES | YES | YES  | YES     | YES   |
| Encoding                 | Format in which Unicode characters are encoded                                                                | YES | -   | -    | -       | -     |
| Separator                | Character used to separate different values                                                                   | YES | -   | -    | -       | -     |
| String identifier (1)    | Character added around a string value if the character used as a separator is present in the value            | YES | -   | -    | -       | -     |
| On all strings           | If ticked, the identifier is added to all character string values.                                            | YES | -   | -    | -       | -     |
| Escape character (1)     | Character added in front of characters identical to the identifier when contained in values                   | YES | -   | -    | -       | -     |
| Keep blanks left / right | Check to preserve spaces around values (by default only spaces between values in the same cell are preserved) | YES | -   | -    | -       | -     |
| Export headers           | Select to create a header line                                                                                | YES | -   | -    | -       | YES   |
| Number of partitions     | Number of files to generate (preferable for large volume exports)                                             | YES | YES | YES  | YES     | YES   |
| Root and line tags       | Text to be used in tags (default ROWS and ROW)                                                                | -   | YES | -    | -       | -     |
| Cell coordinates         | Target cell from which to start writing data                                                                  | -   | -   | -    | -       | YES   |
| File name                | Value used as file name (UID or specified value)                                                              | YES | YES | YES  | YES     | YES   |

(1) Example of using the escape character on the identifier

  • Input value: This is an example

  • Output value: "This is an example".
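
To make the interaction between the separator, the string identifier, the "On all strings" option and the escape character more concrete, here is a minimal sketch using Python's standard csv module. It is only an illustration of the general CSV conventions, not DataChain's own implementation: the semicolon separator, the double-quote identifier, the backslash escape character and the sample rows are all assumptions chosen for the example.

```python
import csv
import io


def render_csv(rows, on_all_strings=False):
    """Render rows as CSV text with an assumed ';' separator.

    on_all_strings=False: the identifier (") is added only when needed,
    for example when the value contains the separator.
    on_all_strings=True: the identifier is added around every value.
    """
    buffer = io.StringIO()
    writer = csv.writer(
        buffer,
        delimiter=";",        # separator (assumed)
        quotechar='"',        # string identifier (assumed)
        escapechar="\\",      # escape character (assumed)
        doublequote=False,    # escape the identifier instead of doubling it
        quoting=csv.QUOTE_ALL if on_all_strings else csv.QUOTE_MINIMAL,
        lineterminator="\n",
    )
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical sample data
rows = [
    ["id", "comment"],
    ["1", "plain text"],
    ["2", "contains; separator"],
    ["3", 'say "hi"; bye'],
]

only_when_needed = render_csv(rows)
on_every_value = render_csv(rows, on_all_strings=True)

print(only_when_needed)
print(on_every_value)
```

In the first output only the values containing the separator are wrapped in the identifier, and the identifier characters inside the last value are preceded by the escape character. In the second output every value is wrapped.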

Export to a remote server

Remote server connectors are used to export files.
S3 connectors automatically delete and replace data.

Export settings vary depending on the output format of the file selected.

DataBlocks exports 2. Options for exporting to a remote server

| Fields                   | Detail                                                                                                             | CSV | XML | JSON | PARQUET | EXCEL |
|--------------------------|--------------------------------------------------------------------------------------------------------------------|-----|-----|------|---------|-------|
| Path                     | Location of data on the server (use the magnifying glass at the end of the line to explore and select the folder). | YES | YES | YES  | YES     | YES   |
| Keep original labels     | Check to retain DataBlock column labels                                                                            | YES | YES | YES  | YES     | YES   |
| Compress                 | Check to compress the file (ZIP) to reduce the final volume                                                        | YES | YES | YES  | YES     | YES   |
| Encoding                 | Format in which Unicode characters are encoded                                                                     | YES | -   | -    | -       | -     |
| Separator                | Character used to separate different values                                                                        | YES | -   | -    | -       | -     |
| String identifier (1)    | Character added around a string value if the character used as a separator is present in the value                 | YES | -   | -    | -       | -     |
| On all strings           | If ticked, the identifier is added to all character string values.                                                 | YES | -   | -    | -       | -     |
| Escape character (1)     | Character added in front of characters identical to the identifier when contained in values                        | YES | -   | -    | -       | -     |
| Keep blanks left / right | Check to preserve spaces around values (by default only spaces between values in the same cell are preserved)      | YES | -   | -    | -       | -     |
| Export headers           | Select to create a header line                                                                                     | YES | -   | -    | -       | YES   |
| Number of partitions     | Number of files to generate (preferable for large volume exports)                                                  | YES | YES | YES  | YES     | YES   |
| Root and line tags       | Text to be used in tags (default ROWS and ROW)                                                                     | -   | YES | -    | -       | -     |
| Cell coordinates         | Target cell from which to start writing data                                                                       | -   | -   | -    | -       | YES   |
| File name                | Value used as file name (UID or specified value)                                                                   | YES | YES | YES  | YES     | YES   |
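
The role of the root and line tags in an XML export can be sketched as follows. This is a minimal illustration built on Python's standard xml.etree.ElementTree module. The only element taken from this page is the pair of default tag names (ROWS and ROW). The one-child-element-per-column layout, the function name and the sample rows are assumptions, and the file actually produced by DataChain may be structured differently.

```python
import xml.etree.ElementTree as ET


def rows_to_xml(rows, root_tag="ROWS", row_tag="ROW"):
    """Serialise a list of dicts as XML: one root element, one element per row.

    The column-as-child-element layout is an assumption made for illustration.
    """
    root = ET.Element(root_tag)
    for row in rows:
        row_element = ET.SubElement(root, row_tag)
        for column, value in row.items():
            ET.SubElement(row_element, column).text = str(value)
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample data
rows = [{"id": 1, "city": "Paris"}, {"id": 2, "city": "Lyon"}]

default_xml = rows_to_xml(rows)
custom_xml = rows_to_xml(rows, root_tag="customers", row_tag="customer")

print(default_xml)
print(custom_xml)
```

With the default values the document is wrapped in a ROWS element containing one ROW element per line. Changing the two tag texts only renames those wrapping elements; the data itself is unchanged.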

Export to a database

Database connectors (SQL, NoSQL or BaseGraph) are used to export data by writing it to a database.
BaseGraph connectors (NEO4J) do not allow schema specification.

Export parameters vary according to the type of database and the write mode selected.

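Parameters for Database Connectors (SQL, NoSQL or BaseGraph)

| Fields                   | Detail                                                                            | SQL | NOSQL | BaseGraph (NEO4J) |
|--------------------------|-----------------------------------------------------------------------------------|-----|-------|-------------------|
| Keep original labels (2) | Check to keep the DataBlock column labels. Uncheck to use standard column codes.  | YES | YES   | YES               |
| Schema                   | Schema label if applicable                                                        | YES | -     | -                 |
| Target                   | Target label (UID or specified value)                                             | YES | YES   | YES               |
| Node                     | Node label (only for the "Overwrite" option)                                      | -   | -     | YES               |
| Write mode               | Specifies export behaviour in the database (Add / Replace)                        | YES | YES   | YES               |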

(2) Caution: if this option is selected when exporting to a database, some special characters in the labels may cause the export to fail.
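
The difference between the two write modes, and the reason why labels containing special characters can be problematic, can be illustrated with a small sketch using Python's built-in sqlite3 module. This is not how DataChain performs the export. It assumes that "Add" appends rows to the existing target and that "Replace" drops and recreates it. The export_rows function, the table name and the column labels are hypothetical.

```python
import sqlite3


def quote_identifier(name):
    """Quote a table or column label so spaces and symbols are accepted."""
    return '"' + name.replace('"', '""') + '"'


def export_rows(conn, target, columns, rows, write_mode="add"):
    """Write rows to the target table (assumed semantics).

    write_mode="add":     append to the existing target
    write_mode="replace": drop the target, then write the new rows
    """
    table = quote_identifier(target)
    column_list = ", ".join(quote_identifier(c) for c in columns)
    if write_mode == "replace":
        conn.execute(f"DROP TABLE IF EXISTS {table}")
    conn.execute(f"CREATE TABLE IF NOT EXISTS {table} ({column_list})")
    placeholders = ", ".join("?" for _ in columns)
    conn.executemany(f"INSERT INTO {table} VALUES ({placeholders})", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
# Original labels with a space and special characters (hypothetical)
columns = ["Customer name", "Total (€)"]

export_rows(conn, "customers", columns, [("Ada", 10)], write_mode="add")
export_rows(conn, "customers", columns, [("Bob", 20)], write_mode="add")
count_after_add = conn.execute('SELECT COUNT(*) FROM "customers"').fetchone()[0]

export_rows(conn, "customers", columns, [("Cy", 30)], write_mode="replace")
count_after_replace = conn.execute('SELECT COUNT(*) FROM "customers"').fetchone()[0]

print(count_after_add, count_after_replace)
```

In this sketch the labels are accepted only because they are quoted before being sent to the database. Without such quoting, a label such as "Total (€)" would be rejected by most SQL engines, which is the kind of error the caution above refers to.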

Quick Reference

Performing an export

| Steps | Objective                        | Action                                                     | Landmarks            |
|-------|----------------------------------|------------------------------------------------------------|----------------------|
| 1     | Access the GenericsData module   | Click the GenericsData icon in the left menu of DataChain  | GenericsData logo    |
| 2     | Access the list of DataBlocks    | Click the DataBlocks icon                                  | DataBlocks icon      |
| 4     | Find the DataBlock               | Use the available search functions                         | -                    |
| 5     | Access the Export function       | Click the export icon                                      | Export (upload) icon |
| 6     | Export popup                     | Enter the export parameters                                | -                    |
| 7     | Carry out the export             | Click the Export button                                    | -                    |