Data set exports
Introduction
Datasets from DataBlocks can be exported to the various Connectors (remote servers or databases) of the selected Project or of Public Projects, or downloaded to your workstation (via the Local Connector).
Export can be performed:
- from a DataBlock, from the left-hand menu
- from the list of DataBlocks, by clicking on the export icon in the Actions column of each line
In all cases, it is possible to load a pre-configured export template.
Options available depend on the Connector and output format selected.
Export data from a DataBlock
Available export formats and options
Different types of export are available depending on the type of Connector chosen.
Available export formats
Connector type | CSV | XML | JSON | PARQUET | EXCEL | DATA WRITING |
---|---|---|---|---|---|---|
Local (on your workstation) | | | | | | |
Server (HDFS, S3 and SFTP) | | | | | | |
Database | | | | | | |
Example: Setting up a CSV export using a Local connector from an export template
- Template: Search for an export template to load for this export.
  You can view and select only the templates linked to connectors to which you have access.
- Connector: Select the connector to be used for exporting. The connectors available are those created and configured in the Connectors module of GenericsData.
  The Local Connector present in DataChain allows data to be exported locally.
  Most of the connectors set up in the GenericsData Connectors module can be used in the export function.
  The list of available connectors takes user rights into account: only connectors authorised for the current user are visible.
- Format: Defines the export format.
  This option is only available for Connectors that allow file-type export (CSV, Parquet, JSON, XML, Excel).
Once you have made your settings, click on "Export".
The duration of the export varies according to the parameters and the volume of data.
Export settings vary depending on the output format of the file selected.
Exporting with the Local DataChain Connector
The Local DataChain connector can be used to export files. It does not allow you to specify a path: once the file has been generated, you can download it, and it is saved according to your browser settings.
Export settings vary depending on the output format of the file selected.
Fields | Detail | CSV | XML | JSON | PARQUET | EXCEL |
---|---|---|---|---|---|---|
Keep original labels | Check to retain DataBlock column labels | | | | | |
Compress | Check to compress the file (ZIP) to reduce the final volume | | | | | |
Encoding | Format in which Unicode characters are encoded | | | | | |
Separator | Character used to separate different values | | | | | |
String identifier (1) | Character added around a string value if the character used as a separator is present in the value | | | | | |
On all strings | If ticked, the identifier is added to all character string values | | | | | |
Escape character (1) | Character added in front of characters identical to the identifier when contained in values | | | | | |
Keep blanks left / right | Check to preserve spaces around values (by default only spaces between values in the same cell are preserved) | | | | | |
Export headers | Select to create a header line | | | | | |
Number of partitions | Number of files to generate (preferable for large volume exports) | | | | | |
Root and line tags | Text to be used in tags (default ROWS and ROW) | | | | | |
Cell coordinates | Target cell from which to start writing data | | | | | |
File name | Value used as file name (UID or specified value) | | | | | |
(1) Example of using the escape character on the identifier
- Input value: This is an example
- Output value: "This is an example".
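The separator, string identifier and escape character options described above behave much like the settings of a generic CSV writer. The following is a minimal sketch using Python's standard `csv` module, not DataChain's own implementation: the helper name `write_csv` and its parameters are hypothetical, and mapping "On all strings" to `csv.QUOTE_NONNUMERIC` is an assumption made for illustration.

```python
import csv
import io


def write_csv(rows, separator=";", identifier='"', escape="\\", on_all_strings=False):
    """Serialize rows to CSV text (hypothetical helper, not a DataChain API)."""
    buffer = io.StringIO()
    writer = csv.writer(
        buffer,
        delimiter=separator,      # "Separator" option
        quotechar=identifier,     # "String identifier" option
        escapechar=escape,        # "Escape character" option
        doublequote=False,        # escape embedded identifiers instead of doubling them
        # Assumption: "On all strings" wraps every non-numeric value.
        quoting=csv.QUOTE_NONNUMERIC if on_all_strings else csv.QUOTE_MINIMAL,
        lineterminator="\n",
    )
    writer.writerows(rows)
    return buffer.getvalue()


rows = [["label", "comment", "amount"], ["a;b", 'say "hi"', 3]]

# Identifier added only where needed (the value containing the separator).
minimal = write_csv(rows)
# Identifier added around every string value; the number stays bare.
everything = write_csv(rows, on_all_strings=True)

print(minimal)
print(everything)
```

In this sketch, a value that contains the separator is wrapped in the identifier, and any identifier character inside a value is prefixed with the escape character, so a reader configured with the same three settings can recover the original values.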
Export to a remote server
Remote server connectors are used to export files.
S3 connectors automatically delete and replace data.
Export settings vary depending on the output format of the file selected.
Fields | Detail | CSV | XML | JSON | PARQUET | EXCEL |
---|---|---|---|---|---|---|
Path | Location of data on the server (use the magnifying glass at the end of the line to explore and select the folder) | | | | | |
Keep original labels | Check to retain DataBlock column labels | | | | | |
Compress | Check to compress the file (ZIP) to reduce the final volume | | | | | |
Encoding | Format in which Unicode characters are encoded | | | | | |
Separator | Character used to separate different values | | | | | |
String identifier (1) | Character added around a string value if the character used as a separator is present in the value | | | | | |
On all strings | If ticked, the identifier is added to all character string values | | | | | |
Escape character (1) | Character added in front of characters identical to the identifier when contained in values | | | | | |
Keep blanks left / right | Check to preserve spaces around values (by default only spaces between values in the same cell are preserved) | | | | | |
Export headers | Select to create a header line | | | | | |
Number of partitions | Number of files to generate (preferable for large volume exports) | | | | | |
Root and line tags | Text to be used in tags (default ROWS and ROW) | | | | | |
Cell coordinates | Target cell from which to start writing data | | | | | |
File name | Value used as file name (UID or specified value) | | | | | |
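To illustrate the root and line tags option listed above, here is a minimal sketch of how tabular rows could be wrapped in a root tag and one line tag per row, using Python's standard `xml.etree.ElementTree`. The helper `rows_to_xml` is hypothetical, and the choice to emit one child element per column is an assumption: the exact XML layout produced by DataChain is not described here.

```python
import xml.etree.ElementTree as ET


def rows_to_xml(columns, rows, root_tag="ROWS", row_tag="ROW"):
    """Wrap rows in root/line tags (hypothetical helper, not a DataChain API)."""
    root = ET.Element(root_tag)                 # root tag, default ROWS
    for row in rows:
        line = ET.SubElement(root, row_tag)     # line tag, default ROW
        for name, value in zip(columns, row):
            # Assumption: each column becomes a child element named after it.
            ET.SubElement(line, name).text = str(value)
    return ET.tostring(root, encoding="unicode")


columns = ["id", "city"]
rows = [[1, "Paris"], [2, "Lyon"]]

default_xml = rows_to_xml(columns, rows)
custom_xml = rows_to_xml(columns, rows, root_tag="cities", row_tag="entry")

print(default_xml)
print(custom_xml)
```

Changing the two tag values only renames the wrapping elements; the data carried by each row is unchanged.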
Export to a database
Database connectors (SQL, NoSQL or BaseGraph) export data by writing it directly to the target database.
BaseGraph connectors (NEO4J) do not allow schema specification.
Export parameters vary according to the type of database and the write mode selected.
Parameters for Database Connectors (SQL, NoSQL or BaseGraph)
Fields | Detail | SQL | NOSQL | BaseGraph (NEO4J) |
---|---|---|---|---|
Keep original labels (2) | Check to keep the DataBlock column labels. Uncheck to use standard column codes. | | | |
Schema | Schema label if applicable | | | |
Target | Target label (UID or specified value) | | | |
Node | Node label (only for the "Overwrite" option) | | | |
Write mode | Specifies export behaviour in the database (Add / Replace) | | | |
(2) Caution: if this option is selected when exporting to a database, some special characters in the labels may cause the export to fail.
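The difference between the two write modes, and the reason special characters in labels can be a problem, can be sketched with Python's standard `sqlite3` module standing in for a SQL target. This is an illustration under stated assumptions, not DataChain's behaviour: the helpers `quote_ident` and `export_rows` are hypothetical, "Add" is assumed to append rows to the existing target, and "Replace" is assumed to discard the existing target before writing.

```python
import sqlite3


def quote_ident(name):
    """Quote a label so spaces or symbols do not break the SQL statement."""
    return '"' + name.replace('"', '""') + '"'


def export_rows(conn, target, columns, rows, write_mode="Add"):
    """Write rows to `target` (hypothetical helper, not a DataChain API)."""
    if write_mode not in ("Add", "Replace"):
        raise ValueError(f"unknown write mode: {write_mode}")
    table = quote_ident(target)
    cols = ", ".join(quote_ident(c) for c in columns)
    if write_mode == "Replace":
        # Assumption: Replace drops whatever the target already contains.
        conn.execute(f"DROP TABLE IF EXISTS {table}")
    conn.execute(f"CREATE TABLE IF NOT EXISTS {table} ({cols})")
    placeholders = ", ".join("?" for _ in columns)
    conn.executemany(f"INSERT INTO {table} VALUES ({placeholders})", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
columns = ["Customer name", "Total (EUR)"]  # original labels with special characters

export_rows(conn, "sales", columns, [("Ada", 10)], write_mode="Add")
export_rows(conn, "sales", columns, [("Bob", 20)], write_mode="Add")
count_after_add = conn.execute('SELECT COUNT(*) FROM "sales"').fetchone()[0]

export_rows(conn, "sales", columns, [("Eve", 30)], write_mode="Replace")
count_after_replace = conn.execute('SELECT COUNT(*) FROM "sales"').fetchone()[0]

print(count_after_add, count_after_replace)
```

Without the identifier quoting shown in `quote_ident`, labels such as `Total (EUR)` would produce invalid SQL in this sketch, which is one way original labels can make an export fail.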
Quick Reference
Performing an export
Step | Objective | Action | Landmark |
---|---|---|---|
1 | Access the GenericsData module | Click on the icon in the left menu of DataChain | |
2 | Access the list of DataBlocks | Click on the DataBlocks icon | |
4 | Find the DataBlock | Use the available search functions | |
5 | Access the Export function | Click on the export icon | |
6 | Export popup | Enter the export parameters | |
7 | Carry out the export | Click on the Export button | |