Datasets
Overview
The Datasets functionality in Nextflow Tower lets you store CSV- and TSV-formatted files in a workspace and use them as input to one or more pipelines.
Note
This feature is available only in organization workspaces.
Creating a new Dataset
To create a new dataset, follow these steps:
-
Open the Datasets tab in your organization workspace.
-
Click the New dataset button to open the dataset creation dialog.
-
Fill in the Name and Description fields to reflect the contents of your dataset.
-
Add the dataset file to your workspace either by drag and drop or through the system file explorer dialog.
-
If the first row of the file contains the column names, select the First row as header option to customize the subsequent views of the dataset accordingly.
Warning
The size of the dataset file cannot exceed 10 MB.
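The constraints above (CSV or TSV only, at most 10 MB) can be checked locally before uploading. The following is a minimal sketch, not part of Tower itself: the function name `check_dataset_file` is hypothetical, the format is inferred from the file extension, and "10 MB" is assumed to mean 10 * 1024 * 1024 bytes, which may differ from how Tower counts it.

```python
from pathlib import Path

# Assumption: "10 MB" is read as 10 * 1024 * 1024 bytes; Tower may count it differently.
MAX_DATASET_BYTES = 10 * 1024 * 1024
# Assumption: the format is inferred from the file extension.
ALLOWED_SUFFIXES = {".csv", ".tsv"}


def check_dataset_file(path):
    """Return a list of problems found; an empty list means no problem was detected."""
    p = Path(path)
    problems = []
    if p.suffix.lower() not in ALLOWED_SUFFIXES:
        problems.append(f"unsupported extension: {p.suffix or '(none)'}")
    size = p.stat().st_size
    if size > MAX_DATASET_BYTES:
        problems.append(
            f"file is {size} bytes, above the {MAX_DATASET_BYTES}-byte limit"
        )
    return problems
```

Calling `check_dataset_file("samplesheet.csv")` on an existing file returns an empty list when neither check fails; the file name here is only an illustration.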
Dataset versions
The Datasets functionality can accommodate multiple versions of a dataset. To add a new version of a dataset, follow these steps:
-
Click on the Edit option for the intended dataset.
-
In the Edit dialog, click on the Add a new version button.
-
Upload the newer version of the dataset and click on Update.
Warning
All subsequent versions of a dataset must be in the same data format as the initial version of the dataset.
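The format rule can be illustrated with a small local check run before uploading a new version. This is a sketch under stated assumptions: the helper names `dataset_format` and `same_format` are hypothetical, and the format is inferred from the file extension, which may not be how Tower itself determines it.

```python
from pathlib import Path

# Hypothetical mapping from file extension to dataset format.
FORMATS = {".csv": "csv", ".tsv": "tsv"}


def dataset_format(filename):
    """Infer the format from the extension; raise ValueError if it is neither CSV nor TSV."""
    suffix = Path(filename).suffix.lower()
    if suffix not in FORMATS:
        raise ValueError(f"not a CSV or TSV file: {filename}")
    return FORMATS[suffix]


def same_format(initial_version, new_version):
    """Return True when both file names map to the same dataset format."""
    return dataset_format(initial_version) == dataset_format(new_version)
```

With these helpers, `same_format("samples_v1.csv", "samples_v2.csv")` is true, while pairing a `.csv` file with a `.tsv` file is false.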
Using a Dataset
To use a dataset with the saved pipelines in your workspace, follow these steps:
-
From the Launchpad, open any pipeline that contains a pipeline schema.
-
Click on the input field for the pipeline, removing any default value.
-
Select the appropriate dataset for your pipeline from the dropdown menu.
Warning
The datasets shown in the dropdown menu depend on the validation specified in your pipeline schema. If the schema specifies only the CSV format, no TSV dataset appears in the dropdown.
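The filtering behaviour can be pictured with a short sketch. It assumes the validation is expressed as a `mimetype` on the input parameter, in the style of a `nextflow_schema.json` entry; the schema fragment, the dataset records, and the function `selectable_datasets` are illustrative stand-ins, not Tower's actual implementation.

```python
# Assumption: a trimmed-down schema entry in the style of nextflow_schema.json,
# where the input parameter declares the accepted MIME type.
schema_input = {
    "type": "string",
    "format": "file-path",
    "mimetype": "text/csv",
}

# Hypothetical datasets in a workspace, each tagged with its MIME type.
datasets = [
    {"name": "samplesheet_a", "mimetype": "text/csv"},
    {"name": "samplesheet_b", "mimetype": "text/tab-separated-values"},
]


def selectable_datasets(schema_entry, available):
    """Return the names of datasets whose MIME type matches the schema entry."""
    wanted = schema_entry.get("mimetype")
    return [d["name"] for d in available if d["mimetype"] == wanted]


print(selectable_datasets(schema_input, datasets))  # ['samplesheet_a']
```

In this sketch, only the CSV dataset is offered because the schema entry asks for `text/csv`; changing the entry to `text/tab-separated-values` would offer only the TSV dataset.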