To access it, first click on "Applications" in the left pane's menu.
Then click on "Dataset Processing and Visualization".
A new dashboard opens. It is very similar to the first one (seen in the dataset application), except that here the data are filtered by creation date and size.
Wherever on the platform we retrieve a dataset, clicking on the processor's cog icon leads to the Dataset Processor console.
The Dataset Processor allows us to sample, filter, and download datasets.
Flaws in a 'dirty' dataset are highlighted; here, for example, missing items are shown in red. To deal with these impurities, we can choose the kind of processing we want.
Here, our strategy is to systematically drop rows (and columns) with missing values. To do so, we select the corresponding processor type, then choose "Drop" as the "Strategy".
Click on the + button to add a processing step.
When asked to select a processor, scroll down to 'Handle missing values'.
Click on it.
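For readers who prefer to see the idea in code, here is a minimal sketch of what a "Drop" strategy for missing values amounts to. It is plain Python and is not SmartPredict's implementation: the sample rows, the `Length` column, and the `drop_missing` helper are hypothetical, and we assume a missing item is represented by `None`. Only the row-wise case is shown.

```python
# Hypothetical sample data; None stands for a missing item
# (the cells the Dataset Processor would show in red).
rows = [
    {"Variety": "Setosa", "Length": 5.1},
    {"Variety": None, "Length": 4.9},
    {"Variety": "Virginica", "Length": None},
    {"Variety": "Versicolor", "Length": 6.4},
]


def drop_missing(rows):
    """Keep only the rows in which every field has a value."""
    return [
        row for row in rows
        if all(value is not None for value in row.values())
    ]


cleaned = drop_missing(rows)
print(cleaned)
```

Only the rows with no missing field remain; the original list is left untouched, so the step can be undone simply by discarding `cleaned`.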
🃏 Let us also, for instance, add a sorting step. We repeat the same steps as for 'Handle missing values', except that this time we choose 'Sort' instead.
So, click on the + button to add a processing step.
Scroll down to 'Sort' and select it.
When choosing the column, select 'Variety', since it is the unsorted one.
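As a rough illustration of the 'Sort' step, the sketch below orders a small table by its 'Variety' column in plain Python. It is not the platform's own code: the sample rows, the `Length` column, and the `sort_by_column` helper are hypothetical, and we assume an ascending alphabetical order.

```python
# Hypothetical sample data with the 'Variety' column out of order.
rows = [
    {"Variety": "Virginica", "Length": 6.3},
    {"Variety": "Setosa", "Length": 5.1},
    {"Variety": "Versicolor", "Length": 6.4},
]


def sort_by_column(rows, column):
    """Return a new list of rows ordered by the given column (ascending)."""
    return sorted(rows, key=lambda row: row[column])


sorted_rows = sort_by_column(rows, "Variety")
for row in sorted_rows:
    print(row["Variety"], row["Length"])
```

Because `sorted` is stable, rows that share the same 'Variety' keep their original relative order.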
The table now reflects the steps we added, and the quality of our data has visibly improved.
The processing pipeline pane appears on the right side of the Dataset Processor. It can be folded and unfolded, and it contains its own set of functions.
Once we have finished the cleaning operations, we can use the new dataset in our processing pipeline, which is listed in the right sidebar next to the Dataset Processor's dashboard.
Look for the export icon (third icon from the left) in the menu below the right pane. Its tooltip reads "Export processing pipeline to SmartPredict".
Click on it.
Then, click on "Export".
Now let us go back to our workspace: click the flowchart icon on the left to switch to the project view. In the right sidebar, clicking the third button, named 'Processing pipelines', displays the pipeline we have just configured.
Just like any other module, we can now drag and drop it into the layout to attach it to the flowchart.