# Overview

This processor provides descriptive summaries (mean, variance, amount of non-zero values, total number of values) for all of the input Data numeric columns.

# Motivation

__Workflows__** **in ONE DATA enable the user to exploit and work with large Data, but to be able to extract Value from this data the user must have a general overview of it.

It would be very helpful and time saving to have an effective tool which can present descriptive summaries of the Data, and that is the functionality of this processor.

# Configuration

This processor does not need configuration.

What is important thing is to provide the processor with input __Data__** **which contains **at least** one numerical column.

The data can come from a __Dataset Load__, a Data Table Load or a __Custom Input Table__.

# Output

This processor generates short summaries for each numeric variable in the dataset including its column names, mean values, variances, number of non-zero values and the number of rows in the dataset.

So the output looks as follows:

Column Name:the list of all numerical columns from input data | Mean:the mean value of the column of interest | Variance:How values vary around the Mean | Non Zero:total of non zero values | Total:Total number of values |

# Example

In this example, the Column Summary Processor will be used on a toy Dataset generated by a Custom Input Table:

# Related Articles

Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Feedback sent

We appreciate your effort and will try to fix the article