[Data Visualization] Distributions

Modified on Mon, 04 Apr 2022 at 03:56 PM

Boxplot Chart

Overview

Boxplots are a standardized way of displaying the distribution of data based on a five number summary (minimum, first quartile (Q1), median, third quartile (Q3), and maximum). They give a good indication of how the values in the data are spread out.


Boxplot displayed Vertically


Boxplot displayed horizontally


Steps to follow

  1. Create a workflow with a Result Table processor or a Filterable Result Table (FRT)
  2. Create a new report for the workflow
  3. In the report add a new Container by clicking "Add Report Container" or by double-clicking in the empty canvas.
  4. Select a processor with the data that should be visualized
  5. Switch to edit mode
  6. Open the chart menu and change the type Boxplot chart
  7. Set the mandatory fields (Group and value columns)


Check the full Demo under Attachements


Configuration

1. Data Manipulation

    Filter shown data. It's possible to add a filter for each column.


2. Setting based on

    Choose between Raw Data and Precalculated Data (different configurations).


3. Group

    Select a column to group by.


4. Value Columns

    Select the value column to be plotted.


5. Outliers

    Choose whether to display outliers in the Boxplot


6. Display

    Choose whether to display the Boxplot horizontally or vertically.  


7. Axis format

    Format the x-Axis and the y-Axis values (can also be set to invisible)


8. Axis Labels

    Set and format the group axis and the value column axis labels


9. Colors

    Change the chart's colors


10. Format Data

    Used to alter one or multiple columns without changing the corresponding data. Add new Format Groups, select the format and add the target columns.


11. General Chart Container Settings

    Change the background color and and choose whether to display the container title.



Note that for choosing the option precalculated data in the second configuration field, all boxplot necessary values (Minimum, Maximum, Median, 1st Quartile, 3rd Quartile and Mean) need to be at hand. It's possible to provide such information using the Heuristic Summaries processor.


Histogram

Overview

A histogram is a plot that lets you discover, and show, the underlying frequency distribution (shape) of a set of continuous data. This allows the inspection of the data for its underlying distribution (e.g., normal distribution), outliers, skewness, etc.


Histogram showing the number of likes for certain Youtube videos


Steps to follow

  1. Create a workflow with a Result Table processor or a Filterable Result Table (FRT)
  2. Create a new report for the workflow
  3. In the report add a new Container by clicking "Add Report Container" or by double-clicking in the empty canvas.
  4. Select a processor with the data that should be visualized
  5. Switch to edit mode
  6. Open the chart menu and change the type Histogram
  7. Set the mandatory field (x-Axis)


Check the full Demo under Attachements


Configuration


1. Data Manipulation

    Filter shown data. It's possible to add a filter for each column.


2. x-Axis

    Choose the x-Axis (numeric column)


3. Interval Width

    Type the grouping interval width for the x-Axis.


4. Axis Labels

    Set and format the x-Axis.


8. Format

    Format the x-Axis and the y-Axis values (can also be set to invisible)


6. Format Data

    Used to alter one or multiple columns without changing the corresponding data. Add new Format Groups, select the format and add the target columns.


7. General Chart Container Settings

    Change the background color and and choose whether to display the container title.






Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select atleast one of the reasons

Feedback sent

We appreciate your effort and will try to fix the article