2D Scatterplots with Box Plots - Advanced Tab

Graphical Analytic Techniques

The Advanced tab of the 2D Scatterplots with Box Plots Startup Panel contains a variety of options for both the scatterplot and the box plots.

Variables. Click the Variables button to display a standard variable selection dialog box, in which you select the X and Y variables to be plotted in the scatterplot graph. The selections made will be displayed below the Variables button. A box plot is also produced for each variable (the X variable box plot will be placed above the scatterplot while the Y variable box plot will be placed to the right of the scatterplot in the compound graph display).

Scatterplot fit. You can fit an equation to the points in the line plots by selecting one of the predefined functions in this box.

Graph type. Select the type of box plot to be plotted from the Graph type list. Click the desired plot link listed below for a brief description of that type of box plot.

Graph icon. The graph icon represents the currently selected Graph type and the selection of values for the Middle point, Box, and Whiskers. The graph icon previews these three selections and the specific statistics that define the current box plot.

Outliers. You can elect to display none (select Off), only Outliers, only Extreme values, or both Outliers & Extremes in the box plot, as well as assign a Coefficient. For more details, see Outliers and Extremes.

Middle point. The options in this group box control the type of value and appearance of the middle point.

Value. The middle point can be either the Mean or Median of the selected variables. The options available for the Box and Whiskers depend on your selection here.

Style. In the Style list, specify how the middle point is represented (by a Line or Point).

Box. If you selected the Median as the middle point, the range (box) can be represented by Percentiles or the Min-Max values of the selected variable, or a specified Constant value (when you want a fixed size box around the medians).

If you selected Mean as the mid-point, the range (box) can be defined in terms of standard deviations (Std. Dev), standard errors (Std. Error), Min-Max values of the selected variable, or a specified Constant value (when you want a fixed size box around the means).

You can also specify a Coefficient, by which the selected range value is multiplied (by default, the Coefficient is 1). Note that, except for unusual applications, the default value of the coefficient (1) should not be changed if the box Value is Min-Max. If Median is specified as the Middle point, and Percentiles is specified as the Value, the Coefficient entered must be between 0.01 - 50.0. If Mean is specified as the Middle point, and Conf. Interval is specified as the Value, the Coefficient entered must be between 0.15 - 0.9999.

Whisker. If you select Median as the middle point, the range (whiskers) can be represented by Percentiles or the Min-Max values of the selected variable, a specified Constant value, or Non-outlier range (see Outliers and extremes).

If you select Mean as the mid-point, the range (whiskers) can be defined in terms of standard deviations (Std. Dev), standard errors (Std. Error), or Min-Max values of the selected variable, or Non-Outlier Range.

If you select Non-Outlier Range, STATISTICA determines which points in the data set are outliers (see Outliers and Extremes), and then uses the highest and lowest data points which are closest to the outliers (but are not outliers) as the whiskers in the plot.

You can also specify a Coefficient by which the selected range value will be multiplied (by default, the Coefficient is 1). In most typical applications the coefficient should be set to 1 when the value of the whisker is Min-Max or Non-Outlier Range.

Box plot size. Use the options in this box to adjust the size of the component box plot with respect to the size of the graph window (e.g., 30% of the graph window size). Note that the size of the scatterplot will adjust accordingly.

Scatterplot statistics. You can include a variety of statistics as footnotes in the graph by selecting one or more of the check boxes in this group box.

R square. Select the R square check box to include the R-square for the plotted variables.

Correlation and p. Select the Correlation and p check box to include the correlation for the plotted variables and its corresponding p-value.

Regression equation. Select the Regression equation check box to include the regression equation in the plot.

Display raw data. Select this check box to display the raw data points.

Jitter. Use the options in this group box to jitter the data points, i.e. modify the original position of the data point from the center of the graph in order to more easily identify/brush overlapping points.

Off. If you select Off, no jitter is applied to the raw data points, outliers, and extremes.

Sequential. If you select Sequential, the jitter is applied sequentially to the raw data points, outliers, and extremes. The jitter is applied such that the first case in the data set is maximally shifted to the left and the last case is shifted maximally to the right.

Random. If you select Random, the data point is randomly shifted within the available range.  

Width. With this option, you can specify the maximum jitter width defined as percentage of box width. Possible percentages range from 0 to 250.