Data Mining ... Quick Linear Models Project

Ribbon bar. Select the Data Mining tab. In the Tools group, click Workspaces. From the General Classifier (Trees and Clusters) submenu or the General Modeler and Multivariate Explorer submenu, select Quick Linear Models Project...

Classic menus. From the Data Mining Data Mining - Workspaces Data Miner - General Classifier (Trees and Clusters) submenu or the Data Mining Data Mining - Workspaces Data Miner - General Modeler and Multivariate Explorer submenu, select Quick Linear Models Project ...

...to display a project workspace (template) with several prearranged nodes for fitting a linear model to a regression problem (with a continuous dependent variable), and for automatically generating deployment information (for predicting new observations). This project can be further augmented to include any of the nodes available in the Regression Modeling and Multivariate Exploration folder of the Node Browser (for fitting models and automatically generating deployment information), or via other nodes available in STATISTICA Data Miner. See Data Mining with STATISTICA, in particular the topic Structure and User Interface of STATISTICA Data Miner for additional details; see also the STATISTICA Data Miner Example 3: Predictive Data Mining and Deployment for a Continuous Output Variable).

To use the prearranged project as-is, simply select a data source, specify the continuous dependent variable and the continuous and/or categorical predictor variables, connect the input data to the node labeled Split Input Data into Training and Testing Samples, and Run the project. Then, to predict new values based on the fitted linear model, connect a data file with new observations to the project, select the same variables as before (even if no valid values are available for the continuous dependent variable), mark the input data as data for deployment (select option Data for deployment project; do not re-estimate models on the Select Dependent Variables and Predictors dialog), connect the new input data source to the node labeled Compute Best Prediction From all Models, and then Run that node or update (Run) the entire project. For a step-by-step example of this process, see Example 3: Predictive Data Mining and Deployment for a Continuous Output Variable.

See also, Data Mining with STATISTICA Data Miner.