Published online by Cambridge University Press: 03 August 2017
The Decision Process of Modeling
Given a data set or a design for collecting data, it is the task of the data analyst to match the data to an appropriate model. The selection of an appropriate model depends on a number of factors, including the goals or intentions of the study, properties of the data collected, and the nature of the conclusions the analyst would like to make. In most cases many models can be deemed appropriate for one data set, and the analyst must select one or many appropriate models to address the goals of the study. The data analyst cannot focus on “right” or “wrong” models, but must instead balance the relative strengths and weaknesses of different modeling choices. The analyst must also consider the availability of computing resources, interpretability of results, and the ability of the analyst herself.
Very generally, the data analyst must consider the specific goals or questions that need to be addressed by the study, including whether there is an interest in evaluating model effects, in making predictions using the model, or both. The analyst must also consider the nature of predictors for the analysis, including whether the predictors are continuous or categorical, whether interactions between predictors should be considered, whether any predictors should be considered as a source of random variation, whether any predictors present as time-dependent, and so on. Perhaps most relevant to the discussions from this handbook, the data analyst must consider the nature of the data collected, including whether the outcome of interest is continuous, skewed, categorical, longitudinal, or otherwise. Figure 10.1 shows some very general properties of the response of interest that must be determined by the data analyst when matching data to an appropriate model. First, the analyst must determine the type of data representing the outcome of interest which is represented by the top node, “outcome variable.” Exploratory data analyses corroborate the choice of the three options for the outcome variable given in the second tier of nodes.
To save this book to your Kindle, first ensure no-reply@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Find out more about the Kindle Personal Document Service.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.