ME
|
PODMODELING
TA B L E 1
Fitted linear models
Model y =Material Fitted models
x Combo –1.1176+ 15.2382x --
x2 +x Combo 6.6539+ 43.9802x +10.6919 x2 -
x A subset 0.2897+ 10.2467x --
x2 +x A subset 5.516+ 20.912x +4.922x2 -
x B subset –2.525+ 20.230x --
x2 +x B subset 7.792+ 41.286x +10.199x2 -
x +m +xm Both 0.2897+ 10.2467x –2.8146m +9.983mx
x2 +x +m +mx +mx2 Both 5.5155+ 29.5736x +2.2768m 28.813mx
-+6.9602 x2 +7.4635mx2 --
Note: Models with material “Combo” combined the data from materials A and B, since material was not included as a model parameter. Models
with material “A subset” or “B subset” were fit with data from only one material. Models with material “Both” were fit using all the data and included
material (m) as a model parameter, as defined in Equation 21.
TA B L E 2
Comparison of fitted linear models
Model y =Material Residual DoF DoF –2LL AIC BIC ~R2 ~Adjusted R2
x Combo 97 3 457.40 463.40 471.22 0.77 0.77
x2 +x Combo 96 4 434.91 442.91 453.33 0.82 0.82
x A subset 47 3 148.41 154.41 160.15 0.88 0.88
x2 +x A subset 46 4 120.72 128.72 136.37 0.93 0.93
x B subset 47 3 204.09 210.09 215.82 0.91 0.91
x2 +x B subset 46 4 158.31 166.31 173.95 0.96 0.96
x +m +xm Both 95 5 285.93 377.26 390.28 0.91 0.91
x2 +x +m +mx +mx2 Both 93 7 367.26 299.93 318.17 0.96 0.96
x3 + +mx3 Both 91 9 285.63 303.63 327.08 0.96 0.96
Note: All models were significant (P-value 0.001). Smaller values of –2 log likelihood (–2LL), Akaike information criterion (AIC), and Bayesian
information criterion (BIC) indicate better model fits.
TA B L E 3
Likelihood ratio test (LRT) results
Model 1 Model 2 Material DoF difference Deviance
x x2 +x Combo 1 22.49204
x x2 +x A 1 27.68873
x x2 +x B 1 45.78020
x x +m +xm Combo 2 90.14749
x +m +xm x2 +x +m +mx +mx2 Combo 2 81.32474
x2 +x x2 +x +m +mx +mx2 Combo 3 148.9802
Note: Each x2 test revealed the models were statistically significantly different (P-value 0.001), in that Model 2 fit the data better.
66
M AT E R I A L S E V A L U AT I O N A U G U S T 2 0 2 5
approximate the R2 and the adjusted R2 (called ​​ a​ which is
adjusted to penalize for unnecessary variables. For example, ​​
R​a​​​ =0.77 indicates that 77% of response variability (​ can be
explained by the model.
When fitting a model, it is important to balance the
number of model parameters to estimate (​ with the number
of observations (​ often referred to as “the curse of dimen-
sionality.” In practice, the goal is to estimate parameters that
improve the model’s ability to represent the data (avoiding
underfitting), but not estimate parameters that represent noise
within the data (avoiding overfitting). The number of model
parameters to fit are described by the degrees of freedom
(DoF). The residual DoF represent the number of observations
that are free to vary, where residual DoF =number of observa-
tions DoF. These residual DoF are important for estimating
the residuals, and residuals are a description of how much the
fitted model differs from the response (​
This discussion of residual DoF highlights a difference
between models fit on a subset of the data versus models fit on
all the data, with a categorical variable included. For example,
using Table 2, consider the =x​ model for Material A, which
has DoF =3 (with three parameters ​​ 0​​​ ​​ 1​​​ and ​​ ε​ but only
50 observations, so the residual DoF =47. In comparison,
the =x +m +xm​ model has DoF =5 (with five parame-
ters: ​​ 0​​​ ​​ 1​​​ ​​ 2​​​ ​​ 3​​​ and ​​ ε​ but it is fit on the entire dataset
of 100 observations, so the residual DoF =95. Thus, if is a
significant variable, then the =x +m +xm​ model is better
than the =x​ subset models in describing the residuals.
When fitting a linear model, several hypothesis tests are
performed. For this application, P-values are compared against
α =0.05​ which is a measure of the Type 1 error: rejecting the
null hypothesis when it is actually true. When testing for sig-
nificance on the subset models, each is compared to =0.05​.
For two subsets (like Materials A and B), the allowable Type 1
error is α =0.1​ rather than =0.05​ for the model that
includes material as a variable.
The likelihood ratio test (LRT) is also useful for comparing
models, as it tests whether two nested models are statistically
significantly different from each other. Models are nested if
one has fewer terms than the other. For example, model =x​
is nested inside model = x​​ 2​ +x​ If the test returns a P-value
0.05, the models are not significantly different, and the addi-
tional variables in the larger model are not needed. Table 3
shows the LRT results for all nested models, and in all cases,
the additional terms are useful because they result in signifi-
cantly different likelihoods (P-values 0.001).
When using all the data, the worst model is =x​ with
AIC =463.40. Adding the ​​ 2​​ term (​ = x​​ 2​ +x​ leads to a
smaller AIC of 442.91, and adding the material term reduces it
further to 377.26. The best model for all the data includes the
most variables, with the form = x​​ 2​ +x +m +mx +m x​​ 2​​ and
an AIC of 299.93. When modeling each material separately
TA B L E 4
POD mean and variance terms for each fitted linear model, with statical tests of whether the linear model
assumptions of normality and constant variance are met
Model y =Material ​​​ˆ​​​​ pod 𝛔​x​​​ ​​​ˆ​​​​ pod Normality P-value Constant variance
P-value
x Combo 0.27 2.38 0.156 0.3316 0.001*
x A subset 0.26 1.07 0.104 0.3894 0.3204
x B subset 0.27 1.86 0.092 0.3270 0.9688
x +m +xm A 0.26 1.52 0.148 0.1785 0.3100
x +m +xm B 0.27 1.52 0.075 0.1785 0.3100
*P-values 0.05 indicate that the assumption is violated.
TA B L E 5
POD mean and variance terms for each fitted linear model, with statical tests of whether the linear model
assumptions of normality and constant variance are met
Model y =Material ​ˆ​​​​ pod 𝛔x​​​ ​​​ˆ​​​​ pod Normality P-value Constant variance
P-value
x2 +x Combo 3 2.13 3.02 0.5652 0.001*
x2 +x A subset 3 0.81 1.15 0.5747 0.0177*
x2 +x B subset 3 1.18 1.67 0.6543 0.7282
x2 +x +m +mx +mx2 Both 3 1.01 2.03 0.3316 0.3972
Note: Values in this table are in terms of z =f(x) (for some function g), not x, as described in Section 3.1.
*P-values 0.05 indicate that the assumption is violated.
A U G U S T 2 0 2 5 M AT E R I A L S E V A L U AT I O N 67
Previous Page Next Page