the results. Simulation 2 (Section 4.3) is a large Monte Carlo
study that compares how the distribution of critical values can
change as the model complexity increases. It illustrates how
using an overly simple model may lead to biased estimates
of 90 and 90/95 Code for simulating the data, fitting the
models, and estimating POD is available at https://github.com/
christieknott/Multivariate-POD. The provided code uses the
R programming language Appendix B provides some import-
ant information about how to interpret the variance-related
outputs from R and SAS software.
4.1. POD Experimental Study: The Inspiration for
This Work
The US Air Force Research Laboratory performed an in-house
study of multilayer metal plates with bolt holes, using bolt-hole
eddy current (BHEC) as the nondestructive inspection method.
Four-layer metal plates were stacked and clamped together,
aligning the six bolt holes in the plates. The metal plates were
made of three different materials. The specimens contained both
fatigue cracks and notches propagating from the bolt holes, and
these defects had a range of realistic sizes. The fatigue cracks
were either corner cracks or mid-bore cracks [16].
Analysis of variance (ANOVA) tests revealed that crack/
notch size ( was not the only variable related to the eddy
current response. Material type ( A B and C was a signif-
icant factor. Additionally, if Material A was in the layer below
the actively scanned layer ( A the response changed. Cracks
( and notches ( showed different responses, too. There
were also interactions between some of the variables [16].
Varying across these significant variables led to eight dif-
ferent combinations. Within the study, three types of models
were fit:
1. One “collapsed” model, which ignored all the variables
except defect size.
2. Eight “by case” models, each run on a subset of the data.
3. A “multiple” linear model that included all the significant
variables.
Using Akaike information criterion (AIC) and Bayesian
information criterion (BIC), the “multiple” linear model [22]
was the best fit to the data, and its form was:
ˆ =0.6254 +7.3595x +1.9627 mB +
1.3290 mC +1.4834d +0.9609 bA +0.8584xmB +
3.5873xmC − 3.5155 mBd − 0.9717 mCd − 1.1020dbA +
2.7590xmAd +14.7659xmBd − 0.4538xmCd −
0.6907xcbA − 0.6247xdbA.
The methods described in Section 3.1 were used to
estimate POD curves, and the results were very different when
comparing each modeling type. The hypothesis of whether
a more accurate linear model would yield a more accurate
POD curve could not be tested with experimental data, since
the true POD curve is unknown [16]. However, in a simula-
tion study, the true POD can be estimated from prior knowl-
edge (i.e., knowing the input function) and from Monte Carlo
sampling. Thus, Simulations 1 and 2 were conducted.
4.2. Simulation 1: Simple Example
A simulated dataset was created to represent an NDE system
whose response depends on both the material being inspected
(A or B) and the area of a discontinuity. This simulated case is
intended to loosely represent eddy current measurements cor-
related with the cross-sectional area of quarter-penny (corner)
discontinuities at edges, where varying responses are observed
for materials with different conductivity. The formulas for the
simulation model are given in Equation 42, where is the con-
tinuous length of the discontinuity (x ∈ 0.2,1) of which there
are 50 values, and is the random noise. A plot of the data is
given in Figure 1. The decision threshold was set at dec =3.
The variables 50×1 and 50×1 are column vectors of zeros and
ones, respectively, each of length 50.
(42) yA =10 x2 +2*150×1 +𝛆
yB =20x2 + 150×1 +𝛆
𝛆 ∼ Normal(µ = 050×1,σ2150×1 = 150×1)
Categorical variables need to use a coding scheme to
be included in the modeling. Equation 21 gives the material
coding schemes for the data in Figure 1. Since variables beyond
discontinuity size impact the signal response, the simple linear
model from Equation 2 is probably insufficient. Three different
approaches will be considered for this data:
Ñ By case: Divide the data by material and fit two separate
models (A only or B only)—50 observations each.
Ñ Collapsed: Ignore the effect of material (Combo)—100
observations.
Ñ Multiple: Extend the linear model to include discontinuity
size and material as variables (Both)—100 observations.
Within each of these approaches, a model with respect to
discontinuity size ( discontinuity area ( 2 and discontinuity
volume ( 3 is considered. When models include higher-order
ME
|
PODMODELING
20
15
10
5
0
0.00 0.25 0.50 0.75 1.00
x
Material A
Material B
Figure 1. Plot of the simulated data for each material.
64
M AT E R I A L S E V A L U AT I O N • A U G U S T 2 0 2 5
y
study that compares how the distribution of critical values can
change as the model complexity increases. It illustrates how
using an overly simple model may lead to biased estimates
of 90 and 90/95 Code for simulating the data, fitting the
models, and estimating POD is available at https://github.com/
christieknott/Multivariate-POD. The provided code uses the
R programming language Appendix B provides some import-
ant information about how to interpret the variance-related
outputs from R and SAS software.
4.1. POD Experimental Study: The Inspiration for
This Work
The US Air Force Research Laboratory performed an in-house
study of multilayer metal plates with bolt holes, using bolt-hole
eddy current (BHEC) as the nondestructive inspection method.
Four-layer metal plates were stacked and clamped together,
aligning the six bolt holes in the plates. The metal plates were
made of three different materials. The specimens contained both
fatigue cracks and notches propagating from the bolt holes, and
these defects had a range of realistic sizes. The fatigue cracks
were either corner cracks or mid-bore cracks [16].
Analysis of variance (ANOVA) tests revealed that crack/
notch size ( was not the only variable related to the eddy
current response. Material type ( A B and C was a signif-
icant factor. Additionally, if Material A was in the layer below
the actively scanned layer ( A the response changed. Cracks
( and notches ( showed different responses, too. There
were also interactions between some of the variables [16].
Varying across these significant variables led to eight dif-
ferent combinations. Within the study, three types of models
were fit:
1. One “collapsed” model, which ignored all the variables
except defect size.
2. Eight “by case” models, each run on a subset of the data.
3. A “multiple” linear model that included all the significant
variables.
Using Akaike information criterion (AIC) and Bayesian
information criterion (BIC), the “multiple” linear model [22]
was the best fit to the data, and its form was:
ˆ =0.6254 +7.3595x +1.9627 mB +
1.3290 mC +1.4834d +0.9609 bA +0.8584xmB +
3.5873xmC − 3.5155 mBd − 0.9717 mCd − 1.1020dbA +
2.7590xmAd +14.7659xmBd − 0.4538xmCd −
0.6907xcbA − 0.6247xdbA.
The methods described in Section 3.1 were used to
estimate POD curves, and the results were very different when
comparing each modeling type. The hypothesis of whether
a more accurate linear model would yield a more accurate
POD curve could not be tested with experimental data, since
the true POD curve is unknown [16]. However, in a simula-
tion study, the true POD can be estimated from prior knowl-
edge (i.e., knowing the input function) and from Monte Carlo
sampling. Thus, Simulations 1 and 2 were conducted.
4.2. Simulation 1: Simple Example
A simulated dataset was created to represent an NDE system
whose response depends on both the material being inspected
(A or B) and the area of a discontinuity. This simulated case is
intended to loosely represent eddy current measurements cor-
related with the cross-sectional area of quarter-penny (corner)
discontinuities at edges, where varying responses are observed
for materials with different conductivity. The formulas for the
simulation model are given in Equation 42, where is the con-
tinuous length of the discontinuity (x ∈ 0.2,1) of which there
are 50 values, and is the random noise. A plot of the data is
given in Figure 1. The decision threshold was set at dec =3.
The variables 50×1 and 50×1 are column vectors of zeros and
ones, respectively, each of length 50.
(42) yA =10 x2 +2*150×1 +𝛆
yB =20x2 + 150×1 +𝛆
𝛆 ∼ Normal(µ = 050×1,σ2150×1 = 150×1)
Categorical variables need to use a coding scheme to
be included in the modeling. Equation 21 gives the material
coding schemes for the data in Figure 1. Since variables beyond
discontinuity size impact the signal response, the simple linear
model from Equation 2 is probably insufficient. Three different
approaches will be considered for this data:
Ñ By case: Divide the data by material and fit two separate
models (A only or B only)—50 observations each.
Ñ Collapsed: Ignore the effect of material (Combo)—100
observations.
Ñ Multiple: Extend the linear model to include discontinuity
size and material as variables (Both)—100 observations.
Within each of these approaches, a model with respect to
discontinuity size ( discontinuity area ( 2 and discontinuity
volume ( 3 is considered. When models include higher-order
ME
|
PODMODELING
20
15
10
5
0
0.00 0.25 0.50 0.75 1.00
x
Material A
Material B
Figure 1. Plot of the simulated data for each material.
64
M AT E R I A L S E V A L U AT I O N • A U G U S T 2 0 2 5
y