Var[​​f(​​x)​​​]​​​​ is a function of values, and since there are
values of there are potentially values of ​​ ar[​​f(​​x)​​​]​​​​ To be
conservative, use the largest variance value, which occurs
at the largest discontinuity size: ​​​ max​​ =max(​​​x​i​​​)​​​​ Thus, ​​
σ​ε​​ =​​ ˆ 1​​​​​
2​ Var​[f(​x​max​​)​]​​ By selecting the ​​ max​​​ value, the resulting
POD curve will be shifted toward larger discontinuity sizes,
so the resulting ​​ 90/95​​​ is an upper bound, and an overesti-
mation of the true ​​ 90/95​​​ Therefore, in general terms, using
Equation 33, Equation 34 provides the new estimate of ​​​σ​pod​​​​: ˆ
(34)​ ​​ˆd​​​ po = ​ˆ​ε​​​
​ˆ 1​z​​​​​​
= ​1
​ˆ​​z​​​​​​​​1

_​ _______________
​ˆ β​1​z​​​​​​2​​Var​[f(​xmax​​)​]​​​​=​√
_____________
Var​[f(​xmax​​)​]​​​​​
The POD equation, with ​ˆ​​​ μ​pod y​dec​​​ and ​ˆ​​​​ σ​pod from
Equation 34, is given in Equation 35:
(35)​ POD(z)​ =Φ​ (​​
ˆ y​dec​​ _
​ˆ​εz​​​​​
)​​​ =Φ​ (​
z + (​ˆ​z​​​​​ 0 ydec​​)​ ​​ˆ​z​​​​​ 1 __________
​ˆ​εz​​​​​ ​​ˆ​z​​​​​ 1
)​ =Φ​ (​
z ​ˆd​​​o​p​ _
​ˆd​​​ σpo )​​
3.2.2. TRANSITION MATRIX
Of note is that ˆ σ​pod​​​/∂​ˆ​ β​1z​​​​​ =0​ so that specific term of the
U​ matrix has changed from the value in Equation 6. The last
row of the matrix must match the parameters in so given
the last row now consists of ˆ μ​pod​​​/∂ˆ​​​​​2​ ​​ ε =0​ and ˆ μ​pod​​​/∂ˆ​​​​​2​ ​​ ε =∂​√
___________Var​[f(​x​max​​)​]​​/∂​​ˆ​ β​1z​​​​​​​ 2​ Var​[f(​x​max​​)​]​​ let random variable
v =​β​1​z​​​​​​​2​Var​[f(​x​max​​)​]​​ ​​ ˆ then ​​√Var​[f(​x​max​​)​]​​
___________
=
_
v /​β​1​z​​​​​​​2​​​ ​​ ˆ Using this
change of variables, the derivative is given in Equation 36:
(36)​ ​ˆd​​​ po
​ˆε​​​​​2​
=
_____________
Var​[f(​xmax​​)​]​​ ___________
​​​ˆ​z​​​​​​​ 1
2​ Var​[f(​xmax​​)​]​ ​​
=​ _
v /​​​ˆ​z​​​​​​​ 1
2​​
_
v
= 1 _
2​ˆ​z​​​​​​√v​​​​​1​ _
=​________________ 1 2​​ˆ​z​​​​​​√
1
________________​​
​ˆ 1​z​​​​​​​
2​ Var​[f(​xmax​​)​]​​​
= 1
2​ˆ​z​​​​​​​ˆε​​​​​​​​1​
The new matrix is shown in Equation 37, and the new
matrix is in Equation 38. The variance in ​​ ε​ can be estimated
from the data to form the last entry in the matrix, and the
covariances involving ​​ ε​ are assumed to be zero because ​​ ε​
is independent of ​​​ 0​​​z​​​ and ​​​ 1​​​z​​​ Recall that the Delta method
establishes the variance of the POD curve as ​​ pod​​ = U​​T​VU​.
Matrices and are defined the same way as in the
standard methodology (see Equations 15 and 11) except that
they are calculated in -space rather than -space. Then, to cal-
culate ​​​ 90/95​​​z​​​ one can follow the steps in Equations 16–19.
3.2.3. CONVERTING CRITICAL VALUES FROM Z-SPACE TO X-SPACE
Following the standard equations, the final POD curve will be
expressed in terms of which is a function of For each value
of there is an associated POD. To convert back into -space,
a solver may be used to find the solutions to Equation 39 for
in terms of each value on the POD curve and its confidence
interval.
(39)​ f(x)​ =z​ ​​
β0​z​​​​ + β1z​​​​z=​β0z​​​​ + β​1z​​​​f(x)​​​​​
In the example, because = α​0​​ + α​1​​x + α​2​​​x​​ 2​​ the quadratic
formula provides two solutions for in terms of as given in
Equation 40. One of these solutions will provide useful results
for matching a discontinuity size to a probability value, while
the other solution will provide values that are unrealistic for
the problem.
(40)​ β​0z​​​​ + β​1z​​​​z = β0​z​​​​ + β1​z​​​​​(​α0​​ + α1​​x + α2​​​x​​2​)​​​
0 = β​0z​​​​ + β​0z​​​​ + β​1z​​​​​(​α​0​​ + α​1​​x + α​2​​​x​​2​)​ z​
= (​β0​z​​​​ β1​z​​​​​α0​​ z)​ + (​β1​z​​​​​α​1​​)​x + (​β1z​​​​​α​2​​)​​x​​2​​​​​
x = (​β1​z​​​​_____________________________​​)z−​​​0α​​​​​​z​​1β​−​​​​z​​0​β​(​​​​2α​​​​​​z​​1β​4−​​2​1α​​​​​​z2​​1β√​±​)​​​1α​​​_
_____________________________​
2 β1​z​​​​​α2​​
One thing to note is that when converting from back to
x​ the resulting curve may contain POD​ (that is, one minus
the probability of detection) instead. This occurs because the
POD is a cumulative distribution function (​ DF​ which is
defined for both and by Equation 41:
(41)​ = FZ​​​(Y)​ =P(Y y)​​ ​​CDF​Z​​​(Y)​
=P​(α0​​ + α1​​x + α​2​​​x​​2​ β0​z​​​​ + β1​z​​​​z)​​​​
=P​((​α​0​​ β​0z​​​​ β​1z​​​​z)​ + α1​​x + α​2​​​x​​2​ 0)​​
=P(​x1​​ X x2​​)​or P(​x2​​ X x1​​)​​​
Without loss of generality, assume (​x​1​​ X x​2​​)​​ Then,
the solutions to the DF​ are (X x​2​​)​​ or ​​ (​​x_1 X)​​ =
1 P(​​X x _1 )​​​​ (Note that the equality is maintained when
the inequality is switched, since is continuous.) This means
that estimating ​​ p/q​​​ may require p​ instead of and q​
instead of .
4. Results
In this section, the statistical methods are applied to two sim-
ulations. These simulations are useful to describe the direction
of bias (since you can simulate the “truth”), but real-world
data has many complexities that these simulations do not
capture. Section 4.1 describes the real-world POD data that
inspired this research, along with a discussion of how the
methods described in this paper were applied to that data
[16]. Simulation 1 (Section 4.2) illustrates fitting the more
complex models discussed in the Methods section, compar-
ing the quality of these different models and how to interpret
A U G U S T 2 0 2 5 M AT E R I A L S E V A L U AT I O N 63
the results. Simulation 2 (Section 4.3) is a large Monte Carlo
study that compares how the distribution of critical values can
change as the model complexity increases. It illustrates how
using an overly simple model may lead to biased estimates
of ​​ 90​​​ and ​​ 90/95​​​ Code for simulating the data, fitting the
models, and estimating POD is available at https://github.com/
christieknott/Multivariate-POD. The provided code uses the
R programming language Appendix B provides some import-
ant information about how to interpret the variance-related
outputs from R and SAS software.
4.1. POD Experimental Study: The Inspiration for
This Work
The US Air Force Research Laboratory performed an in-house
study of multilayer metal plates with bolt holes, using bolt-hole
eddy current (BHEC) as the nondestructive inspection method.
Four-layer metal plates were stacked and clamped together,
aligning the six bolt holes in the plates. The metal plates were
made of three different materials. The specimens contained both
fatigue cracks and notches propagating from the bolt holes, and
these defects had a range of realistic sizes. The fatigue cracks
were either corner cracks or mid-bore cracks [16].
Analysis of variance (ANOVA) tests revealed that crack/
notch size (​ was not the only variable related to the eddy
current response. Material type (​​ A​​​ ​​ B​​​ and ​​ C​​​ was a signif-
icant factor. Additionally, if Material A was in the layer below
the actively scanned layer (​​ A​​​ the response changed. Cracks
(​ and notches (​ showed different responses, too. There
were also interactions between some of the variables [16].
Varying across these significant variables led to eight dif-
ferent combinations. Within the study, three types of models
were fit:
1. One “collapsed” model, which ignored all the variables
except defect size.
2. Eight “by case” models, each run on a subset of the data.
3. A “multiple” linear model that included all the significant
variables.
Using Akaike information criterion (AIC) and Bayesian
information criterion (BIC), the “multiple” linear model [22]
was the best fit to the data, and its form was: ​​
ˆ =0.6254 +7.3595x +1.9627 m​B​​ +
1.3290 m​C​​ +1.4834d +0.9609 bA​​ +0.8584x​m​B​​ +
3.5873x​m​C​​ 3.5155 m​B​​d 0.9717 m​C​​d 1.1020d​bA​​ +
2.7590x​m​A​​d +14.7659x​m​B​​d 0.4538x​m​C​​d
0.6907xc​bA​​ 0.6247xd​bA​​​.​
The methods described in Section 3.1 were used to
estimate POD curves, and the results were very different when
comparing each modeling type. The hypothesis of whether
a more accurate linear model would yield a more accurate
POD curve could not be tested with experimental data, since
the true POD curve is unknown [16]. However, in a simula-
tion study, the true POD can be estimated from prior knowl-
edge (i.e., knowing the input function) and from Monte Carlo
sampling. Thus, Simulations 1 and 2 were conducted.
4.2. Simulation 1: Simple Example
A simulated dataset was created to represent an NDE system
whose response depends on both the material being inspected
(A or B) and the area of a discontinuity. This simulated case is
intended to loosely represent eddy current measurements cor-
related with the cross-sectional area of quarter-penny (corner)
discontinuities at edges, where varying responses are observed
for materials with different conductivity. The formulas for the
simulation model are given in Equation 42, where is the con-
tinuous length of the discontinuity ​​ (x 0.2,1)​​ of which there
are 50 values, and is the random noise. A plot of the data is
given in Figure 1. The decision threshold was set at ​​ dec​​ =3​.
The variables ​​ 50×1​​​ and ​​ 50×1​​​ are column vectors of zeros and
ones, respectively, each of length 50.
(42)​ yA​​ =10 x​​2​ +2*​1​50×1​​ +𝛆​ ​​
y​B​​ =20x2 + 1​50×1​​ +𝛆​
𝛆 Normal​(µ = 0​50×1​​,​σ​​2​​1​50×1​​ = 1​50×1​​)​​
Categorical variables need to use a coding scheme to
be included in the modeling. Equation 21 gives the material
coding schemes for the data in Figure 1. Since variables beyond
discontinuity size impact the signal response, the simple linear
model from Equation 2 is probably insufficient. Three different
approaches will be considered for this data:
Ñ By case: Divide the data by material and fit two separate
models (A only or B only)—50 observations each.
Ñ Collapsed: Ignore the effect of material (Combo)—100
observations.
Ñ Multiple: Extend the linear model to include discontinuity
size and material as variables (Both)—100 observations.
Within each of these approaches, a model with respect to
discontinuity size (​ discontinuity area (​​ 2​​ and discontinuity
volume (​​ 3​​ is considered. When models include higher-order
ME
|
PODMODELING
20
15
10
5
0
0.00 0.25 0.50 0.75 1.00
x
Material A
Material B
Figure 1. Plot of the simulated data for each material.
64
M AT E R I A L S E V A L U AT I O N A U G U S T 2 0 2 5
y
Previous Page Next Page