Generally, the response ​​ a vector of Box-Cox transformed
y​ values, is modeled as a function of β +ε​ using maximum
likelihood estimation. The method for estimating is described
in [16, 17, 19], and many commercial and open-source software
packages [21] can recommend an optimal by comparing the
sum of squared errors at various values across the range of
power transforms, stepping in increments (e.g., 0.1) between
both positive and negative integers, with the transformation at
a value of 0 representing the natural logarithm.
After fitting the model, the transformation should be
reversed to return to the original data units of the response, .
Once the value is estimated and applied to the response,
then is considered fixed. The residuals are an estimate of the
variability that the model cannot explain. When using a simple
linear model, residuals may be examined, or a lack-of-fit test
may be conducted to determine if additional variables beyond
discontinuity size are necessary [3, 17, 18].
If a transformation is applied to the same transforma-
tion should also be applied to the variable ​​ dec​​​ (which will
be described in Section 2.2), so that they have the same units
(say, ​​ dec​ For simplicity, the following equations will refer to ,
but if a transformation is necessary, ​​ should be substituted.
Lastly, if the data shows evidence of censoring, a modifica-
tion to the likelihood equation is necessary during maximum
likelihood estimation. Censoring often occurs in real-world
NDE measurements. Very large discontinuities may max out
the NDE system’s signals, providing a flat response at the
maximum detection limit, and thus providing a lower bound
on the response, rather than the true value. This is common
if the gain on a system is set high such that small discontinu-
ities are detectable, but large discontinuities may saturate the
detection system. Very small discontinuities that fall below the
detection threshold of the NDE system may return values of
zero, which is also a form of censoring.
2.2. Transforming the Linear Model into Probability
Assume a simple linear model has been fitted using and
(discontinuity sizes): ​​​y​1​​​ =​​ ˆ 0​​​ +​​ ˆ 1​​​ a​i​​​ Note that, for convention, ​​
a​i​​​ is now used in place of ​​ i​​​ in the simple linear model. Recall
that ​​ i​​​ is assumed to be normally distributed. A POD curve
consists of a standard normal cumulative distribution function
(CDF), denoted as the response. Therefore, the first step
requires a transformation from the normal distribution (with
estimated mean ​ˆ​​ and estimated residual error variance ​​​σ​ε​​​​​2​​)
to the standard normal distribution (with mean 0 and
variance 1), as shown in Equation 5:
(5)​ i​​​ = ​ˆ​ε​​​​Zi​​ + ˆ N​(ˆ​,​ˆε​​​​​2​)​​​⇒​Zi​​ = 1
​ˆε​​​​​​
ˆ i​​​ ˆ N​(μ =0,​σ2​ =1)​​
Before performing a POD evaluation, a predefined proce-
dure incorporating calibration and specific call criteria based
on the system’s noise floor is determined for the NDE system,
and this value is called the decision threshold, ​​ dec​​​ Therefore,
Equation 6 becomes the formula for the POD of a discontinuity
of size ​​ i​​​ [1–4]:
(6)​ POD(​ai​​)​ =Φ(​Zi​​)​ =Φ(​​ ˆ i​​​_​​ce​d​y​−
​ˆ​ε​​​
)​​
=Φ​ (​​
ai​​​ + (​​ˆ0​​​ ydec​​)​ ​​ˆ​1​​​ __________
​ˆε​​​ ​​ˆ1​​​
)​ =Φ​ (​​
ai_​​​do​p​​ˆ​−​​​
​ˆd​​​ σpo )​​
The final equation provides a new mean, ​ˆ​​​​ μ​pod and a new
variance, ​ˆ​​​​​2​​ σ​pod as shown in Equation 7:
(7)​ ​ˆd​​​ po = (​ˆ0​​​ y​dec​​)​ _
​ˆ​1​​​

,and​ ​​ ​ˆd​​​​​2​ po = (​​
​ˆ​ε​​​​
​ˆ​1​​​​)​​​
2​​
2.3. Estimating a90, the Discontinuity Size at 90% POD
By inverting Equation 6, the discontinuity size associated with
each probability can be calculated. The discontinuity size, ,
with POD ​​ %=POD(​​a)​​​​ is denoted ​​ p​​​ (see Equation 8),
where ​​​ ​−1​​(​​p/100​)​​ = z​p​​​​ is calculated using a standard normal
Z-table [1–4].
(8)​ ap​​ = Φ−1(POD(a)​)​​​ˆd​​​ po + ​ˆd​​​=​​​Φ−1(p)​​ˆd​​​ po σpo + ​ˆd​​​ po = zp​​​​ˆd​​​ po +​ˆd​​​​o​p​
Equation 9 shows how to find ​​ 90​​​ the discontinuity
size associated with 90% probability, which corresponds to
p =0.90​:
(9)​ a90​​ = z​0.9​​​ˆd​​​ po + ​ˆd​​​ po 1.2816ˆd​​​ ​​ po +​ˆd​​​​o​p​
For a simple linear regression model, Equations 8 and 9
simplify to Equation 10:
(10)​ ap​​ = zp​​​
(​​
​ˆε​​​ ​​
​ˆ​1​​​​)​
+ (​y_, dec​​ ​​ˆ0​​​)​
​ˆ​1​​​
and ​​ 90​​ = z​0.9​​​ (​​
​ˆ​ε​​​ ​ˆ​1​​​​)​ + (​ydec​​​ ​​ˆ0​​​)​​
_
​ˆ​1​​​

1.2816​ (​​
​ˆε​​​ ​​
​ˆ​1​​​​)​ + (​y_​)​​​​0​ˆ​​−​​ce​d
​ˆ​1​​​

2.4. Estimating a 95% Confidence Interval and the a90/95
Discontinuity Size
A confidence interval can be estimated for the linear model, in
terms of the three estimated model parameters: ​​β​0​​​​ ​​β​1​​​​ and ​​​σ​ε​​​​.
Estimates of the variances and the covariances of the param-
eters are also provided in the variance-covariance matrix (see
Equation 11) returned when fitting the linear model.
However, to calculate a confidence interval on the POD
curve, a variance-covariance matrix is needed in terms of the
parameters ​​​μ​pod​​​​ ˆ and ​ˆ​​​​ σ​pod as shown in Equation 12:
(12)​ Vpod​​ =V​(​​ˆd​​​,​ˆd​​​)​=​​​o​p​o​p [​
Var​(​​ˆd​​​)​​ po Cov​(​ˆd​​​,​ˆd​​​)​​op​​op​​
Cov​(​​ˆd​​​,​ˆd​​​)​​ po po Var​(ˆd​​​)​ ​​ po ]​​
A U G U S T 2 0 2 5 M AT E R I A L S E V A L U AT I O N 59
Equation 7 shows that ​​​μ​pod​​​​ ˆ and ​ˆ​​​​ σ​pod are functions of the
parameters ​​β​0​​​​ ​​β​1​​​​ and ​​​σ​ε​​​​ The Delta method [19] provides a
way to estimate these variances. For a differentiable function
with derivative ​​ (​​T​)​​​​ where ​​ [​​y]​​ =µ​​:
(13)​ Var​
[​​ g(y)​​
]​​

i=1​
k gi(​T)​​(μ)​​​2​
Var[​​yi​​​]​​​​​ ​​ 2​∑gi​(​T)​​​gj​(​​​T​)​​​
i j​
Cov[​​yi​​,​y​j​​​]​​​​
In this application, is the number of parameters, so here
k =3​ Equation 14 illustrates how the method can be used to
estimate Var[ˆ​​​​ ​​​μ​pod
(14)​ po =Var​ [ (​ ​ˆ​0​​​ ydec​​)​ _
​ˆ​1​​​

]​​​ ≈​ (​ˆd​​​)​ po
(​​ˆ​0​​​)​
Var​[​ˆ0​​​]​ + (ˆd​​​)​ ​​ po
(​ˆ1​​​)​
Var​[​​ˆ 1​​​ ]​​ ​Var​[​ˆd​​​]​
+ (​ˆd​​​)​ po
(​ˆε​​​)​
Var[​​ˆ​ε​​​]​ +2 (​ˆd​​​)​ po
(​​ˆ​0​​​)​
(ˆd​​​)​ ​​ po
(​​ˆ1​​​)​
Cov[​​ˆ 0​​​ ,​​ˆ1​​​]​​
+2 (ˆd​​​)​ ​​ po
(​ˆ0​​​)​
(​ˆd​​​)​ po
(​ˆε​​​)​
Cov[​​ˆ0​​​,​ˆ​ε​​​]​​ 2 (ˆd​​​)​ ​​ po
(​ˆ1​​​)​
(​ˆd​​​)​o​p​
(​ˆε​​​)​
Cov[​​ˆ1​​​,​ˆ​ε​​​]​​
=−​ (​​ˆ​1​​​​)​
1 Var[​ˆ0​​​]​ + (​​ˆ0​​​ ydec​​)​​ _
​ˆ​1​​​​​ 2​
Var[​​ˆ1​​​]​ +0​
2​ (​​ˆ1​​​​)​​​
1

(​ˆ0​​​ y​dec​​)​ _
​ˆ​1​​​​​ 2​
Cov[​​ˆ 0​​​ ,​​ˆ1​​​]​ +0 +0​
=​ (​​ˆ​
​1
1​​​​​
2​​
Var[​​ˆ0​​​]​ + ​1
​ˆ​1​​​​​ 2​​
​ˆd​​​​​2​Var[​​ˆ1​​​]​ po )​​
+ ​2
​ˆ​1​​​​​ 2​​​
​ˆd​​​Cov[​​ˆ0​​​,​​ˆ po 1​​​ ]​​
The derivatives ​​ (​​T​)​​​​ in Equation 13 can be written in a com-
plementary matrix format, The equations for ​​​μ​pod​​​​ ˆ and ​ˆ​​​​ σ​pod
(Equation 6) are needed to create the transition matrix, in
Equation 15, which contains the partial derivatives with respect
to each parameter [1–4].
Then, the matrices are useful in calculating the variance of
the POD curve, ​​​ pod​​​ (​​​ˆ​​​​​ μ​pod ​​​σ​pod)​​ ˆ = U​​T​V(​ˆ)​U​​ For a simple
linear model, the resulting matrix provides the terms shown in
Equation 16.
(16)​ Var​[​ˆd​​​]​ po =​ ​​​​1
​ˆ​1​​​​​ 2​​​ (​​
Var[​ˆ0​​​]​ +2​ˆd​​​Cov[​​ˆ0​​​,​​ˆ po 1​​​ ]​ +​​​ˆd​​​​​2​Var[​ˆ1​​​]​​ po
)​​​​
Var​[​ˆd​​​]​ po =​ ​​​​1
​ˆ​1​​​​​ 2​​​ (​​
Var​[​ˆε​​​]​ 2​​ˆd​​​Cov​[​​ˆ1​​​,​​ˆ​ε​​​]​ po +​​​ˆd​​​​​2​Var​[​ˆ1​​​]​​ po
)​​​​
​​ po po ​ˆ​1​​​​​ 2​​​
​ˆd​​​Cov[​​ˆ (​​ po 0​​​ ,​​ˆ1​​​]​ Cov[​ˆ 0​​​ , ​ˆε​​​]​​ ​Cov​[ˆd​​​,​ˆd​​​]​=​​​​​1
​​ˆd​​​Cov[​​ˆ1​​​,​​ˆ​ε​​​]​ po +​​ˆd​​​​​ˆd​​​Var[​ˆ po po 1​​​ ])​​​​
If ​​​σ​ε​​​​ is assumed to be independent of ​​β​0​​​​ and ​​β​1​​​​ then
Cov[​ˆ​​​,ˆ​​​]​ 0 ​​ ε =0​ and ov[​ˆ​​​,ˆ​​​]​ 1 ​​ ε =0​.
The POD curve assumes a standard normal distribution,
so the Wald confidence interval is appropriate. Using a Wald
confidence interval, the probability of detecting a disconti-
nuity with %​ probability and %​ confidence is provided by ​​
a​p/q​​​ in Equation 18. Note that ​​ a​​​p​​ the standard deviation
at the probability associated with discontinuity size ​​ p​​​ (see
Equation 17).
(17)​ σa​p​​​​ =​​
_____________________________________________
Var​(ˆd​​​)​ ​​ po +2 zp​​Cov​(ˆd​​​,​ˆd​​​)​ ​​ po po + z​p​​Var​(ˆd​​​)​​​o​p​​
(18)​ ap/q​​ = a​p​​ + zq​​​ˆ​p​​​​​​ a (​​ˆd​​​ po + zp​​​ˆd​​​)​ po + zq​​​σap​​​​​​​​​
Equation 19 gives ​​ 90/95​​​ which is defined as the disconti-
nuity size with 90% POD and 95% confidence:
(19)​ a90/95​​ =​​ˆd​​​ po + z0.9​​​​ˆd​​​ po + z0.95​​​σa90​​​​​​​​​
When performing the conversion from a linear model to a
probability curve, the variance of the POD curve is calculated
as ​​​ pod​​​ (​​​ˆ​​​​​ μ​pod ​​​σ​pod)​​ ˆ = U​​T​V(​ˆ)​U​​ where is the variance-
covariance matrix from the linear model fit, and is a matrix
of derivatives of the new ​​ pod​​​ and ​​ pod​​​ with respect to each
estimated parameter. Extending this result for a model that
estimates parameters—say, ​​ 0​​ , β​1​​,…,​β​p−2​​,​​σ​ε​​​​2​​—the matrix
has a size of × p​ and the matrix has a size of × p​ so ​​ pod​​​
will always have a final size of × 2​.
3. Methods: Alternatives to the Simple Linear
Model Setup
In step 1 of the POD process, a linear model is fit to the data,
usually the simple model shown in Equation 2. However, this
model only describes the effect that discontinuity size (​ has
on the signal response (​ Often, as confirmed by analysis of
variance (ANOVA), the NDE response varies with multiple
variables, not just discontinuity size, and these additional vari-
ables should be included in the linear model. For instance,
the NDE response may depend on the area of a discontinuity
rather than just its length, which may require including ​​ 2​​ as
a variable in the linear model. Equation 5 applies to any linear
model ​y​​ however, Equation 6 becomes more complex as more
variables are added, and all steps for calculating the POD curve
will require adjustment.
3.1. Variable Additions to the Simple Linear
Model Setup
The methodology defined in MIL-HDBK-1823A [4] does not
specify how to calculate POD for models beyond the simple
linear relationship between and but additional terms
are often needed. This section considers the complication of
moving from the simple linear model, = β​0​​ + β​1​​x​ to a model
with additional terms and interactions.
For example, the categorical variable of material type may
have a statistically significant relationship to the NDE response.
In that case, the indicator variable (Equation 21) may be
ME
|
PODMODELING
60
M AT E R I A L S E V A L U AT I O N A U G U S T 2 0 2 5
Previous Page Next Page