Skip to main content

Improved maximum likelihood method for P-S-N curve fitting method with small number specimens and application in T ... - Nature.com

Abstract

In fatigue data analysis, fitting accurate P-S-N curve is problematic if only a small number of specimen is available, especially to evaluate the relationship between the stress level and the standard deviation. This paper proposes a sample information reconstruction method that can effectively solve this problem. Based on this method and the life equivalent principle, a new maximum likelihood method (which is abbreviated to improved maximum likelihood method) is proposed for P-S-N curve fitting. T-joint specimens of Q450NQR1 steel were fabricated and tested, then the P-S-N curves was fitted by the improved maximum likelihood method, least square method, maximum likelihood method, standard BS7608 and standard IIW. Finally, P-S-N curves by three methods and two standards are compared and analyzed. The results show that the relevant parameters of the P-S-N curve with 99.9% survival probability fitted by the improved maximum likelihood method are similar to those in the two standards, and it is indicated that the improved maximum likelihood method is a better way for P-S-N curve fitting with the small number of fatigue test specimens.

Similar content being viewed by others

A versatile model for lifetime of a component under stress

A modification in Weibull parameters to achieve a more accurate probability distribution function in fatigue applications

The 3-component mixture of power distributions under Bayesian paradigm with application of life span of fatigue fracture

Introduction

P-S-N(probabilistic stress-life) curve is used to describe the relationship between stress level and the number of stress cycles to material fatigue failure for a specified survival probability1,2,3. It is a key indicator of the material's ability to resist fatigue damage and also the basis for structural fatigue reliability design and evaluation4,5,6,7. The basic methods used for P-S-N curve fitting include: the least-square method, Bayesian method, maximum likelihood method, Monte Carlo method, and modern intelligent algorithm8,9,10. In fact, each P-S-N curve is fitted with one of methods listed above or a combination of these methods9, 11,12,13.

According to sampling theory, the larger the sample, the higher the accuracy of the tests conducted on the sample. However, owing to high costs and long fatigue testing times, it is difficult to achieve sample sizes that satisfy sampling theory requirements, especially in the case of large-sized structural parts or expensive parts14,15,16. The maximum likelihood method can fit the P-S-N curve with a small sample17, 18, but the life means and standard deviations under each stress level must have been measured with high accuracy to achieve good results. Often, fatigue test data exhibit the following pattern19, 20: the samples have a small number of specimens in the middle life zone, there is generally a good linear relationship between the mean life and stress level in the logarithmic coordinate system, and the relationship between the life standard deviation and stress level is often not unique. Thus, it is often difficult to reflect the true-life distribution when the number of specimens is small.

In this paper, the sample information reconstruction and the improved maximum likelihood method are studied under small number specimens. Then fatigue test is carried out on T-joint specimens of Q450NQR1 steel, and the P-S-N curves were fitted by the method in this paper, least square method, maximum likelihood method, standard BS7608 and standard IIW. Finally, the P-S-N curves by three methods and two standards are compared and analyzed.

Theoretical background

Mathematical expression of S–N curve

The S–N curve represents the fatigue performance of a material. For ease of calculation, mathematical expressions are usually used to describe its laws, including the power function, exponential function, three-parameter, Basquin, and Weibull formulas21,22,23. The S–N curve for a metallic material can be expressed by the power function formula in middle life:

$$S^{m} N = C_{0}$$
(1)

where S is the stress level, N is the material life under stress level S, m and C can be determined by the means of a regression of formula (1), and are related to the material, stress ratio, loading mode, and so forth.

Taking the logarithm of both sides in Eq. (1) leads to:

$${\text{lg}}S = - \frac{1}{m}{\text{lg}}N + \frac{{{\text{lg}}C_{0} }}{m}$$
(2)

Letting \({\text{Y}} = {\text{lg}}S\), \({\text{X}} = {\text{lg}}N\), \({\text{A}} = - {\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 m}}\right.\kern-0pt} \!\lower0.7ex\hbox{$m$}}\), and \(B = {\raise0.7ex\hbox{${{\text{lg}}C_{0} }$} \!\mathord{\left/ {\vphantom {{{\text{lg}}C_{0} } m}}\right.\kern-0pt} \!\lower0.7ex\hbox{$m$}}\), Eq. (2) can be rewritten as:

$$Y = AX + B$$
(3)

Equation (3) shows that it is a linear relationship between the fatigue life N and stress level S in the logarithmic coordinate system.

Basic assumption

A large number of fatigue tests have confirmed that the fatigue life for a metallic material generally follows a lognormal distribution in across the mid-life span24, 25. Let the number of stress levels be \(n\), the total number of specimens be \(k\), the number of specimens at the i-th stress level be \({k}_{i}\), and the fatigue life of the j-th specimen at the i-th stress level be \({X}_{ij}\). Then the probability density function can be expressed as:

$$f\left( {X_{ij} } \right) = \frac{1}{{\sigma_{i} \sqrt {2\pi } }}{\text{exp}}\left[ { - \frac{1}{2}\left( {\frac{{X_{ij} - \mu_{i} }}{{\sigma_{i} }}} \right)^{2} } \right],$$
(4)

where \(i=\mathrm{1,2},\dots ,n\), \(j=\mathrm{1,2},\dots ,{k}_{i}\), and \({\mu }_{i}\) and \({\sigma }_{i}\) are the mean and std (standard deviation) of the log-life (logarithmic fatigue life) under stress level \({S}_{i},\) respectively.

The probability distribution function is:

$$F\left( {X_{ij} } \right) = P(X < X_{ij} ) = \Phi \left( {\frac{{X_{ij} - \mu_{i} }}{{\sigma_{i} }}} \right)$$
(5)

Life equivalent principle

From Eq. (1), when the loading method, loading form, stress ratio and test method are determined, the fatigue life of specimen is determined by material performance and manufacturing quality. During fatigue test, it is impossible to perform fatigue test on the same specimen at different stress levels, but according to "consistency principle of fatigue life percentile"26, the life equivalent of a specimen at different stress levels can be achieved because the fatigue life probability distribution \(F({X}_{ij})\) of the same specimen is constant under different stress levels in medium life span. Thus, the j-th specimen's fatigue life at the i-th stress level can be obtained by using Eq. (6) below:

$$\frac{{X_{ij} - \mu_{i} }}{{\sigma_{i} }} = T\left( {T\,{\text{is a constant,}}\,i = {1},{2}, \, \ldots ,n} \right)$$
(6)

Based on this method, the number of specimens can be increased for different stress levels, maximizing sample information. In particular, it provides a good way to expand the number of specimens. However, this method is based on obtaining a highly accuracy mean life and life std for each stress level. In fact, the mean life and life std are related to the stress level. Therefore, it is especially important to find the relationship between mean life and stress level as well as the relationship between life std and stress level.

P-S-N curve fitting based on the maximum likelihood method

The basic idea of the maximum likelihood method is that the probability of an event that has occurred is a maximum. Therefore, the selection of unknown parameters is based on the benefit of that event. When using the maximum likelihood method to fit the P-S-N curve, fatigue tests are performed on one specimen at each stress level \({S}_{i}\) (i = 1, 2, …, n), resulting in a corresponding log-life of \({X}_{i}\). Then a group of specimens are tested under one specified stress level \({S}_{d}\), where the corresponding log-lives are \({X}_{dj}\) (j = 1, 2, …, q)27. Meanwhile, two assumptions are proposed for the mid-life span: the fatigue lives follow a lognormal distribution at each stress level, and the life mean and life std each have a linear relationship with stress level in the logarithmic coordinate system. Then, the following formula can be obtained

$$\mu_{i} = \mu_{d} + \alpha \left( {Y_{i} - Y_{d} } \right)$$
(7)
$$\sigma_{i} = \sigma_{d} + \beta \left( {Y_{i} - Y_{d} } \right),$$
(8)

where \({Y}_{i}\) and \({Y}_{d}\) are the values of \({S}_{i}\) and \({S}_{d}\), respectively, in the logarithmic coordinate system; \({\mu }_{d}\) and \({\sigma }_{d}\) are the log-life mean and log-life std, respectively of the sample at stress level \({S}_{d}\); and \(\alpha\) and \(\beta\) are pending constants.

The log-life with failure probability p is:

$$X_{p} = \mu + u_{p} \sigma$$
(9)

According to the above assumptions, the likelihood function can be written as:

$$L = \mathop \prod \limits_{i = 1}^{n} f\left( {X_{i} } \right) = \mathop \prod \limits_{i = 1}^{n} \frac{1}{{\sigma_{i} \sqrt {2\pi } }}\exp \left[ { - \frac{1}{2}\left( {\frac{{X_{i} - \mu_{i} }}{{\sigma_{i} }}} \right)^{2} } \right]$$
(10)

Substituting Eqs. (7) and (8) into Eq. (10) and taking the logarithm of both sides of Eq. (10) leads to:

$$lnL = - \mathop \sum \limits_{i = 1}^{n} \left\{ {ln\sqrt {2\pi } + ln\left[ {\sigma_{d} + \beta \left( {Y_{i} - Y_{d} } \right)} \right] + \frac{1}{2}\left( {\frac{{X_{i} - \mu_{d} - \alpha \left( {Y_{i} - Y_{d} } \right)}}{{\sigma_{d} + \beta \left( {Y_{i} - Y_{d} } \right)}}} \right)^{2} } \right\}$$
(11)

When \(lnL\) reaches its maximum value, the corresponding \(\alpha\) and \(\beta\) reach their maximum likelihoods. If we take partial derivatives of Eq. (11) and set them equal to zero, we can find their values numerically instead.

$$\frac{\partial lnL}{{\partial \alpha }} = - \mathop \sum \limits_{i = 1}^{n} \left\{ {\frac{{\left( {Y_{i} - Y_{d} } \right)[X_{i} - \mu_{d} - \alpha \left( {Y_{i} - Y_{d} } \right)]}}{{\left[ {\sigma_{d} + \beta \left( {Y_{i} - Y_{d} } \right)} \right]^{2} }}} \right\} = 0$$
(12)
$$\frac{\partial lnL}{{\partial \beta }} = - \mathop \sum \limits_{i = 1}^{n} \left\{ {\frac{{\left( {Y_{i} - Y_{d} } \right)[X_{i} - \mu_{d} - \alpha \left( {Y_{i} - Y_{d} } \right)]^{2} }}{{\left[ {\sigma_{d} + \beta \left( {Y_{i} - Y_{d} } \right)} \right]^{3} }} + \frac{{\left( {Y_{i} - Y_{d} } \right)}}{{\left[ {\sigma_{d} + \beta \left( {Y_{i} - Y_{d} } \right)} \right]}}} \right\} = 0$$
(13)

Using the above method, the P-S-N curve's equation is:

$$X_{p} = \mu_{d} + u_{p} \sigma_{d} + \left( {\alpha + u_{p} \beta } \right)\left( {Y_{i} - Y_{d} } \right)$$
(14)

Equation (14) can be rewritten as:

$$Y_{i} = \frac{{X_{p} - \mu_{d} - u_{p} \sigma_{d} }}{{\left( {\alpha + u_{p} \beta } \right)}} + Y_{d}$$
(15)

Improved maximum likelihood method for P-S-N curve fitting

Sample information reconstruction method

The problem with fitting the P-S-N curve using the maximum likelihood method is that the mean life and life std for each stress level cannot be obtained, with the exception of stress level \({S}_{d}\). Meanwhile, due to high costs and the long time periods needed for fatigue tests, it is difficult to use large samples for fatigue tests. Therefore, this research focuses on obtaining the mean life and life std for each stress level with high accuracy based on small or very small samples.

Generally speaking, when the number of specimens for each stress level is large enough, there is an approximate linear relationship between the \(\mu\) (or \(\sigma\)) and stress level in the logarithmic coordinate, which is shown in Fig. 1a. However, in the case of small or very small number samples, it is difficult for fatigue test data to accurately describe true data distribution pattern, particularly in the case of the relationship between stress level and the life std.

Figure 1
figure 1

Fatigue life distributions when there is (a) a large number of specimens and (b) a small number of specimens.

Full size image

In fact, the most typical phenomenon is that they are not linear relationship in some adjacent stress levels when the number of samples is small, which is shown in Fig. 1b. There may even be a phenomenon that it is the opposite of the homoskedasticity method28 or linear heteroskedasticity method29. The sample information reconstruction method is given to deal with such problems in this paper.

When the fatigue life stds' change direction (increase or decrease) of two adjacent stress levels is inconsistent with the overall trend, it can combine the two stress levels into one stress level, and Eq. (16) can be defined under the new stress level, as follows:

$$\left\{ \begin{gathered} S_{i}^{\prime} = \frac{{k_{i} }}{k}S_{i} + \frac{{k_{i + 1} }}{k}S_{i + 1} \hfill \\ k = k_{i} + k_{i + 1} \hfill \\ \sigma_{i}^{\prime} = \frac{{k_{i} }}{k}\sigma_{i} + \frac{{k_{i + 1} }}{k}\sigma_{i + 1} \hfill \\ \end{gathered} \right.,$$
(16)

where \({S}_{i}^{\prime}\) is the new stress level replacing the old stress levels \({S}_{i}\) and \({S}_{i+1}\), \({\sigma }_{i}{\prime}\) is the new life std replacing the old life stds \({\sigma }_{i}\) and \({\sigma }_{i+1}\), and \(k\) is the sum of \({k}_{i}\) and \({k}_{i+1}\).

When the fatigue life stds' change directions (increase or decrease) of three adjacent stress levels relationships are inconsistent with the overall trend, these three stress levels can be combined into one stress level, and Eq. (17) below can be defined under the new stress level, where its parameters are defined as in Eq. (16).

$$\left\{ \begin{gathered} S_{i}^{\prime} = \frac{{k_{i - 1} }}{k}S_{i - 1} + \frac{{k_{i} }}{k}S_{i} + \frac{{k_{i + 1} }}{k}S_{i + 1} \hfill \\ k = k_{i - 1} + k_{i} + k_{i + 1} \hfill \\ \sigma_{i}^{\prime} = \frac{{k_{i - 1} }}{k}\sigma_{i - 1} + \frac{{k_{i} }}{k}\sigma_{i} + \frac{{k_{i + 1} }}{k}\sigma_{i + 1} \hfill \\ \end{gathered} \right.$$
(17)

Sample information equivalence shall be performed as this method if the variation is not unique between life mean and stress level, and this method is defined as sample information reconstruction.

Improved maximum likelihood method and its calculation process

The basic principle of the improved maximum likelihood method is: Firstly, the test data are compiled using the life equivalent principle method, which, when combined with the sample information reconstruction method, can both improve the accuracy of the life stds and ensure the accuracy of the life equivalency. Secondly, the P-S-N curve is fitted via the maximum likelihood method. The calculation flowchart of improved maximum likelihood method for P-S-N curve fitting is shown in Fig. 2, and the detailed steps for the method are as follows:

  1. (1)

    According to the fatigue test data, if there is an approximate linear relationship between the mean life and stress level in logarithmic coordinates at each stress level, then those lines can be fitted by the least squares method, and a mean life mean can be calculated for each stress level.

  2. (2)

    If there is an approximate linear relationship on the whole between life std and stress level in the logarithmic coordinate system but some adjacent stress levels fail to follow the trend closely, the sample information reconstruction method can be used to generate new data, allowing the life stds at different stress levels to be calculated.

  3. (3)

    According to the life equivalent principle, combined with the new means and the new stds, the fatigue lives under different stress levels are equivalent to those at the specific stress level \({S}_{d}\) for a large sample, so all the information in the existing sample is utilized.

  4. (4)

    Finally, the P-S-N curve is fitted with this equivalent fatigue data via the maximum likelihood method.

Figure 2
figure 2

The calculation flowchart of improved maximum likelihood method.

Full size image

Fatigue test of T-joint specimen

Welded T-joints in railway wagons are widely used to provide the basic parameters for fatigue life evaluations of these wagons and can be used to verify the accuracy of the P-S-N curve fitted via the new maximum likelihood method. Thus, fatigue tests were carried out on 30 T-joint specimens made of Q450NQR1 steel, then the P-S-N curve was fitted.

Material properties

At present, the total number of railway wagons in China exceeds 900,000, and their structures are basically all-steel welded30. Q450NQR1 steel is the most widely used. Therefore, it is very representative to fabricate fatigue specimens out of Q450NQR1 steel. Tables 1 and 2 show the chemical composition and mechanical properties of Q450NQR1, respectively.

Table 1 Chemical composition of Q450NQR1 steel 31.
Full size table
Table 2 Mechanical properties of Q450NQR1 steel 31.
Full size table

Specimen fabrication

Specimen sizes for the fatigue tests followed Standard GB/3075-200832. For each specimen, the thickness of the plate was 6 mm, the dimensional tolerance was ± 1 mm, the vertical tolerance was ± 1 mm, the flatness was within 1 mm, and the parallelism of each side was less than 0.2 mm. The specimens had no defects, such as delamination, depressions, and large bulges, but fine burrs were allowed. The welding method of parts adopts mixed gas shielded welding, the mixed gas ratio is Ar:CO2 = 80%:20%, the gas flow is 15–20 L/min; the welding voltage is 18–23 V, and the welding current is 150–200A. A total of 30 T-joint specimens were fabricated. The welded form, specimen size and an actual specimen are shown in Fig. 3.

Figure 3
figure 3

Welded T-joint specimen: (a) three dimensional model; (b) 2D drawing; (c) actual specimen.

Full size image

Loading conditions

At present, the S–N curves of metallic materials are usually obtained by group test method and lifting method33, and the fatigue test in this paper also adopts this methods. The fatigue tests were carried out with the Zwick/Roell high-frequency fatigue testing machine under normal temperatures. The specimens were loaded with a multi-level, constant-amplitude loading consisting of no less than five stages. The loading method utilized was the axial pull-pull cycle with a stress ratio R = 0.1, sine wave load, and loading frequency of about 70 Hz. In order to obtain a realistic fatigue limit, the load stress was adjusted as close as possible to the stress level at a fatigue life of 2 × 106, which was then used as the basis for judging the fatigue failure of the specimen. That is, if the number of cycles exceeded 2 × 106, the specimen was no longer subject to damage.

Test results

The fatigue test results show that the fatigue cracks of all specimens are basically located at the weld toe, and the cracks spread laterally along the position of the weld toe, which is shown in Fig. 4. According to Fig. 4b, the fatigue crack began on the T-joint surface. A fatigue fracture is described by its source, fatigue crack stability extension, and rapid fracture zones. In the figure, the fatigue source zone is located at the end of the T-joint, and the crack growth rate is slower in this area. The upper and lower surfaces continuously opened and closed during the expansion process, leading to little friction, as it is relatively smooth. The white area in the middle of the fracture is the rapid fracture zone, which is a fresh section that appears after a crack has spread sufficiently far.

Figure 4
figure 4

The fatigue test results for a T-joint specimen: (a) its fatigue crack and; (b) its macroscopic fracture morphology.

Full size image

The fatigue test results for the specimens were calculated. For P-S-N curve fitting, the specimen fatigue life was logarithmically transformed, and the means and stds at the different stress levels were calculated. The results are shown in Table 3.

Table 3 The log-lives of T-joint specimens.
Full size table

P-S-N curve fitting for T-welded joint

According to the analysis of fatigue test results, this article focuses on the study of P-S-N curves of T-welded joints, which based on the characteristics of crack initiation at the weld toe and crack propagation along the weld toe direction. The improved maximum likelihood method, the least square method and the maximum likelihood method are used to fit the P-S-N curves of the data in Table 3, and the P-S-N curves of F2-level in BS7608 and FAT 80 in IIW are introduced. The P-S-N curves fitted by five methods are compared and analyzed to verify the accuracy of the P-S-N curves fitted by improved maximum likelihood method. Specifically, the curves is extrapolated according to the AASHTO standard34.

The least square method

The least square method is used to fit the P-S-N curve for the test data in Table 3 when the survival probability is 50%, 95%, 97.3% and 99.9%, as shown in Fig. 5.

Figure 5
figure 5

The P-S-N curves by least square method.

Full size image

According to Fig. 5, there is no curve with high survival probability that crosses the curve with low survival probability, which is consistent with the objective law of fatigue life distribution; With the decrease of stress level and the increase of survival probability, the difference of fatigue life between the three curves gradually increases, which is consistent with the law that the dispersion of fatigue life increases with the decrease of stress level. In addition, three of the curves are basically within the range of sample fatigue life.

The maximum likelihood method

The maximum likelihood method is used to directly fit the P-S-N curve of the data in Table 3. First, the curve equation of life mean and stress level which is fitted with the least square method as follows:

$$\mu_{i} = 14.1862 - 3.9766Y_{i}$$
(18)

Then, the curve equation of life std and stress level which is fitted with the least square method as follows:

$$\sigma_{i} = 0.5089 - 0.2199Y_{i}$$
(19)

Finally, according to the relevant parameters in Eq. (18) and formula Eq. (19), and the mean and the std on the specific stress level 110 MPa, the P-S-N curve can be obtained by maximum likelihood method as follows:

$$Y_{i} = \frac{{6.0616 + 0.0711u_{p} - X_{p} }}{{\left( {0.2199u_{p} + 3.9766} \right)}} + 2.0414$$
(20)

According to Eq. (20), the P-S-N curves are fitted with survival probabilities p of 50%, 90%, 97.3% and 99.9% respectively and extrapolation, as shown in Fig. 6. For a more intuitive interpretation, the fatigue test data for the T-welded joints are plotted together in Fig. 6.

Figure 6
figure 6

The P-S-N curves by maximum likelihood method.

Full size image

According to Fig. 6, the curve fitted by the maximum likelihood method is basically consistent with the overall characteristics of the curve fitted by the least squares method. However, only one curve is in the range of sample fatigue life, and one curve is on the boundary of the range. Compared with the least square method, the curve gradually moves down with the increase of survival probability. It shows that the P-S-N curve fitted by this method is safer than the curve fitted by the least square method.

The improved maximum likelihood method

Samples information reconstruction

According to the test data in Table 3, the mean lives and stress levels follow the same approximate linear relationship in logarithmic coordinates. The line can be directly fitted by the least squares method, and the relationship between life mean and the stress level is:

$$\mu_{i} = 14.2562 - 4.0096Y_{i}$$
(21)

Based on Eq. (21), the mean life can be obtained for new stress levels, as shown in Table 4.

Table 4 Mean lives and stds for new stress levels.
Full size table

According to the test data in Table 3, the stds and stress levels follow roughly the same linear relationship in logarithmic coordinates, but the stds for stress levels 150 MPa, 130 MPa, and 110 MPa are higher than those under 140 MPa, 120 MPa, and 100 MPa. Therefore, the sample information reconstruction method can be used to reconstruct the sample information for two adjacent stress levels. After the reconstruction shown in Table 4, and the relationship between std and stress level is determined to be:

$$\sigma_{i} = 0.5484 - 0.2384Y_{i}$$
(22)

Life equivalent

Based on Eqs. (21) and (22), the mean and std for each stress level can be obtained, as shown in Table 5.

Table 5 Mean lives and stds after information reconstruction and fitting.
Full size table

According to the test data in Table 3, the number of specimens for stress level 110 MPa is relatively larger, and this stress level is close to the other stress levels. Therefore, s...

Comments

Popular posts from this blog