21.3. Probabilistic Methods

www.kxcad.net Home > CAE Index > ANSYS Index > Release 11.0 Documentation for ANSYS


Your Ad Here

21.3.1. Introduction

All probabilistic methods execute the deterministic problem several times, each time with a different set of values for the random input variables. The various probabilistic methods differ in the way in which they vary the values of the random input variables from one execution run to the next.

One execution run with a given set of values for the random input variables with m is the number of random input variables is called a sampling point, because the set of values for the random input variables marks a certain point in the space of the random input variables.

21.3.2. Common Features for all Probabilistic Methods

21.3.2.1. Random Numbers with Standard Uniform Distribution

A fundamental feature of probabilistic methods is the generation of random numbers with standard uniform distribution. The standard uniform distribution is a uniform distribution with a lower limit xmin = 0.0 and an upper limit xmax = 1.0. Methods for generating standard uniformly distributed random numbers are generally based on recursive calculations of the residues of modulus m from a linear transformation. Such a recursive relation is given by the equation:

(21–44)

where:

a, c, m = nonnegative integers
si-1 = previous seed value of the recursion
ki-1 = integer part of the ratio (a si-1 + c) / m

A set of random numbers with standard uniform distribution is obtained by normalizing the value calculated by (Equation 21–44) with the modulus m:

(21–45)

It is obvious from (Equation 21–44) that an identical set of random numbers will be obtained if the same start value for the seed si-1 is used. Therefore, the random numbers generated like that are also called “pseudo random” numbers. See Hammersley and Handscomb(308) for more details about the generation of random numbers with standard uniform distribution.

21.3.2.2. Non-correlated Random Numbers with an Arbitrary Distribution

For probabilistic analyses, random numbers with arbitrary distributions such as the ones described in Statistical Distributions for Random Input Variables are needed. The most effective method to generate random number with any arbitrary distribution is the inverse transformation method. A set of random numbers for the random variable X having a cumulative distribution function Fx (x) can be generated by using a set of standard uniformly distributed random numbers according to (Equation 21–45) and transforming them with the equation:

(21–46)

Depending on the distribution type of the random variable X, the inverse cumulative distribution function can be calculated as described in Statistical Distributions for Random Input Variables.

21.3.2.3. Correlated Random Numbers with an Arbitrary Distribution

Correlated random input variables must be dealt with by all probabilistic methods, if there are random input variables, the user has identified as being correlated with each other. In order to handle correlated random input variables it is necessary to transform the random variable values using the Nataf model. The Nataf model is explained in detail in Liu and Der Kiureghian(311)).

21.3.3. Monte Carlo Simulation Method

A fundamental characteristic of the Monte Carlo Simulation method is the fact that the sampling points are located at random locations in the space of the random input variables. There are various techniques available in literature that can be used to evaluate the random locations of the sampling points (see Hammersley and Handscomb(308), Iman and Conover(309)).

21.3.3.1. Direct Monte Carlo Simulation

The direct Monte Carlo Simulation method is also called the crude Monte Carlo Simulation method. It is based on randomly sampling the values of the random input variables for each execution run. For the direct Monte Carlo Simulation method the random sampling has no memory, i.e., it may happen that one sampling point is relative closely located to one or more other ones. An illustration of a sample set with a sample size of 15 generated with direct Monte Carlo Simulation method for two random variables X1 and X2 both with a standard uniform distribution is shown in Figure 21.10: "Sample Set Generated with Direct Monte Carlo Simulation Method".

Figure 21.10  Sample Set Generated with Direct Monte Carlo Simulation Method

As indicated with the circle, there may be sample points that are located relatively close to each other.

21.3.3.2. Latin Hypercube Sampling

For the Latin Hypercube Sampling technique the range of all random input variables is divided into n intervals with equal probability, where n is the number of sampling points. For each random variable each interval is “hit” only once with a sampling point. The process of generating sampling points with Latin Hypercube has a “memory” in the meaning that the sampling points cannot cluster together, because they are restricted within the respective interval. An illustration of a sample with a sample size of 15 generated with Latin Hypercube Sampling method for two random variables X1 and X2 both with a standard uniform distribution is shown in Figure 21.11: "Sample Set Generated with Latin Hypercube Sampling Method".

Figure 21.11  Sample Set Generated with Latin Hypercube Sampling Method

There are several ways to determine the location of a sampling point within a particular interval.

  1. Random location: Within the interval the sampling point is positioned at a random location that agrees with the distribution function of the random variable within the interval.

  2. Median location: Within the interval the sampling point is positioned at the 50% position as determined by the distribution function of the random variable within the interval.

  3. Mean value: Within the interval the sampling point is positioned at the mean value position as determined by the distribution function of the random variable within the interval.

See Iman and Conover(309) for further details.

21.3.4. The Response Surface Method

For response surface methods the sampling points are located at very specific, predetermined positions. For each random input variable the sampling points are located at given levels only.

Response surface methods consist of two key elements:

  1. Design of Experiments: Design of Experiments is a technique to determine the location of the sampling points. There are several versions for design of experiments available in literature (see Montgomery(312), Myers(313)). These techniques have in common that they are trying to locate the sampling points such that the space of random input variables is explored in a most efficient way, meaning obtaining the required information with a minimum number of sampling points. An efficient location of the sampling points will not only reduce the required number of sampling points, but also increase the accuracy of the response surface that is derived from the results of those sampling points. Two specific forms of design of experiments are outlined in the remainder of this section.

  2. Regression Analysis: Regression analysis is a technique to determine the response surface based on the results obtained at the sampling points (see Neter et al.(314)). Regression Analysis for Building Response Surface Models has been dedicated to discuss regression analysis, because regression analysis is not only used in the context of response surface methods.

21.3.4.1. Central Composite Design

Location of Sampling Points Expressed in Probabilities

For central composite design the sampling points are located at five different levels for each random input variable. In order to make the specification of these levels independent from the distribution type of the individual random input variables, it is useful to define these levels in terms of probabilities. The five different levels of a central composite design shall be denoted with pi, with i = 1, ... , 5.

A central composite design is composed of three different parts, namely:

  1. Center point: At the center point the values of all random input variables have a cumulative distribution function that equals p3.

  2. Axis points: There are two points for each random variable located at the axis position, i.e., if there are m random input variables then there are 2m axis points. For the axis points all random input variables except one have a value corresponding to the center location and one random variable has a value corresponding to p1 for the low level point and corresponding to p5 for the high level point.

  3. Factorial points: In a central composite design there are 2m-f factorial points. Here, f is the fraction of the factorial part. The fraction of the factorial part is explained in more detail in the next subsection. For the factorial points all random input variables have values corresponding to permutations of p2 for the lower factorial level and p4 for the upper factorial level.

A sample set based on a central composite design for three random variables X1, X2 and X3 is shown in Figure 21.12: "Sample Set Based on a Central Composite Design".

Figure 21.12  Sample Set Based on a Central Composite Design

For this example with three random input variables the matrix describing the location of the sampling points in terms of probabilities is shown in Table 21.1: "Probability Matrix for Samples of Central Composite Design".

Table 21.1  Probability Matrix for Samples of Central Composite Design

SampleX1X2X3Part
1p3p3p3Center
2p1p3p3Axis Points
3p5p3p3
4p3p1p3
5p3p5p3
6p3p3p1
7p3p3p5
8p2p2p2Factorial Points
9p2p2p4
10p2p4p2
11p2p4p4
12p4p2p2
13p4p2p4
14p4p4p2
15p4p4p4

Resolution of the Fractional Factorial Part

For problems with a large number of random input variables m, the number of sampling points is getting extensively large, if a full factorial design matrix would be used. This is due to the fact that the number of sampling points of the factorial part goes up according to 2m in this case. Therefore, with increasing number of random variables it is common practice to use a fractional factorial design instead of a full factorial design. For a fractional factorial design, the number of the sampling points of the factorial part grows only with 2m-f. Here f is the fraction of the factorial design so that f = 1 represents a half-factorial design, f = 2 represents a quarter-factorial design, etc. Consequently, choosing a larger fraction f will lead to a lower number of sampling points.

In a fractional factorial design the m random input variables are separated into two groups. The first group contains m - f random input variables and for them a full factorial design is used to determine their values at the sampling points. For the second group containing the remaining f random input variables defining equations are used to derive their values at the sampling points from the settings of the variables in the first group.

As mentioned above, we want to use the value of the random output parameters obtained at the individual sampling points for fitting a response surface. This response surface is an approximation function that is determined by a certain number of terms and coefficients associated with these terms. Hence, the fraction f of a fractional factorial design cannot become too large, because otherwise there would not be enough data points in order to safely and accurately determine the coefficients of the response surface. In most cases a quadratic polynomial with cross-terms will be used as a response surface model. Therefore, the maximum value for the fraction f must be chosen such that a resolution V design is obtained (here V stands for the Roman numeral 5). A design with a resolution V is a design where the regression coefficients are not confounded with each other. A resolution V design is given if the defining equation mentioned above includes at least 5 random variables as a total on both sides of the equation sign. Please see Montgomery(312) for details about fractional factorial designs and the use of defining equations.

For example with 5 random input variables X1 to X5 leads to a resolution V design if the fraction is f = 1. Consequently, a full factorial design is used to determine the probability levels of the random input variables X1 to X4. A defining equation is used to determine the probability levels at which the sampling points are located for the random input variable X5. See Montgomery(312) for details about this example.

Location of Sampling Points Expressed in Random Variable Values

In order to obtain the values for the random input variables at each sampling point, the probabilities evaluated in the previous section must be transformed. To achieve this, the inverse transformation outlined under Common Features for all Probabilistic Methods can be used for non-correlated random variables. The procedure dealing with correlated random variables also mentioned under Common Features for all Probabilistic Methods can be used for correlated random variables.

21.3.4.2. Box-Behnken Matrix Design

Location of Sampling Points Expressed in Probabilities

For a Box-Behnken Matrix design, the sampling points are located at three different levels for each random input variable. In order to make the specification of these levels independent from the distribution type of the individual random input variables, it is useful to define these levels in terms of probabilities. The three different levels of a Box-Behnken Matrix design shall be denoted with p1, with i = 1, ... , 3.

A Box-Behnken Matrix design is composed of two different parts, namely:

  1. Center point: At the center point the values of all random input variables have a cumulative distribution function that equals p2.

  2. Midside points: For the midside points all random input variables except two are located at the p2 probability level. The two other random input variables are located at probability levels with permutations of p1 for the lower level and p3 for the upper level.

See Box and Cox(307) for further details. A sample set based on a central composite design for three random variables X1, X2 and X3 is shown in Figure 21.13: "Sample Set Based on Box-Behnken Matrix Design".

Figure 21.13  Sample Set Based on Box-Behnken Matrix Design

For this example with three random input variables the matrix describing the location of the sampling points in terms of probabilities is shown in Table 21.2: "Probability Matrix for Samples of Box-Behnken Matrix Design".

Table 21.2  Probability Matrix for Samples of Box-Behnken Matrix Design

SampleX1X2X3Part
1p2p2p2Center
2p1p1p2 Midside Points
3p1p3p2
4p3p1p2
5p3p3p2
6p1p2p1
7p1p2p3
8p3p2p1
9p3p2p3
10p2p1p1
11p2p1p3
12p2p3p1
13p2p3p3

Location of Sampling Points Expressed in Random Variable Values

In order to obtain the values for the random input variables at each sampling point, the same procedure is applied as mentioned above for the Central Composite Design.

Your Ad Here