Key Words: multi-dimensional, non-linear, data regression, model, application
The “new square method” is an improved approach based on the “least square method”. It calculates not only the constants and coefficients but also the variables’ power values in a model in the course of data regression calculations, thus bringing about a simpler and more accurate calculation for non-linear data regression processes.
In non-linear data regression calculations, the “least square method” is applied for mathematical substitutions and transformations in a model, but the regression results may not always be correct, for which we have made improvement on the method adopted and named the improved one as “new square method”.
- Principle of New Square Method
While investigating the correlation between variables(x,y), we get a series of paired data (x1,y1,x2,y2……xn,yn) through actual measurements. Plot these data on the x – y coordinates, then a scatter diagram as shown in Figure 1 will be obtained. It can be observed that the points are in the vicinity of a curve, whose fitted equation is set as the following Equation 1.
where a0, a1 and k indicate any real numbers.
To establish the fitted equation, the values of a0, a1 and k need to be determined via subtracting the calculated valuefrom the measured value yi, i.e., via(yi - y).
Then calculate the quadratic sum of m(yi - y) as shown in Equation 2.Equation 2
Substitute Expression 1 into Expression 2, as shown in Expression 3:Equation 3
Find the partial derivatives for a0, a1 and k respectively through functionso as to make the derivatives equal to zero:Equation 4Equation 5Equation 6
Through derivation it is found that there is no analytic solution to this equation set, then computer programs are utilized to calculate its arithmetic solutions and obtain the solutions for a0, a1 and k as well as the correlation coefficient. It is observed that the closer the correlation coefficient is to 1, the better the model fits.
- Comparison between the “New Square Method” and the “Least Square Method”
If Equation 7 as shown below is adopted to fit any data:
y = a0 + a1xikEquation 7
The Comparison Table between the “New Square Method” and the “Least Square Method”:
Least Square Method New Square Method Fitted Equations y = a0 + a1x y = a0 + a1xik Calculated Regression Results a0 and a2 a0 and a2 and k
- In the “new square method”, the power valueof the dependent variable is calculated, while in the “least square method”, is assumed to be 1. With the calculated power value for the dependent variable, the new square method is able to have the fitted equation generate a fitted line at any curve to better fit the non-linear data.
- In the “new square method”, non-linear data with one factorcan be regressed by applying the following Equation 8 in the computer programs to obtain more accurate fittings of non-linear data by regression models.
y = a0 + a1xk1+ a2xk2+......+anxknEquation 8
In Equation 8:
x — Variable
y — Variable
x,y — Dimensional (two-dimensional).
x k1,xk2,xkn — Element.
a 0 — Constant.
a 1, a2, an — Coefficient.
k1, k2, kn — Power.
- As for the regression of non-linear data with multi-factors in the “new square method”, the following Equation 9 can be utilized in computer programs for this purpose. This equation takes into account both the contribution of factors(x1, x 2.....xn) to the objective function (y) and the interplays among factors(x1, x2.....xn) during the regression calculation, that is why the fitted models are of high correlation.
y = a0 + a1x1k11 + a2x2k21 + a3x1k12x2k22 + a4x1k13x2k23+......+an+1x1k1n+1x2k2n+1Equation 9
In Equation 9:
x 1, x2 — Variable.
y — Function.
x1,x2,y — Dimensional (three-dimensional).
x1k11, x2k21,x1k12,x2k22, x1k13, x2k23, x1k1n+1, x2k2n+1 — Element.
a 0, — Constant.
a 1, a2, a3, a4, an+2 — Coefficient.
k11, k21, l12, k22, k13, k23, k1n + 1, k2n + 1 — Power.
Note: Equation 9, which takes three-dimensional data as its example, can be applied for the regression of data in curved surface data.Bibliography:
- Shoupeng Wei. 1994. Petrochemical production process optimization. Sinopec press.
- Chengsen Lin. 1997. The numerical calculation method. China Science Press.
- Shisong Mao and Jixiang Zhou.1996. The theory of probability and statistics. China Statistics Press.
- Jeffrey Ri hter.2003. Microsoft.NET framework of the program design. Tinghua University press.