Discussion Of Results

  • November 2019
  • PDF

This document was uploaded by user and they confirmed that they have the permission to share it. If you are author or own the copyright of this book, please report to us by using this DMCA report form. Report DMCA


Overview

Download & View Discussion Of Results as PDF for free.

More details

  • Words: 3,901
  • Pages: 31
Presentation and Discussion of the Results

A. Descriptive Statistics of the actual data Descriptive Statistics Mean

Std.Dev

Minimum

Maximum

N

TOTEX2

102389.8 129866.6

8926.000

3203978

4130

TOTIN2

134119.4 216934.9

9067.000

4357180

4130

The above figure shows the descriptive statistics of the second visit actual data. The mean and the standard deviation of the data for TOTIN2 are larger than TOTEX2 primarily because most of the values from TOTIN2 are larger and vary significantly than TOTEX2. The statistics above will be used later in computing for the accurateness and precision of the imputed data. B. Formation of Imputation Classes Chi – Square Test for Independence Matching Variables PROV CODES1 CODEP1 Matching Variables PROV CODES1 CODEP1

TOTIN2 Test Stat P-Value 151.78 < 0.0001 613.859 < 0.0001 358.436 < 0.0001 Test Stat 137.83 687.342 193.132

TOTEX2 P-Value < 0.0001 < 0.0001 < 0.0001

Degrees of Freedom 9 6 9 Degrees of Freedom 9 6 9

A test of association using chi-square test for independence showed that all the candidates for the matching variable have a very significant relationship with both the nonresponse variables. In both nonresponse variables, CODES1 has the largest test statistics among all the candidates for the matching variable.

Measures of Association

MEASURES OF ASSOCIATION: INCOME VARIABLE PROVINCE CODES1 Phi-Coefficient 0.192 0.386 Cramer's V 0.111 0.273 Contingency Test 0.188 0.36 MEASURES OF ASSOCIATION: EXPENDITURE VARIABLE PROVINCE CODES1 Phi-Coefficient 0.183 0.408 Cramer's V 0.105 0.288 Contingency Test 0.18 0.378

CODEP1 0.295 0.17 0.283 CODEP1 0.216 0.125 0.211

In the results above, CODES1 has the highest association in all the tests to both nonresponse variables. In addition, it registered at least twenty percent, the minimum requirement mentioned in the methodology for it to be considered as a matching variable in all the tests. PROV fares the worst in the test for association after measuring below twenty percent in both nonresponse variable and tests for association. After dominating the tests for association, CODES1 has been chosen as matching variable for this study.

C. Distribution of the simulated data set by education status imputation classes for each nonresponse variable and nonresponse rates

CLASSES 1 2 3 TOTAL

TOTEX2, 10% NONRESPONSE RATE Total Number of Observations Observations set to Observations Retained Nonresponse Mean Mean No. % No. No. Rate Rate 2635 63.8% 2379 75602.76 256 78454.08 1434 34.7% 1280 137760.6 154 132438.5 61 1.5% 58 443518.3 3 157801.3 4130 100% 3717 102748.6 413 99160.23

Percentage of deleted Cases 9.7% 10.7% 4.9% 10.0%

The table above shows that mean of the respondents doesn’t significantly differ than the mean of the nonrespondents. However, in the third imputation class, the mean vary significantly between the respondent and nonrespondents. There were only three observations that were simulated to nonresponse as compared to almost ten and eleven percent from the first and second class respectively.

TOTEX2, 20% NONRESPONSE RATE

CLASSES 1 2 3 TOTAL

Total Number of Observations No.

%

2635 1434 61 4130

63.8% 34.7% 1.5% 100%

Observations Retained Mean No. Rate 2100 75446.62 1154 137492.3 50 412604.6 3304 102219.8

Observations set to Nonresponse Mean No. Rate 535 77580.03 280 135939.4 11 506112.1 826 103069.7

Percentage of nonresponse cases 20.3% 19.5% 18.0% 20.0%

Unlike the previous table with a half lower nonresponse rate, the table above showed the opposite. The mean of the nonrespondents vary significantly against the respondents. Also, the mean from the third class of the nonrespondents vary significantly from the respondents. The percentage of nonrespondents is almost equally distributed to each imputation class.

CLASSES 1 2 3 TOTAL

TOTEX2, 30% NONRESPONSE RATE Total Number of Observations Observations set to Observations Retained Nonresponse Mean Mean No. % No. No. Rate Rate 2635 63.8% 1833 75121.42 802 77613.02 1434 34.7% 1014 136477.1 420 138907.9 61 1.5% 44 342434.8 17 654725.5 4130 100% 2891 100709.9 1239 106309.4

Percentage of nonresponse cases 30.4% 29.3% 27.9% 30.0%

Results above showed didn’t differ significantly from the previous except that the mean from the each imputation classes are greater for the nonrespondents than the respondents especially from the third imputation class in which the mean of the respondents is almost just the half of the nonrespondents. Again, similar to the twenty percent nonresponse rate for TOTEX2, the distribution of nonrespondents for each class is almost equally distributed.

CLASSES

TOTIN2 , 10% NONRESPONSE RATE Total Number of Observations Observations set to Observations Retained Nonresponse Mean Mean No. % No. No. Rate Rate

Percentage of nonresponse cases

1 2 3 TOTAL

2635 1434 61 4130

63.8% 34.7% 1.5% 100%

2379 1280 58 3717

93569.17 187654.9 660909.9 134821.7

256 154 3 413

93766.26 181006.4 300630.7 127799.1

9.7% 10.7% 4.9% 10.0%

Like for the TOTEX2 variable from the lowest nonresponse rate, the percentage of nonrespondents for the TOTIN2 variable is the same. Unlike for TOTEX2, the mean of the respondents in all the classes significantly differ from the nonrespondents especially in the third imputation class where the nonrespondents mean is less than half of the respondents mean.

CLASSES 1 2 3 TOTAL

TOTIN2, 20% NONRESPONSE RATE Total Number of Observations Observations set to Observations Retained Nonresponse Mean Mean No. % No. No. Rate Rate 2635 63.8% 2100 93565.90 535 93676.31 1434 34.7% 1154 184817.6 280 195691.9 61 1.5% 50 634562.8 11 682411.3 4130 100% 3304 133624.7 826 136098.2

Percentage of nonresponse cases 20.3% 19.5% 18.0% 20.0%

Contrary to the lower nonresponse rate of TOTIN2, the mean coming from the nonrespondents in all imputation classes of the nonrespondents differ than the respondents. However, unlike from the previous results in both TOTIN2 and TOTEX2, the deviation of the nonrespondents and respondents from the second imputation class is surprisingly larger than the third imputation class.

CLASSES 1 2 3 TOTAL

TOTIN2, 30% NONRESPONSE RATE Total Number of Observations Observations set to Observations Retained Nonresponse Mean Mean No. % No. No. Rate Rate 2635 63.8% 1833 92972.55 802 94995.68 1434 34.7% 1014 181821.5 420 199300.6 61 1.5% 44 523326.6 17 953429.2 4130 100% 2891 130685.6 1239 142131.6

Percentage of nonresponse cases 30.4% 29.3% 27.9% 30.0%

The mean deviation of the nonrespondents and the nonrespondents for the results above has largest deviation from all the results coming from both nonresponse variables. Both the second and third imputation class has huge mean deviations. The mean rate of nonrespondents and respondents from the third imputation class has the largest and smallest value from all the nonresponse rates in the TOTIN2 respectively. D. Model Validation Results for Regression Imputation The following results for this section are the tests in validating the assumptions of a single regression model. For each imputation class, nonresponse variable and rate, different models were made and tested.

TOTEX, 10% Nonresponse Rate, First Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.853 0.728 0.728 6363.590 0.000 0.277

Analysis of Variance (ANOVA) Sums of Squares

Df

Regression 486.5398 1

Mean Squares

p-level

486.5398 6363.590 0.00

Residual 181.7378 2377 0.0765 Total

F

668.2776 Durbin-Watson Test DW STAT

Serial

Estimate 2.075187 -0.038196

Normal Probability Plot

Expected Normal Value

Normal Probability Plot of Residuals 10 8 6 4 2 0 -2 -4 -6 -2.0

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

12.0

12.5

2.0

2.5

13.0

13.5

Residuals

Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 9.0

9.5

10.0

10.5

11.0 Predicted Values

11.5

95% confidence

Summary statistics for the first imputation class of the expenditure variable under ten percent nonresponse rate showed that it has a fair coefficient of determination of seventy three percent and a strong direct association of eighty five percent between the first and second visit of the TOTEX variable. The model also is a good fit to the data after passing the ANOVA test in determining the fit of the model. The DW statistics and the serial correction showed that the error residuals have a very small autocorrelation. It’s a possibility that there are certain variables that are omitted from the model. The normal probability plot indicates that the data is still not normal after logarithmic transformation has been made. However, the graph showed that the data didn’t deviate away from the straight line. The researcher expected these kinds of results because in reality and in the literature, the continuous variables are rarely normal and the distribution of the data is frequently lognormal even after many data transformation have been applied. As for the homoskedasticity, the graph of the predicted vs. the residual scores showed that majority of the observations is near the line. Moreover, there is no pattern in the graph indicating that the residual variances are equal.

Expenditure, 10% Nonresponse Rate, Second Imputation Class

Summary Statistics 0.884 0.782 0.781 4574.954 0.000 0.309

MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

Analysis of Variance (ANOVA) Sums of df Mean F Regress. 436.3063 1

p-level

436.3063 4574.954 0.00

Residual 121.8809 1278 0.0954 Total

558.1871

Durbin-Watson Test Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

10 8 6 4 2 0 -2 -4 -6 -8 -2.0

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

2.0

2.5

3.0

13.0

13.5

14.0

14.5

Residuals

Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 9.5

10.0

10.5

11.0

11.5

12.0 Predicted Values

12.5

95% confidence

Expenditure, 10% Nonresponse Rate, Third Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.9499 0.9023 0.9005 516.8993 0.0000 0.3314

Linearity of the model: Analysis of Variance (ANOVA) Sums of

df

Regress. 436.3063 1

Mean

F

p-level

436.3063 4574.954 0.00

Residual 121.8809 1278 0.0954 Total

558.1871

Independence of Error Terms: Durbin-Watson Test DW STAT

Serial

Estimate 2.043782 -0.022334 Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals 10 Expected Normal Value

8 6 4 2 0 -2 -4 -6 -8

-2.0

-1.5

-1.0

-0.5

0.0

0.5 Residuals

1.0

1.5

2.0

2.5

3.0

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 9.5

10.0

10.5

11.0

11.5

12.0

12.5

13.0

13.5

95% confidence

Predicted Values

Income, 10% Nonresponse Rate, First Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.840 0.706 0.706 5703.605 0.000 0.331

Linearity of the model: Analysis of Variance (ANOVA) Sums of Regress. 625.0487 1

df

Mean

F

p-level

625.0487 5703.605 0.00

Residual 260.4915 2377 0.1096 Total

14.0

885.5402 Independence of Error Terms:

14.5

Durbin-Watson Test DW STAT

Serial

Estimate 2.047121 -0.023913 Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

8 6 4 2 0 -2 -4 -6 -2.0

-1.0

-1.5

-0.5

0.0

0.5

1.0

1.5

2.0

2.5

3.0

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 7

8

9

10

11

12

13 95% confidence

Predicted Values

Income Variable, 10% Nonresponse Rate, Second Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R

0.897 0.805 0.804

14

5261.480 0.000 0.331

F-STAT P-VALUE STD. ERR. OF ESTIMATE

Linearity of the Model: Analysis of Variance (ANOVA) Sums of

df

Regress. 576.7623 1

Mean

F

p-level

576.7623 5261.480 0.00

Residual 140.0941 1278 0.1096 Total

716.8564 Independence of Error Terms: Durbin-Watson Test DW STAT

Serial

Estimate 1.934528 0.031428 Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

8 6 4 2 0 -2 -4 -6 -8 -2.5

-2.0

-1.5

-1.0

-0.5

0.0

0.5

Residuals

Equality of Error Variances: Predicted vs. Residuals

1.0

1.5

2.0

2.5

Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 -2.5 8

9

10

11

12

13

14 95% confidence

Predicted Values

Income Variable, 10% Nonresponse Rate, Third Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.9591 0.9199 0.9185 642.9754 0.0000 0.3171

Linearity of the model: ANOVA Sums of df

Mean

F

p-level

Regress. 64.67059 1 64.67059 642.9753 0.000000 Residual 5.63249 56 0.10058 Total

70.30308 Independence of Error Terms: Durbin-Watson Test DW Stat

Serial

Estimate 2.157647 -0.079104 Normality of Error Terms:

15

Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

5 4 3 2 1 0 -1 -2 -3 -1.2

-1.0

-0.8

-0.6

-0.4

-0.2

0.0

0.2

0.4

0.6

0.8

1.0

1.2

1.4

1.6

Residuals

Independence of Error Terms: Predicted vs. Residuals

Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 1.6 1.4 1.2 1.0 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 -1.0 -1.2 10

11

12

13

14

15 95% confidence

Predicted Values

Expenditure, 20% Nonresponse Rate, First Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.857 0.734 0.734 5786.271 0.000 0.273

LINEARITY OF THE MODEL: ANOVA Sums of Regress. 429.7392 1

df

Mean

F

p-level

429.7392 5786.271 0.00

16

Residual 155.8159 2098 0.0743 Total

585.5551 INDEPENDENCE OF ERROR TERMS: DURBIN-WATSON TEST Durbin-

Serial

Estimate 2.013534 -0.006859 NORMALITY OF ERROR TERMS: NORMAL PROBABILITY PLOT Normal Probability Plot of Residuals Expected Normal Value

10 8 6 4 2 0 -2 -4 -6 -8 -2.0

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

2.0

2.5

13.0

13.5

Residuals

EQUALITY OF ERROR VARIANCES: PREDICTED vs. RESIDUALS Predicted vs. Residual Scores Dependent variable: LNSV 2.5 2.0

Residuals

1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0

9.0

9.5

10.0

10.5

11.0

11.5

12.0

12.5

95% confidence

Predicted Values

Expenditure, 20% Nonresponse Rate, Second Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2

0.887 0.787

ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.787 4268.380 0.000 0.308

Linearity of the model: ANOVA Sums of Regress. 405.0530 1

df

Mean

F

405.0530 4268.380 0.00

Residual 109.3204 1152 0.0949 Total

p-level

514.3734 Independence of Error Terms: Durbin-Watson Test DW STAT

Serial

Estimate 2.014977 -0.007528

Normality of Error Terms: Normal Probability Plot

Normal Probability Plot of Residuals Expected Normal Value

10 8 6 4 2 0 -2 -4 -6 -8 -2.0

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

2.0

2.5

3.0

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 9.5

10.0

10.5

11.0

11.5

12.0

12.5

13.0

13.5

14.0 95% confidence

Predicted Values

Expenditure, 20% Nonresponse Rate, Third Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.9490 0.9006 0.8985 434.6591 0.0000 0.3336

Linearity of the Model: Analysis of Variance Sums of df

Mean

F

p-level

Regress. 48.36771 1 48.36771 434.6591 0.000000

14.5

Residual 5.34131 48 0.11128 Total

53.70902 Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 2.268400 -0.167040 Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

3 2 1 0 -1 -2 -3 -1.2

-1.0

-0.8

-0.6

-0.4

-0.2

0.0

0.2

0.4

0.6

0.8

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 -1.0 -1.2 10.0

10.5

11.0

11.5

12.0

12.5

13.0

13.5

14.0

95% confidence

Predicted Values

Income, 20% Nonresponse Rate, First Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

14.5

0.838 0.703 0.702 4954.234 0.000 0.331

15.0

Linearity of the Model: Analysis of Variance Sums of

df

Mean

Regress. 542.9226 1

F

p-level

542.9226 4954.233 0.00

Residual 229.9148 2098 0.1096 Total

772.8375

Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 1.985916 0.007035 Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

8 6 4 2 0 -2 -4 -6 -1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

2.0

2.5

3.0

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores Dependent variable: LN SV 3.0 2.5

Residuals

2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 7

8

9

10 Predicted Values

11

12

13 95% confidence

14

Income, 20% Nonresponse Rate, Second Imputation Class Summary Statistics 0.906 0.821 0.821 5275.064 0.000 0.318

MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

Analysis of Variance Independence of Error Terms: Durbin-

Serial

Estimate 1.980753 0.008036 Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

6 4 2 0 -2 -4 -6 -8 -2.5

-2.0

-1.5

-1.0

-0.5

0.0 Residuals

Equality of Error Variances: Predicted vs. Residuals

0.5

1.0

1.5

2.0

Predicted vs. Residual Scores

Residuals

Dependent variable: LN SV 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 -2.5

9

10

11

12

13

14 95% confidence

Predicted Values

Income, 20% Nonresponse Rate, Third Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.9566 0.9151 0.9133 517.0385 0.0000 0.3277

Linearity of the Model: Analysis of Variance Sums of df

Mean

F

p-level

Regress. 55.51737 1 55.51737 517.0385 0.000000 Residual 5.15403 48 0.10738 Total

60.67140

Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 2.280743 -0.142227

Normality of Error Terms:

15

Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

4 3 2 1 0 -1 -2 -3 -1.2

-1.0

-0.8

-0.6

-0.4

-0.2

0.0

0.2

0.4

0.6

0.8

1.0

1.2

1.4

1.6

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LN SV 1.6 1.4 1.2 1.0 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 -1.0 -1.2

10

11

12

13

14

15 95% confidence

Predicted Values

Expenditure, 30% Nonresponse Rate, First Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.840 0.705 0.705 4382.102 0.000 0.290

16

Linearity of the Model: Analysis of Variance Sums of

df

Regress. 367.4335 1

Mean

F

p-level

367.4335 4382.102 0.00

Residual 153.5270 1831 0.0838 Total

520.9605 Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 2.072061 -0.036173 Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals

Expected Normal Value

10 8 6 4 2 0 -2 -4 -6

-2.0

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

2.0

2.5

3.0

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 9.0

9.5

10.0

10.5

11.0 Predicted Values

11.5

12.0

12.5

13.0 95% confidence

13.5

Expenditure, 30% Nonresponse Rate, Second Imputation Class Summary Statistics 0.890 0.791 0.791 3841.345 0.000 0.300

MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

Linearity of the Model: Analysis of Variance Sums of

df

Regress. 346.1192 1

Mean

F

p-level

346.1192 3841.345 0.00

Residual 91.1849 1012 0.0901 Total

437.3041 Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 2.023625 -0.012021 Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

10 8 6 4 2 0 -2 -4 -6 -8 -2.0

-1.5

-1.0

-0.5

0.0

0.5

1.0

Residuals

Equality of Error Variances: Predicted vs. Residuals

1.5

2.0

2.5

3.0

Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 9.5

10.0

10.5

11.0

11.5

12.0

12.5

13.0

13.5 95% confidence

Predicted Values

Expenditure, 30% Nonresponse Rate, Third Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.9425 0.8882 0.8856 333.7148 0.0000 0.3237

Linearity of the Model: Analysis of Variance Sums of df

Mean

F

p-level

Regress. 34.97366 1 34.97366 333.7148 0.000000 Residual 4.40164 42 0.10480 Total

39.37531

Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 2.589756 -0.326722

14.0

Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

4 3 2 1 0 -1 -2 -3 -0.8

-0.6

-0.4

-0.2

0.0

0.2

0.4

0.6

0.8

1.0

1.2

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LNSV 1.2 1.0 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 10.0

10.5

11.0

11.5

12.0

12.5

13.0

14.0 95% confidence

Predicted Values

Income, 30% Nonresponse Rate, First Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

13.5

0.845 0.713 0.713 4557.328 0.000 0.330

Linearity of the Model: Analysis of Variance

14.5

Sums of

df

Regress. 496.0775 1

Mean

F

p-level

496.0775 4557.328 0.00

Residual 199.3093 1831 0.1089 Total

695.3868

Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 2.094357 -0.047223

Normality of Error Terms: Normal Probability Plot

Normal Probability Plot of Residuals

Expected Normal Value

8 6 4 2 0 -2 -4 -6 -2.0

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

2.0

2.5

12.5

13.0

13.5

3.0

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LN SV 3.0 2.5 2.0 1.5 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 9.0

9.5

10.0

10.5

11.0

11.5 Predicted Values

12.0

95% confidence

Income, 30% Nonresponse Rate, Second Imputation Class

14.0

Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.909 0.826 0.826 4793.392 0.000 0.310

Linearity of the Model: Analysis of Variance Sums of

df

Regress. 460.5995 1

Mean

F

p-level

460.5995 4793.392 0.00

Residual 97.2436 1012 0.0961 Total

557.8431

Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 2.072614 -0.038549

Normality of Error Terms: Normal Probability Plot Normal Probability Plot of Residuals Expected Normal Value

6 4 2 0 -2 -4 -6 -2.0

-1.5

-1.0

-0.5

0.0

0.5

Residuals

Equality of Error Variances: Predicted vs. Residuals

1.0

1.5

2.0

Predicted vs. Residual Scores Dependent variable: LN SV 2.0 1.5 Residuals

1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 9

10

11

12

13

14 95% confidence

Predicted Values

Income, 30% Nonresponse Rate, Third Imputation Class Summary Statistics MULTIPLE R MULTIPLE R2 ADJUSTED R F-STAT P-VALUE STD. ERR. OF ESTIMATE

0.9654 0.9319 0.9303 574.8240 0.0000 0.2753

Linearity of the Model: Analysis of Variance Sums of df

Mean

F

p-level

Regress. 43.55594 1 43.55594 574.8240 0.000000 Residual 3.18245 42 0.07577 Total

46.73840 Independence of Error Terms: Durbin-Watson Test Durbin-

Serial

Estimate 2.249448 -0.190671 Normality of Error Terms: Normal Probability Plot

15

Normal Probability Plot of Residuals

Expected Normal Value

3 2 1 0 -1 -2 -3 -4 -1.0

-0.8

-0.6

-0.4

-0.2

0.0

0.2

0.4

0.6

0.8

Residuals

Equality of Error Variances: Predicted vs. Residuals Predicted vs. Residual Scores

Residuals

Dependent variable: LN SV 0.8 0.6 0.4 0.2 0.0 -0.2 -0.4 -0.6 -0.8 -1.0 10.0

10.5

11.0

11.5

12.0

12.5

13.0

13.5

14.0

14.5

15.0

95% confidence

Predicted Values

E. Comparison of Imputation Methods Variance Estimation and Biasness Measures of the effectiveness of the four imputation methods for imputing for the partial nonresponse data Expenditure Variable, 10% Nonresponse Rate METHODS

MD

MAD

RMSD

OMI HDI3 DRI3 SRI3

-6406.603632

56929.613669

108547.820041

-7204.560545 5363.466919

23839.817150 33683.480259

57726.615582 70553.643550

Income Variable, 10% Nonresponse Rate

METHODS

MD

MAD

RMSD

OMI HDI3 DRI3 SRI3

5978.393462

77502.270650

167206.240181

-11284.461504 9043.982223

32115.804981 51363.168122

77228.476946 106374.388796

Expenditure Variable, 20% Nonresponse Rate METHODS MD MAD RMSD OMI HDI3 DRI3 SRI3

METHODS OMI HDI3 DRI3 SRI3

-2497.141162

59555.355146

119193.320481

-7347.862975 5400.712813

23231.654338 33782.602655

53180.024322 72487.392833

Income Variable, 20% Nonresponse Rate MD MAD RMSD 14277.427361

87469.865623

244757.995335

-11059.090609 -9076.980826

35274.032863 57429.236572

114957.425614 148278.489381

Expenditure Variable, 30% Nonresponse Rate METHODS MD MAD RMSD OMI HDI3 DRI3 SRI3

742.526312

62388.111315

151740.940804

-7554.607707 1328.061322

24082.880071 32449.488519

59795.667757 72803.602152

Income Variable, 30% Nonresponse Rate METHODS MD MAD RMSD OMI HDI3 DRI3 SRI3

20310.908394

90396.258755

271775.351071

-13792.599597 1188.309972

34537.359021 51886.726288

103253.122885 131429.611649

Related Documents