Home Reports Start

1999 Annual Water Level Raw Data Report for Kansas

Prev Page--Acquisition Activity || Next Page--Conclusions


II: Statistical Quality Control Measurements

John C. Davis, Mathematical Geology Section

INTRODUCTION

The Quality Control and Assurance Program for the 1999 observation well measurement season followed the general outline for quality assurance developed during the preceding two years, as described in Miller, Davis, and Olea (1997). This discussion of procedures is taken primarily from that source.

The primary variable measured in the water well observation program is depth to water in an observation well. This primary variable is associated with three secondary variables; the ground elevation, east-west coordinate, and north-south coordinate of the well. The secondary variables serve to locate the primary variable in space, and make it possible to determine spatial relationships between observation wells, including mapping the water table and calculating changes in aquifer volume. Historically, the three location variables were determined initially by the U.S. Geological Survey for each well and not re-determined unless a serious error in the original coordinates was suspected. In the 1997 groundwater observation measurement program conducted by the Kansas Geological Survey, the geographic (latitude and longitude) coordinates of all wells were re-determined by GPS techniques. In subsequent measurement programs, all observation wells were again redetermined by GPS. The 1999 measurements are considered the most accurate for reasons discussed in Miller, Davis, and Olea (1998) and are used in this study.

In addition, several secondary characteristics of the observation wells and of the measurement procedure were noted in order to determine if these might influence the quality of the measurements being made (in statistical parlance, these extra measurements are called exogenous variables). As part of the quality control program, water level measurements were repeated two or more times on 122 wells, yielding a collection of 164 quality control observations. Because these data include replicates, they provide an additional check on estimates of the influence of well conditions or measuring techniques on water levels. A subsequent round of measurements resampled 51 wells selected at random from the original set for quality assurance purposes.

The primary variable, depth to water, varies with geographic location and differences in topography so much that these factors will overwhelm all other sources of variation. This means that any errors in location may have a profound effect on the water table elevation. To avoid the complications of simultaneously considering uncertainties in the secondary variables, this statistical quality control study is based on first differences (specifically, the difference between 1999 and 1998 depth-to-water measurements). The secondary variables cancel out, leaving only the difference in depth, which is numerically identical to the year's change in water level. In this statistical quality control study, the difference between 1999 and 1998 corrected depth measurements is abbreviated "'99-'98." If the water table is lower this year, the variable '99-'98 will be a positive number. Because seventeen wells measured in the current program were not measured in 1998, a total of 525 wells have the variable '99-'98.

The objective in our quality control study is to identify and assess possible sources of unwanted variation in water level measurements made by the KGS. The purpose of the analysis is to provide guidance to the KGS field measurement program, to suggest ways in which field measurements might be improved, and to provide information necessary to identify past or current measurements that are suspect. The statistical quality control and field measurement programs have been intimately intertwined from the outset when the KGS assumed responsibility in 1997 for measuring observation wells formerly measured by the USGS. A comparison of results from 1999 with those from previous years shows that the quality control program is resulting in the desired improvements in the measurement program.

STATISTICAL PROCEDURES

An analysis of variance (ANOVA) procedure was used to estimate the significance of different well and procedural characteristics on '99-'98 in the initial set of 542 observations. The following variables were recorded for each well.
1. Depth to water
2. GPS longitude
3. GPS latitude
4. Date
5. Measurer's initials
6. Well Access
1 = good
0 = poor
7. Downhole Access
1 = good
0 = poor
8. Weighted Tape
1 = yes
0 = no
9. Oil on Water
1 = yes
0 = no
10. Chalk Cut Quality
2 = excellent
1 = good
0 = poor
In addition, each well has a unique USGS ID number and KGS ID designation, a surface elevation, a legal description of the well location, a UTM coordinate pair (obtained by LEO conversion of the GPS coordinates), and the purpose for which the well is used. The variable Aquifer Code describes the primary source of water in the well; the manner in which aquifer code values were assigned is summarized in Miller, Davis, and Olea (1997). The additional variables taken from the historical records are:
11. Well Use
H = household water supply
S = stock water
I = irrigation
U = unused
12. Aquifer Code
KD = Cretaceous Dakota aquifer
KJ = undifferentiated Cretaceous/Jurassic aquifer
KN = Cretaceous Niobrara aquifer
KU = undifferentiated Cretaceous aquifer
QA = Quaternary alluvium aquifer
QAQU = Quaternary alluvium and undifferentiated aquifers
QAQUTO = Quaternary alluvium and undifferentiated aquifers and Tertiary Ogallala aquifer
QATO = Quaternary alluvium and Tertiary Ogallala aquifers
QU = Quaternary undifferentiated aquifer
QUTO = Quaternary undifferentiated and Tertiary Ogallala aquifers
QUTOKJ = Quaternary undifferentiated, Tertiary Ogallala, and Cretaceous/ Jurassic aquifers
QUKD = Quaternary undifferentiated and Cretaceous Dakota aquifers
TO = Tertiary Ogallala aquifer
TOKD = Tertiary Ogallala and Cretaceous Dakota aquifers
TOKJ = Tertiary Ogallala and undifferentiated Cretaceous/Jurassic aquifers
The initial statistical model includes all exogenous variables recorded during the quality control study that may contribute to the variability in the response, '99-'98, plus the variables Well Use and Aquifer Code. In contrast to results obtained in 1998, several exogenous variables contribute significantly to the total variance. There is a significant operator effect as measured by the variable Measurer. Use (or non-use) of a Weighted Tape is a significant source of variance, as is Well Usage and the quality of the Chalk Cut. As expected, Aquifer Code is a significant source of variance. Last year, Measurer was only marginally significant, as was Well Use. Other variables made no contribution to the variance of the response variable.

Analysis of Variance Table for Initial Model
SourceDFSum of Squares Mean SquareF RatioProb>F
Model291225.999742.27592.4983>0.0001**
Measurer7302.2569443.179562.55170.0137**
Well Access 12.715342.715340.16050.6889ns
Downhole Access10.465080.465080.02750.8684ns
Weighted Tape1175.35171175.3517110.36250.0014**
Well Use3178.9114659.637103.52430.0149*
Oil on Water16.968906.968900.41180.5213ns
Chalk Cut Quality2155.8945977.947304.60640.0104**
Aquifer Code13419.4897032.268441.90690.0271*
Error5128663.910916.9217 
Total5419889.9106 
RSquare0.123965 

A revised model was run that combined aquifers into classes similar to those used in 1997 and 1998. This 5-part classification distinguishes between (1) alluvial aquifers, (2) alluvial aquifers plus other unconsolidated aquifers, (3) the High Plains aquifer, (4) bedrock aquifers, and (5) bedrock plus unconsolidated aquifers. This has the effect of reducing the degrees of freedom required for the model and thus increasing the sensitivity of the analysis.

Analysis of Variance Table for Initial Model
SourceDFSum of Squares Mean SquareF RatioProb>F
Model201067.040453.35203.1505>0.0001**
Measurer7296.6439342.377702.50240.0155*
Well Access 14.953004.953000.29250.5889ns
Downhole Access10.167170.167170.00990.9209ns
Weighted Tape1224.30268224.3026813.24530.0003**
Well Use3153.1292351.043083.01410.0296*
Oil on Water18.936198.936190.52770.4679ns
Chalk Cut Quality2147.7238973.861944.36160.0132**
Aquifer Group4260.5303565.132593.84610.0043**
Error5218822.870216.9345 
Total5419889.9106 
RSquare0.107892 

Most of the components in the revised model are significant, in strong contrast with results obtained in 1998, when most components of the equivalent model were not significant. Unfortunately, past models are not directly comparable because there are different numbers of classes in Measurer and Aquifer Code, and the response variable has significantly different variances from year to year. It is important to note that the variance of the response variable in 1998 was unusually low; the variance of the response in 1999 is almost three times greater than last year, and is comparable to the variance in 1997. (Direct comparisons of responses from year to year are not possible because of the different numbers of wells, and possible differences in the behavior of the specific wells measured in each year.)

One way to improve the statistical analysis is to discard wells in which the exogenous variables make unusually high contributions to the total variance, arguing that the readings from such wells are atypical and likely erroneous. We can identify 24 wells which seem to make spurious contributions to the variance. Most of these wells were repeatedly measured, indicating that the measurers either had difficulty obtaining a reliable reading or that the initial depth to water was significantly out-of-trend. By removing fewer than 5% of the wells, the total variance can be reduced to one-fifth its original value. After removal, only Measurer, Weighted Tape, and Aquifer Group contribute significantly to the variance, a result that is comparable to results obtained in 1988.

Analysis of Variance table for Grouped Aquifers After Removal of 24 Wells
SourceDFSum of Squares Mean SquareF RatioProb>F
Model20303.684115.18424.6110<0.0001**
Measurer7112.8732016.124744.8967<0.0001**
Well Access 10.622280.622280.18900.6640ns
Downhole Access14.453944.453941.35250.2454ns
Weighted Tape124.6719224.671927.49220.0064**
Well Use320.624986.874992.08780.1009ns
Oil on Water11.057331.057330.32110.5712ns
Chalk Cut Quality212.831856.415931.94830.1436ns
Aquifer Group4109.5470827.386778.3166<0.0001**
Error4971636.62633.2930 
Total5171940.3103 
RSquare0.156513 

Importance of Contributing Variables

We can determine the relative contributions of each category of the contributing variables by examining the least-squares means (averages) of '99-'98 for a specified state of a variable, while holding all other variables at their average value. (In statistical parlance, these averages are referred to as the expected values of the variables.) A positive value indicates the average depth to water in a well is greater in 1999 than in 1998 (the water level has dropped from last year's measurement). That is, the elevation of the water level in the well is lower than it was previously. The following list gives the least-squares means for the complete data set and for the data after removal of the 24 wells.

Operator
LevelOriginal
Least Sq Mean
24 Wells Removed
Least Sq Mean
DR0.19072.2155
DRL-1.88731.6487
JCS-0.08171.7368
JDS-0.65261.1995
JMA-1.62330.7093
JMH0.53722.1048
RB-0.81020.7715
RDM-0.90561.3163

Well Access
LevelOriginal
Least Sq Mean
24 Wells Removed
Least Sq Mean
0-0.44611.5382
1-0.86211.3873

Downhole Access
LevelOriginal
Least Sq Mean
24 Wells Removed
Least Sq Mean
0-0.68161.6120
1-0.62661.3136

Weighted Tape
LevelOriginal
Least Sq Mean
24 Wells Removed
Least Sq Mean
0-1.62581.1305
10.31761.7951

Well Use
LevelOriginal
Least Sq Mean
24 Wells Removed
Least Sq Mean
H-1.07880.6683
I-0.92951.8822
S1.21491.8468
U-1.82301.4539

Oil on Water
LevelOriginal
Least Sq Mean
24 Wells Removed
Least Sq Mean
0-0.83581.5261
1-0.47241.3995

Chalk Cut
LevelOriginal
Least Sq Mean
24 Wells Removed
Least Sq Mean
0-2.91562.1813
10.43361.1637
20.51971.0434

Geologic Group
LevelOriginal
Least Sq Mean
24 Wells Removed
Least Sq Mean
1 (Cretaceous)-2.81282.9674
2 (Alluvium)-0.72490.4852
3 (Al. + Tert.)-0.20740.8042
4 (Tertiary)0.07351.3210
5 (Tert. + K)0.40101.7362

Summary of the Analyses of Variance

Data collected in 1999 show significant variations attributable to Measurer, Downhole Access, Weighted Tape, and Chalk Cut, in addition to differences between the aquifer being tapped by the well. Most of the variation seems attributable to extreme changes in water level in a very few wells. The standard deviation of variable '99-'98 is 4.21 ft., almost twice the 2.48 ft. standard deviation of variable '98-'97. When the 24 troublesome wells discussed above are removed from the analysis, the standard deviation of variable '99-'98 is only 1.94 ft. The median decline in water level from 1998 to 1999 (0.72 ft.), although not large, is almost twice the median decline in the previous year; that is, from 1997 to 1998 (0.41 ft.). Removal of the 24 wells does not significantly change the median decline in water level (0.76 ft.). The fact that variation in the change in water table elevation shows a continuing reduction if certain wells are removed from the calculations suggests that quality control measures are successful in controlling exogenous variation but the improvement in data quality is being swamped by unreliable responses from a small number of wells. This is a strong argument for systematically replacing those wells in the network which yield unreliable values.

There are significant differences between the operators, particularly from JMH. (In addition, measurer DR is not part of the measurement team but is a DWR staff member who provided the water depth for a locked and hence inaccessible well.) These significant differences between measurers remain after removal of troublesome wells, showing that the differences are systematic and probably due to measurement techniques and not to the particular wells measured. There is little difference between wells with good access and poor access (an average difference of 0.4 foot, or 0.2 foot after removal of 24 wells) and the same small difference between wells with good downhole access and poor downhole access (0.4 foot and 0.3 foot). This similarity, coupled with measurer's informal comments, suggests that the field crews do not clearly distinguish between these two variables. The variable Well Access may be redundant; a better measure of this property may be extractable from the WaterWitch time log.

The water level measured without a weight on the tape tends to be almost 2 feet shallower than water levels measured in wells where the tape is weighted, but the difference drops to only 0.7 foot when the 24 wells are removed. By comparison, the difference between using or not using a weight on the tape was related to a difference in water level of only 0.4 feet in 1998, but more than 2.6 feet in 1997.

In 1999, the classification (P) was combined with (H) because only one public well was measured. Usage from all wells of unknown (?) classification was resolved using field photographs and notes. Eliminating these two classes of well usage removed what had been a major source of variance in previous years. Only unused or monitoring wells (U) are significantly different than other classes, measuring over 1.8 ft. shallower. With removal of the 24 wells, differences attributable to well usage is no longer significant. Household wells (H) deviate most from other classes of usage (0.8 ft. deeper than average), but the difference is not statistically significant.

If the quality of the chalk cut on the measuring tape is "poor," the reading tends to be over 2.9 foot shallower than in 1998; if the cut is "good" or "excellent," the reading tends to be about a half-foot deeper than the preceding year. If the 24 wells are removed from the data set, the differences attributable to chalk cut quality are no longer statistically significant.

Water levels in exclusively Cretaceous aquifers (1) tend to be almost 2.8 feet shallower than in 1997 (3.0 feet after removal of 24 wells). The Ogallala aquifer (4) tends to be about the same (1.3 feet deeper after removal of 24 wells) as last year. Measurements made in wells tapping alluvial aquifers (2) or alluvial plus other sources (3) tend to be about a half-foot shallower. Water levels in wells tapping Cretaceous aquifers plus Quaternary and/or Tertiary aquifers (5) tend to be about 0.4 feet deeper (1.7 feet after removal of 24 wells) this year. Unlike measurements made in 1998, the differences in annual changes of water level for different aquifers are highly significant, and remain significant after deletion of the 24 troublesome wells.

The ANOVA equation can be used to create an expected value and residual (difference between observed and expected value) for each well. The distribution of residuals should be approximately normal. Examination of the residual outliers will reveal any well measurements which cannot be explained by extreme combinations of the different sources of variation. Four wells have been identified by this process. These wells show rises in water level between 1998 and 1999 that are far outside the range expected; each well was measured repeatedly (usually by different individuals) and the measurements proved to be reproducible. These well measurements may be correct and reflect highly unusual changes in aquifer level; the wrong wells may have been measured in 1998 or 1999; or changes in well construction or other factors may have altered the measurability of a well (field notes indicate measurement difficulties with all four wells). The four wells are:

37595809953010124S 23W 06AAB 01
37442110149090126S 41W 32DDB 01
37322810059570129S 34W 11ADD 01
37220010021400131S 28W 10BCB 01

All of these wells are contained in the set of 24 wells that were identified by their extreme expected values and excluded from the second Analysis of Variance.

Quality Assurance (Remeasure) Program

The 1999 Quality Assurance program of random remeasurements showed that the QA data contained statistically significant sources of variation, in contrast to 1998 when no significant sources of variation were detected among the QA wells. This is in conformity with characteristics observed in the main 1999 measurements, which are much more variable than in 1998. Fifty-one randomly selected QA wells were remeasured during the late January supplementary data collection period. Combined with measurements made during the regular collection period, 132 measurements were available for statistical quality control. In spite of the additional control provided by replication, significant contributions from exogenous variables were detected. As expected, the variance among the QA replicates is much lower, approximately half, that of the complete data set. The most extreme value of '99-'98 among the QA wells is only 14.7, compared to an extreme of -54.4 in the complete data set. The most extreme well in the QA data set is:

37403509950010127S 23W 28AAA 01

Within the QA data set, there are significant contributions due to Measurer, mostly attributable to DRL and to a lesser extent to RDM. Wells in the QA set measured by DRL show a change (decline in water level) in '99-'98 of about 2 feet, double the average change for the QA data. Downhole Access is a highly significant source of variation; much of this is attributable to a single well, 27S 23W 28AAA 01, but Downhole Access remains a significant source of variation even after this well is removed. Wells with poor downhole access tend to measure about 3 feet deeper than wells with good downhole access, possibly the result of tape deflections and hang-ups in the wells. There are significant effects attributable to Oil on Water, but these are due almost entirely to one well, 25S 35W 25BBB 03, which has an average decline of -5.65 ft. with a very small variance over 4 repeated measurements. There are significant differences attributable to use of Weighted Tape; unweighted measurements tend to be about 1.1 ft. deeper than expected. Well Usage is anticipated to be a significant source of variation, and this is confirmed in the QA data. However, only household use appears to be significantly different than other uses, resulting in measurements that are 2 to 3 ft. deeper than expected. Quality of the Chalk Cut has a significant effect; a poor cut tends to be about 2.5 ft. deeper than an excellent cut, or 4 ft. deeper than a good cut. Although there are significant effects attributable to Geological Unit, these are unreliable because of the small number of measurements per unit. Geological Groups, which are aggregations of Geological Units, show no significant differences in the QA data set.

CONCLUSIONS

The purpose of the Quality Control and Assurance Program is to identify wells and procedural conditions that may contribute significantly to the variance of Depth to Water measured in observation wells, and which do not reflect true changes in the water table elevation. Gathering Quality Control information requires little additional effort by the field crews, emphasizes the importance of procedural consistency, and certifies performance. Quality Control for the 1999 field season shows inconsistencies compared to the 1998 field season. The variability of the randomly-gathered QA data reflects the increased variability in the main body of 1999 data. The results can be interpreted as reinforcing the need for training and the desirability of deleting troublesome wells from the monitoring program. The QA process continues to identify specific wells as troublesome, and flags well locations which require verification before being permanently incorporated into the WIZARD data base.

The Quality Control program has achieved its objectives of identifying and quantifying sources of unwanted variation in observation well data collection, and in flagging wells whose measurements required verification. It detected numerous spurious values, both in the measured data and in the historical data, resulting in a much "cleaner" data base than otherwise would have been the case. As the Quality Control process is routinely applied to KGS observation well measurements in the future, and particularly if it is applied to the entire Kansas observation well network, the quality of the data will be progressively improved with time.

Prev Page--Acquisition Activity || Next Page--Conclusions


Kansas Geological Survey, Water Level CD-ROM
Send comments and/or suggestions to webadmin@kgs.ku.edu
Updated March 9, 2000
Available online at URL = http://www.kgs.ku.edu/Magellan/WaterLevels/CD/Reports/OFR995/rep03.htm