Record Display for the EPA National Library Catalog

RECORD NUMBER: 75 OF 78

PREV NEXT

Main Title

Using 'Found' Data to Augment a Probability Sample: Procedure and Case Study.

Author

Overton, J. M. C. ; Young, T. C. ; Overton, W. S. ;

CORP Author

California Univ., Los Angeles. Dept. of Biology. ;Clarkson Univ., Potsdam, NY. Dept. of Civil and Environmental Engineering. ;Oregon State Univ., Corvallis. Dept. of Statistics.;Corvallis Environmental Research Lab., OR.

Publisher

cJul 92

Year Published

1992

Report Number

EPA/600/J-94/223;

Stock Number

PB94-169984

Additional Subjects

Water quality ; Sampling ; Population(Statistics) ; Probability theory ; Estimates ; Data bases ; Mathematical models ; Reprints ;

Holdings

Library	Call Number	Additional Info	Location	Last Modified	Checkout Status
NTIS	PB94-169984	Some EPA libraries have a fiche copy filed under the call number shown.		07/26/2022

Collation

20p

Abstract

While the probability sampling has the advantage of permitting unbiased population estimates, many past and existing monitoring schemes do not employ probability sampling. The authors describe and demonstrate a general procedure for augmenting an existing probability sample with data from nonprobability-based surveys ('found' data). The procedure uses sampling frame attributes to group the probability and found samples into similar subsets. Subsequently, the similarity is assumed to reflect the representativeness of the found sample for the matching subpopulation. Two methods of establishing similarity and producing estimates are described: pseudo-random and calibration. The pseudo-random method is used when the found sample can contribute additional information on variables already measured for the probability sample, thus increasing the effective sample size. The calibration method is used when the found sample contributes information that is unique to the found observations. For either approach, the found sample data yield observations that are treated as a probability sample, and population estimates are made according to a probability estimation protocol. To demonstrate these approaches, the authors applied them to found and probability samples of stream discharge data for the southeastern US.

PREV NEXT