Science Inventory

LAKECAT – DATASET OF LAKE BASIN CHARACTERISTICS

Citation:

Hill, R. AND S. Leibowitz. LAKECAT – DATASET OF LAKE BASIN CHARACTERISTICS. U.S. EPA Office of Research and Development, Washington, DC, EPA/600/F-18/009, 2020.

Impact/Purpose:

FACTSHEET FOR LAKECAT. IMPACT STATEMENT OF ORIGINAL ARTICLE: We developed an extensive dataset of landscape metrics for 378,088 lakes and their associated catchments within the conterminous USA: The Lake-Catchment (LakeCat) Dataset. This dataset summarizes nearby and upslope landscape features for these lakes, and includes both natural (e.g., climate, soils, and lithology) and anthropogenic (e.g., urbanization, agriculture, and dams) landscape features. The dataset will parallel and complement a recently published dataset of watershed characteristics for streams, the StreamCat Dataset, in the numbers and types of watershed metrics. StreamCat is beginning to be widely used by university, government, and NGO researchers and managers, and it is anticipated that LakeCat will provide similar benefits. At publication, the dataset will contain at least 170 landscape metrics, but more will be added as they become available. These data will be made available to the public for download and will greatly reduce the specialized geospatial expertise needed by researchers and managers to extract an extensive suite of watershed characteristics for lakes of interest. This paper provides a detailed description of the development and main features of LakeCat. The final publication will point to the URL home of the dataset (https://www.epa.gov/national-aquatic-resource-surveys/lakecat), which will also contain extensive metadata. All code used to develop the LakeCat Dataset will be made publicly available through a GitHub website (https://github.com/USEPA/LakeCat) to ensure transparency of methods. In addition, the paper provides an illustration (with scripting code) of how LakeCat can be used. This illustration modeled eutrophication based on samples from the National Lakes Assessment. It then predicted the probability of eutrophication for 297,071 unsampled lakes across the conterminous US.
The map of lake-specific predicted probabilities of eutrophication produced by this illustration could provide an important tool for states and managers to focus monitoring and sampling efforts. LakeCat data may also be of use to the Office of Water for a number of applications, including modeling reference condition for National Lakes Assessment sample sites. This manuscript is an FY17 milestone under SSWR 3.01B.1, “Mapping of watershed integrity, stream condition, and lake condition for the continental US”.

Description:

FACTSHEET FOR LAKECAT. ABSTRACT OF ORIGINAL ARTICLE: Natural and human-related landscape features influence the ecology and water quality within lakes. It is critical to summarize these features in a hydrologically meaningful way to understand and manage lake ecosystems. Such summaries are often done through the delineation of watershed boundaries of individual lakes. However, there are many technical challenges associated with delineating hundreds or thousands of lake watersheds at broad spatial extents that can limit the application of analyses to new, unsampled locations. We present the development of the Lake-Catchment (LakeCat) Dataset (https://www.epa.gov/national-aquatic-resource-surveys/lakecat), a dataset of watershed features for 378,088 lakes within the conterminous US. We describe the methods we used to (1) delineate lake catchments, (2) hydrologically connect nested lake catchments, and (3) generate several hundred watershed-level metrics that summarize both natural (e.g., soils, geology, climate, and land cover) and anthropogenic (e.g., urbanization, agriculture, and mines) features. To illustrate how this dataset can be used, we developed a random forest model to predict the probability of lake eutrophication by combining LakeCat with data from US EPA’s National Lakes Assessment (NLA). This model correctly predicted the trophic state of 72% of NLA lakes, and we applied the model to predict the probability of eutrophication at 297,071 unsampled lakes across the conterminous US. The large suite of LakeCat metrics could be used to improve analyses of lakes at broad spatial extents, improve the applicability of analyses to new, unsampled lakes, and ultimately improve the management of these important ecosystems.
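
As a rough illustration of steps (2) and (3) above, the sketch below shows one way local catchment values could be accumulated through nested catchments to give a watershed-level metric. It is not the LakeCat code. The catchment IDs, areas, percent-urban values, flow links, and the function name watershed_metrics are all hypothetical, and it assumes an area-weighted mean and a network in which each catchment drains to at most one downstream catchment.

```python
from collections import defaultdict

# Hypothetical local catchments: id -> (area in km2, percent urban within that catchment)
catchments = {
    "A": (2.0, 10.0),
    "B": (3.0, 40.0),
    "C": (5.0, 0.0),
}

# Hypothetical flow links: upstream catchment id -> downstream catchment id.
# Assumption: each catchment drains to at most one downstream catchment.
flows_to = {"A": "C", "B": "C"}


def watershed_metrics(catchments, flows_to):
    """Return id -> (watershed area, area-weighted mean metric) for every catchment."""
    # Invert the links so each catchment knows its direct upstream neighbours.
    upstream = defaultdict(list)
    for up, down in flows_to.items():
        upstream[down].append(up)

    cache = {}

    def accumulate(cid):
        # Total area and area * metric for a catchment plus everything upstream of it.
        if cid in cache:
            return cache[cid]
        area, value = catchments[cid]
        weighted = area * value
        for up in upstream[cid]:
            up_area, up_weighted = accumulate(up)
            area += up_area
            weighted += up_weighted
        cache[cid] = (area, weighted)
        return cache[cid]

    result = {}
    for cid in catchments:
        total_area, total_weighted = accumulate(cid)
        result[cid] = (total_area, total_weighted / total_area)
    return result


result = watershed_metrics(catchments, flows_to)
for cid, (area, pct_urban) in sorted(result.items()):
    print(cid, area, pct_urban)
```

In this toy network, catchments A and B are headwaters, so their watershed values equal their local values, while C combines its own catchment with both upstream catchments. A recursive traversal is used for brevity; a very deep network would call for an iterative one.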

Record Details:

Record Type: DOCUMENT (COMMUNICATION PRODUCT/EXTERNAL FACT SHEET)
Product Published Date: 12/01/2020
Record Last Revised: 04/14/2021
OMB Category: Other
Record ID: 351401