Science Inventory

Decision-Tree, Rule-Based, and Random Forest Classification of High-Resolution Multispectral Imagery for Wetland Mapping and Inventory

Citation:

Berhane, T., C. Lane, Q. Wu, Brad Autrey, O. Anenkhonov, V. Chepinoga, AND H. Liu. Decision-Tree, Rule-Based, and Random Forest Classification of High-Resolution Multispectral Imagery for Wetland Mapping and Inventory. Remote Sensing. MDPI, Basel, Switzerland, 10(4):580, (2018).

Impact/Purpose:

In this paper, we systematically and comprehensively evaluated the utility of three non-parametric machine-learning algorithms (DT, RB and RF) for effective supervised classification of 22 complex freshwater deltaic wetland vegetation and aquatic habitats in the Selenga River Delta of Lake Baikal, Russia.

Description:

Efforts are increasingly being made to classify the world’s wetland resources, an important ecosystem and habitat that is diminishing in abundance. There are multiple remote sensing classification methods, including a suite of nonparametric classifiers such as decision-tree (DT), rule-based (RB), and random forest (RF). High-resolution satellite imagery can provide more specificity to the classified end product, and ancillary data layers such as the Normalized Difference Vegetation Index, and hydrogeomorphic layers such as distance-to-a-stream can be coupled to improve overall accuracy (OA) in wetland studies. In this paper, we contrast three nonparametric machine-learning algorithms (DT, RB, and RF) using a large field-based dataset (n = 228) from the Selenga River Delta of Lake Baikal, Russia. We also explore the use of ancillary data layers selected to improve OA, with a goal of providing end users with a recommended classifier to use and the most parsimonious suite of input parameters for classifying wetland-dominated landscapes. Though all classifiers appeared suitable, the RF classification outperformed both the DT and RB methods, achieving OA >81%. Including a texture metric (homogeneity) substantially improved the classification OA. However, including vegetation/soil/water metrics (based on WorldView-2 band combinations), hydrogeomorphic data layers, and elevation data layers to increase the descriptive content of the input parameters surprisingly did not markedly improve the OA. We conclude that, in most cases, RF should be the classifier of choice. The potential exception to this recommendation is under the circumstance where the end user requires narrative rules to best manage his or her resource. Though not useful in this study, continuously increasing satellite imagery resolution and band availability suggests the inclusion of ancillary contextual data layers such as soil metrics or elevation data, the granularity of which may define its utility in subsequent wetland classifications.

Record Details:

Record Type:DOCUMENT( JOURNAL/ PEER REVIEWED JOURNAL)
Product Published Date:04/09/2018
Record Last Revised:04/16/2018
OMB Category:Other
Record ID: 340392