This paper introduces a modular processing chain to derive global high-resolution maps of leaf traits. In particular, we present global maps at 500 m resolution of specific leaf area, leaf dry matter content, leaf nitrogen and phosphorus content per dry mass, and leaf nitrogen/phosphorus ratio. The processing chain exploits machine learning techniques along with optical remote sensing data (MODIS/Landsat) and climate data for gap filling and up-scaling of in-situ measured leaf traits. The chain first uses random forests regression with surrogates to fill gaps in the database (> 45% of missing entries) and maximizes the global representativeness of the trait dataset. Plant species are then aggregated to Plant Functional Types (PFTs). Next, the spatial abundance of PFTs at MODIS resolution (500 m) is calculated using Landsat data (30 m). Based on these PFT abundances, representative trait values are calculated for MODIS pixels with nearby trait data. Finally, different regression algorithms are applied to globally predict trait estimates from these MODIS pixels using remote sensing and climate data. The methods were compared in terms of precision, robustness and efficiency. The best model (random forests regression) shows good precision (normalized RMSE≤ 20%) and goodness of fit (averaged Pearson's correlation R = 0.78) in any considered trait. Along with the estimated global maps of leaf traits, we provide associated uncertainty estimates derived from the regression models. The process chain is modular, and can easily accommodate new traits, data streams (traits databases and remote sensing data), and methods. The machine learning techniques applied allow attribution of information gain to data input and thus provide the opportunity to understand trait-environment relationships at the plant and ecosystem scales. The new data products – the gap-filled trait matrix, a global map of PFT abundance per MODIS gridcells and the high-resolution global leaf trait maps – are complementary to existing large-scale observations of the land surface and we therefore anticipate substantial contributions to advances in quantifying, understanding and prediction of the Earth system.
Bibliographical noteFunding Information:
This research was financially supported by the NASA Earth Observing System MODIS project (grant NNX08AG87A ). This work was also supported by the European Research Council (ERC) funding under the ERC Consolidator Grant 2014 SEDAL (Statistical Learning for Earth Observation Data Analysis) project under Grant Agreement 647423 . We also want to gratefully acknowledge the efforts of all researchers involved at the TRY initiative on plant traits ( http://www.try-db.org ), hosted at the Max Planck Institute for Biogeochemistry, Jena, Germany. The authors would also like to thank the anonymous reviewers for their constructive comments on an earlier version of this manuscript.
© 2018 Elsevier Inc.
Copyright 2018 Elsevier B.V., All rights reserved.
- Machine learning
- Plant ecology
- Plant traits
- Random forests
- Remote sensing