dc.contributor.author | Gachoki, Stella | |
dc.contributor.author | Groen, Thomas A. | |
dc.contributor.author | Vrieling, Anton | |
dc.contributor.author | Skidmore, Andrew | |
dc.contributor.author | Masiga, Daniel | |
dc.date.accessioned | 2024-05-10T11:55:58Z | |
dc.date.available | 2024-05-10T11:55:58Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12562/1999 | |
dc.description | Publication | en_US |
dc.description.abstract | Vector-borne diseases, like those transmitted by tsetse flies, pose a significant global public health threat. Reducing vector populations is a promising strategy for disease control, especially in the case of tsetse-transmitted African trypanosomiasis. However, the cost-effective implementation of large-scale vector surveillance and control measures faces challenges due to the lack of spatially explicit and reliable maps identifying vector hotspots. In this study, we assessed the accuracy of predicting Glossina pallidipes relative densities across Kenya by linking constrained in-situ tsetse catch data from 660 traps across three Kenyan regions with readily available gridded satellite information (human population, land cover, soil properties, elevation, precipitation, and land surface temperature) using a classical random forest algorithm. To enhance predictive performance, we employed two feature elimination techniques specifically designed for machine learning algorithms, i.e., Recursive Feature Elimination (RFE) and Variable Selection Using Random Forests (VSURF). For each set of retained variables, we trained a Random Forest model using a spatial cross-validation technique. Our findings showed that tsetse fly relative densities decreased with mean annual precipitation and soil moisture and, conversely, increased with higher tree cover. Based on the cross-validated R², 41% of the spatial variability in relative densities of tsetse flies could be explained. For spatial extrapolation, only the set of predictors retained by VSURF closely matched known tsetse fly distributions in Kenya. This more accurate performance of VSURF may be attributed to its approach of assessing variables for both importance and their contribution to reducing prediction error. Our study demonstrates the potential of using a random forest method to upscale tsetse relative abundance predictions to the national level. However, the reliability of the current extrapolated map remains uncertain. We recommend: 1) increasing tsetse fly sampling efforts, particularly in the data-limited northern and eastern regions of Kenya, and 2) developing a more precise and accurate land cover map with classes that directly associate with known habitat characteristics of the target tsetse species. | en_US |
dc.description.sponsorship | German Federal Ministry for Economic Cooperation and Development (BMZ); Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ); International Agricultural Research (FIA); Royal Netherlands Academy of Arts and Sciences (KNAW); Schlumberger Foundation (Faculty for the Future fellowship); Swedish International Development Cooperation Agency (Sida); Swiss Agency for Development and Cooperation (SDC); Australian Centre for International Agricultural Research (ACIAR); Federal Democratic Republic of Ethiopia; Government of the Republic of Kenya | en_US |
dc.publisher | Ecological Informatics | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/us/ | * |
dc.subject | Tsetse abundance | en_US |
dc.subject | Machine learning | en_US |
dc.subject | Vector borne diseases | en_US |
dc.subject | Spatial extrapolations | en_US |
dc.subject | Satellite data | en_US |
dc.subject | Random forest | en_US |
dc.title | Towards accurate spatial prediction of Glossina pallidipes relative densities at country-scale in Kenya | en_US |
dc.type | Article | en_US |