-
Modeling the Influence of Local Environmental Factors on Malaria Transmission in Benin and Its Implications for Cohort Study
Authors:
Gilles Cottrell,
Bienvenue Kouwaye,
Charlotte Pierrat,
Agnès Le Port,
Bouraïma Aziz,
Noël Fonton,
Achille Massougbodji,
Vincent Corbel,
Mahouton Norbert Hounkonnou,
André Garcia
Abstract:
Malaria remains endemic in tropical areas, especially in Africa. For the evaluation of new tools and to further our understanding of host-parasite interactions, knowing the environmental risk of transmission, even at a very local scale, is essential. The aim of this study was to assess how malaria transmission is influenced and can be predicted by local climatic and environmental factors. As the entomological part of a cohort study of 650 newborn babies in nine villages in the Tori Bossito district of Southern Benin between June 2007 and February 2010, human landing catches were performed to assess the density of malaria vectors and transmission intensity. Climatic factors as well as household characteristics were recorded throughout the study. Statistical correlations between Anopheles density and environmental and climatic factors were tested using a three-level Poisson mixed regression model. The results showed both temporal variations in vector density (related to season and rainfall), and spatial variations at the level of both village and house. These spatial variations could be largely explained by factors associated with the house's immediate surroundings, namely soil type, vegetation index and the proximity of a watercourse. Based on these results, a predictive regression model was developed using a leave-one-out method, to predict the spatiotemporal variability of malaria transmission in the nine villages. This study points up the importance of local environmental factors in malaria transmission and describes a model to predict the transmission risk of individual children, based on environmental and behavioral characteristics.
Submitted 19 August, 2016;
originally announced August 2016.
-
Anopheles number prediction on environmental and climate variables using Lasso and stratified two levels cross validation
Authors:
Bienvenue Kouwaye
Abstract:
This paper deals with predicting the number of Anopheles using environmental and climate variables. Variable selection is performed by an automatic machine learning method based on the Lasso and stratified two-level cross-validation. The selected variables are debiased, while the prediction is generated by a simple GLM (generalized linear model). Finally, the results prove to be qualitatively better, in terms of selection, prediction, and CPU time, than those obtained by the B-GLM method.
Submitted 4 August, 2016;
originally announced August 2016.
-
Regression Trees and Random forest based feature selection for malaria risk exposure prediction
Authors:
Bienvenue Kouwayè
Abstract:
This paper deals with predicting the number of Anopheles, the main vector of malaria, using environmental and climate variables. Variable selection is based on an automatic machine learning method using regression trees and random forests combined with stratified two-level cross-validation. The minimum threshold of variable importance is assessed using the quadratic distance of the variable importances, while the optimal subset of selected variables is used to perform predictions. Finally, the results prove to be qualitatively better, in terms of selection, prediction, and CPU time, than those obtained by the GLM-Lasso method.
Submitted 24 June, 2016;
originally announced June 2016.
-
Lasso based feature selection for malaria risk exposure prediction
Authors:
Bienvenue Kouwayè,
Noël Fonton,
Fabrice Rossi
Abstract:
In the life sciences, experts generally use empirical knowledge to recode variables, choose interactions, and perform selection by a classical approach. The aim of this work is to develop an automatic learning algorithm for variable selection, which can indicate whether experts can be helped in their decisions, or simply replaced by the machine, and improve their knowledge and results. The Lasso method can detect the optimal subset of variables for estimation and prediction under some conditions. In this paper, we propose a novel approach which automatically uses all available variables and all interactions. By combining a double cross-validation with the Lasso, we select a best subset of variables, and with a GLM through a simple cross-validation we perform predictions. The algorithm ensures the stability and the consistency of the estimators.
Submitted 4 November, 2015;
originally announced November 2015.
-
Sélection de variables par le GLM-Lasso pour la prédiction du risque palustre
Authors:
Bienvenue Kouwayè,
Noël Fonton,
Fabrice Rossi
Abstract:
In this study, we propose an automatic learning method for variable selection based on the Lasso in an epidemiological context. One aim of this approach is to avoid the preprocessing that experts in medicine and epidemiology apply to collected data. This preprocessing consists of recoding some variables and choosing some interactions based on expertise. The proposed approach uses all available explanatory variables without preprocessing and automatically generates all interactions between them. This leads to a high-dimensional problem. We use the Lasso, one of the robust methods of variable selection in high dimension. To avoid overfitting, a two-level cross-validation is used. Because the target variable is a count variable and the Lasso estimators are biased, the variables selected by the Lasso are debiased by a GLM and used to predict the distribution of Anopheles, the main vector of malaria. Results show that only a few climatic and environmental variables are the main factors associated with malaria risk exposure.
Submitted 9 September, 2015;
originally announced September 2015.
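
As a rough illustration of the select-then-refit idea described in the Lasso abstracts above (Lasso selection under two levels of cross-validation, followed by a GLM refit of the selected variables to debias the estimates), here is a minimal NumPy sketch. It is not the authors' implementation: the function names, the synthetic data, the majority-vote rule across outer folds, and the use of a Gaussian Lasso on log(1 + count) in place of a Poisson-penalized Lasso are all assumptions made for illustration, and stratification of the folds is omitted.

```python
import numpy as np

def lasso_cd(X, y, lam, n_iter=200):
    """Coordinate descent for (1/2n)||y - Xb||^2 + lam*||b||_1 (no intercept).
    Assumes no column of X is identically zero."""
    n, p = X.shape
    beta = np.zeros(p)
    resid = y.astype(float).copy()
    col_sq = (X ** 2).sum(axis=0)
    for _ in range(n_iter):
        for j in range(p):
            rho = X[:, j] @ resid + col_sq[j] * beta[j]
            new = np.sign(rho) * max(abs(rho) - n * lam, 0.0) / col_sq[j]
            resid += X[:, j] * (beta[j] - new)  # keep resid = y - X @ beta
            beta[j] = new
    return beta

def cv_lambda(X, y, lams, k, rng):
    """Inner level: pick the penalty with the lowest k-fold squared error."""
    idx = np.arange(len(y))
    errs = np.zeros(len(lams))
    for fold in np.array_split(rng.permutation(len(y)), k):
        train = np.setdiff1d(idx, fold)
        y_mean = y[train].mean()
        for i, lam in enumerate(lams):
            b = lasso_cd(X[train], y[train] - y_mean, lam)
            errs[i] += ((y[fold] - y_mean - X[fold] @ b) ** 2).sum()
    return lams[int(np.argmin(errs))]

def select_variables(X, counts, lams, k_outer=5, k_inner=5, seed=0):
    """Outer level: keep variables chosen in a majority of outer folds
    (the voting rule is an assumption of this sketch)."""
    rng = np.random.default_rng(seed)
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)
    target = np.log1p(counts)  # Gaussian surrogate for the count response
    idx = np.arange(len(target))
    votes = np.zeros(X.shape[1])
    for fold in np.array_split(rng.permutation(len(target)), k_outer):
        train = np.setdiff1d(idx, fold)
        lam = cv_lambda(Xs[train], target[train], lams, k_inner, rng)
        b = lasso_cd(Xs[train], target[train] - target[train].mean(), lam)
        votes += (b != 0)
    return np.flatnonzero(votes > k_outer / 2)

def poisson_irls(X, y, n_iter=25):
    """Unpenalized Poisson GLM (log link) fitted by IRLS; intercept first."""
    Xd = np.column_stack([np.ones(len(y)), X])
    eta = np.log(y + 0.5)  # standard starting value for count data
    beta = np.zeros(Xd.shape[1])
    for _ in range(n_iter):
        mu = np.exp(eta)
        z = eta + (y - mu) / mu  # working response
        beta = np.linalg.solve(Xd.T @ (mu[:, None] * Xd), Xd.T @ (mu * z))
        eta = np.clip(Xd @ beta, -20, 20)
    return beta

# Synthetic data (hypothetical): only columns 0 and 3 drive the counts.
rng = np.random.default_rng(1)
X = rng.normal(size=(400, 8))
counts = rng.poisson(np.exp(0.5 + 0.8 * X[:, 0] - 0.6 * X[:, 3]))

selected = select_variables(X, counts, lams=np.array([0.01, 0.05, 0.1, 0.2]))
beta = poisson_irls(X[:, selected], counts)  # refit removes Lasso shrinkage
print("selected columns:", selected)
print("refit coefficients (intercept first):", np.round(beta, 2))
```

The refit step is where the debiasing happens: the Lasso is used only to decide which columns enter the model, and the reported coefficients come from the unpenalized Poisson fit on those columns.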