The data presented below represent the predicted number of people per ~100 m pixel as estimated using the random forest (RF) model as described in Stevens, et al. (2015). The following pages contain a description of the RF model and its covariates, their sources and any metadata collected for each covariate. The prediction weighting layer is used to dasymetrically redistribute the census counts and project counts to match estimated populations based on UN estimates for the final population maps provided by WorldPop.
These data are the population density values used to estimate the RF model used to create the prediction weighting layer you see above. Values represent population density as measured by people per hectare and calculated from population counts within each census unit. These values are used as the dependent variable during model estimation.
File Name: adminpop.shp
Source: National Bureau of Statistics, Nigeria
Description: These census data at the level of Local Government Areas (LGAs) were downloaded from http://www.geohive.com/cntry/nigeria.aspx. Required fields for map production are ADMINID and ADMINPOP.
area, buff, zones,
Field name: '2006cens' changed to: 'X2006cens'
class : SpatialPolygonsDataFrame features : 774 extent : -199288, 1118761, 472912, 1538804 (xmin, xmax, ymin, ymax) coord. ref. : NA variables : 16
These output and figures outline the estimated RF model that is used to predict the population density weighting layer. The model is fitted to the population density values for the preceding census data using covariates aggregatedfrom the ancillary data sources summarized following the model diagnostics.
Call: randomForest(x = x_data, y = y_data, ntree = popfit$ntree, mtry = popfit$mtry, nodesize = length(y_data)/1000, importance = TRUE, proximity = TRUE) Type of random forest: regression Number of trees: 500 No. of variables tried at each split: 13 Mean of squared residuals: 0.22 % Var explained: 88
File Name: nga_lc.tif
Source: GlobCover, 300m; GeoTerraImage
Description: Landcover from the GlobCover product resampled to 100m, refined with detailed Landsat-derived settlement extents (GeoTerraImage), reclassified to match AfriPop coding and eventually broken down into binary classifications by aggregated land cover type (see Linard, et al., 2010 and Gaughan, et al. 2013 for category information).
cls011, dst011, cls040, dst040, cls130, dst130, cls140, dst140, cls150, dst150, cls160, dst160, cls190, dst190, cls200, dst200, cls210, dst210, cls230, dst230, cls240, dst240, cls250, dst250, clsBLT, dstBLT,
class : RasterBrick dimensions : 10826, 13555, 146746430, 1 (nrow, ncol, ncell, nlayers) resolution : 100, 100 (x, y) extent : -214112, 1141388, 472626, 1555226 (xmin, xmax, ymin, ymax) coord. ref. : +proj=utm +zone=32 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 data source : D:\Research\Population\Analysis\RF\data\NGA\Landcover\Derived\landcover.tif names : landcover min values : 11 max values : 250
File Name: DEFAULT: MODIS 17A3 2010
Source: United States Geological Survey (USGS)
Description: MODIS 17A3 version-55 derived estimates of net primary productivity for the year 2010, estimated for 1km pixel sizes and subset and resampled to match the available land cover and final population map output requirements.
class : RasterBrick dimensions : 10826, 13555, 146746430, 1 (nrow, ncol, ncell, nlayers) resolution : 100, 100 (x, y) extent : -214112, 1141388, 472626, 1555226 (xmin, xmax, ymin, ymax) coord. ref. : +proj=utm +zone=32 +datum=WGS84 +units=m +no_defs +ellps=WGS84 +towgs84=0,0,0 data source : D:\Research\Population\Analysis\RF\data\NGA\NPP\Derived\npp.tif names : npp min values : 0 max values : 12426