Demographics of Lebanon#

To overcome gaps in official demographic statistics, the team has turned to private sector data sources. The Data Lab used algorithmically generated data such as that released by Meta. Additionally, the team also extracted the Relative Wealth Index data from Meta which helps identify differences in standard of living in the population.

Data Description#

High Resolution Population Density#

In partnership with the Center for International Earth Science Information Network (CIESIN) at Columbia University, Meta uses state-of-the-art computer vision techniques to identify buildings from publicly accessible mapping services to create these population datasets. These maps are available at 30-meter resolution. These maps aren’t built using Facebook data and instead rely on applying machine vision AI to satellite imagery and census information (latest available). The analysis was validated in 18 countries (Tiecke, et.al., 2017). For Lebanon, the sources that went into the creation of this dataset are -

This dataset, along with population density estimates, also has the following demographic disaggregations -

  • Men

  • Women

  • Children under 5

  • Elderly

  • Women of Reproductive Age (15-49)

  • Youth (15-24)

  • Senior

Data Access#

Relative Wealth Index#

The Relative Wealth Index (RWI) is a number that compares the wealth within the country where negative RWI implies poorer areas and positive RWI implies relatively richer areas. The Meta Relative Wealth Index predicts the relative standard of living within countries using privacy protecting connectivity data, satellite imagery, and other novel data sources. To calculate this index, Meta uses measurements of household wealth collected through face-to-face surveys conducted by the United States Agency for International Development with 1,457,315 unique households living in 66,819 villages in 56 different LMICs around the world. Following this, they use spatial markers to link villages to non-traditional data sources , including satellite imagery, cellular network data, topographic maps, and privacy protecting connectivity data from Facebook. They process the non-traditional data using deep learning and other computational algorithms, which convert the raw data to a set of quantitative features for each village. They use these features to train a supervised machine learning model that predicts the relative wealth of each populated 2.4km2 grid cell on the planet, even in regions where no ground truth data exists (Chi et. al., 2022).

The model was validated in 30 high-income countries (OECD Countries). They found that models trained using data in one country are most accurate when applied to neighboring countries and countries with similar attributes. The country level attributes on which similarity is assessed are - area, population, island, landlocked, distance to the closest country with DHS, number of neighboring countries with DHS, GDP per capita, and Gini coefficient.

Limitations#

For Lebanon, there is very limited census data available. It also does not have an accessible ground truth DHS survey dataset. The estimated mean error of the predicated model results in Lebanon is reported to be 0.470, median error is 0.480 and standard deviation is 0.042.

Data Access#

Relative Wealth Index dataset was obtained through the Development Data Partnership. Given the sensitiivty of the region, the otherwise publicly available dataset, can only be accessed through a signed agreement between the World Bank and Meta. All project team members who need to access this data must agree to the terms of the dataset and must be part of the proposal - Understanding Lebanon’s Economy though Alternative Data.

  • Dataset: Lebanon Relative Wealth Index

  • Dataset Provider: Data for Good at Meta

  • Granularity: 2.4km2 grid cell

  • Frequency: The dataset was released in 2020. The data is entirely predicted because of lack of ground truth survey information.

  • License: The use of this data is restricted to project members who are part of the proposal with the Development Data Partnership.

  • Data Access: Data has been made available for project team members on SharePoint. If you would like to request access please write to datapartnership@worldbank.org

Insights#

Hide code cell source
fig, ax = plt.subplots()
plt.axis("off")
hrsl_lebanon3.plot(column="population", cmap="viridis", vmax=100000, legend=True, ax=ax)
plt.title("Total population by admin 3 region", fontsize=12, ha="center")

subtitle_text = (
    "Source: Population estimates from Meta's HRSL aggregated to admin regions from HdX"
)
ax.text(-0.25, -0.05, subtitle_text, transform=ax.transAxes, fontsize=10, ha="left");
Text(-0.25, -0.05, "Source: Population estimates from Meta's HRSL aggregated to admin regions from HdX")
../../_images/e474d47afecf466735a3477b2e885c9bffc604bded6b99c891fd1391c42bfa80.png

Observations#

  • The area around Beirut is substantially more populated than the rest of the country.

  • The areas sharing a border with Syria also have a higher population.

Hide code cell source
fig, ax = plt.subplots()
plt.axis("off")
rw.plot(column="rwi", legend=True, ax=ax, markersize=5)

plt.title("Relative Wealth Index at 2.4sq km grids", fontsize=12, ha="center")

subtitle_text = "Source: Meta"
ax.text(-0.25, -0.05, subtitle_text, transform=ax.transAxes, fontsize=10, ha="left")
Text(-0.25, -0.05, 'Source: Meta')
../../_images/ef5e95be56a9c362a4862f883aac16f785384019bb34a6e7dc20b53295df793c.png

Observations#

  • The northern border shared with Syria includes some of the poorest regions in the country

  • The area around Beirut is the wealthiest in the country

References#

  • Chi, G., Fang, H., Chatterjee, S., & Blumenstock, J. E. (2022). Microestimates of wealth for all low-and middle-income countries. Proceedings of the National Academy of Sciences, 119(3), e2113658119.

  • Tiecke, T. G., Liu, X., Zhang, A., Gros, A., Li, N., Yetman, G., … & Dang, H. A. H. (2017). Mapping the world population one building at a time. arXiv preprint arXiv:1712.05839.