Metadata last updated on Aug 14, 2024
Compare
Version : 
The dataset is a relational dataset of 10,003,891 individuals (2,501,755 households), representing the entire population of an imaginary middle-income country. The dataset contains two data files: one with variables at the household level, the other one with variables at the individual level. It includes variables that are typically collected in population censuses (demography, education, occupation, dwelling characteristics, fertility, mortality, and migration) and in household surveys (household expenditure, anthropometric data for children, assets ownership). The data only includes ordinary households (no community households). The dataset was created using REaLTabFormer, a model that leverages deep learning methods. The dataset was created for the purpose of training and simulation and is not intended to be representative of any specific country.

A sample dataset of 8000 households was created out of this full-population dataset, and is also distributed as open data.
Metadata
View More
Data Access and Licensing
Classification: Public
This dataset is classified as Public under the Access to Information Classification Policy. Users inside and outside the Bank can access this dataset.
License: Creative Commons Attribution 4.0
This dataset is licensed under Creative Commons Attribution 4.0
Statistics
Views (384)
Downloads (0)
Share Metadata
The information on this page (the dataset metadata) is also available in these formats.
EmailJSON
Emergency Contact Number (US): (202) 458-8888|© 2022 The World Bank Group, All Rights Reserved