Metadata last updated on Nov 1, 2024
Compare
Version : 
The dataset is a relational dataset of 8,000 households households, representing a sample of the population of an imaginary middle-income country. The dataset contains two data files: one with variables at the household level, the other one with variables at the individual level. It includes variables that are typically collected in population censuses (demography, education, occupation, dwelling characteristics, fertility, mortality, and migration) and in household surveys (household expenditure, anthropometric data for children, assets ownership). The data only includes ordinary households (no community households). The dataset was created using REaLTabFormer, a model that leverages deep learning methods. The dataset was created for the purpose of training and simulation and is not intended to be representative of any specific country.

The full-population dataset (with about 10 million individuals) is also distributed as open data.
Metadata
View More
Data Access and Licensing
Classification: Public
This dataset is classified as Public under the Access to Information Classification Policy. Users inside and outside the Bank can access this dataset.
License: Creative Commons Attribution 4.0
This dataset is licensed under Creative Commons Attribution 4.0
Statistics
Views (730)
Downloads (0)
Share Metadata
The information on this page (the dataset metadata) is also available in these formats.
EmailJSON
Emergency Contact Number (US): (202) 458-8888|© 2022 The World Bank Group, All Rights Reserved