Ethiopia - COVID-19 High Frequency Phone Survey of Households 2020, Baseline (Round 1)

Primary tabs

The potential impacts of the COVID-19 pandemic in Ethiopia are expected to be severe on Ethiopian households' welfare. To monitor these impacts on households, the team selected a subsample of households that had been interviewed for the Living Standards Measurement Study (LSMS) in 2019, covering urban and rural areas in all regions of Ethiopia. The 15-minute questionnaire covers a series of topics, such as knowledge of COVID and mitigation measures, access to routine healthcare as public health systems are increasingly under stress, access to educational activities during school closures, employment dynamics, household income and livelihood, income loss and coping strategies, and external assistance. The survey is implemented using Computer Assisted Telephone Interviewing, using a modular approach, which allows for modules to be dropped and/or added in different waves of the survey. Survey data collection started at the end of April 2020 and households are called back every three to four weeks for a total of seven survey rounds to track the impact of the pandemic as it unfolds and inform government action. This provides data to the government and development partners in near real-time, supporting an evidence-based response to the crisis. The sample of households was drawn from the sample of households interviewed in the 2018/2019 round of the Ethiopia Socioeconomic Survey (ESS). The extensive information collected in the ESS, less than one year prior to the pandemic, provides a rich set of background information on the COVID-19 High Frequency Phone Survey of households which can be leveraged to assess the differential impacts of the pandemic in the country.

Type: 
Microdata
Acronym: 
COVID-19 HFPS-R1 2020
Languages Supported: 
English
Topics: 
Topic not specified
Geographical Coverage: 
Ethiopia
Economy Coverage: 
Economy Coverage not specified
Release Date: 
June 11, 2020

Last Updated

Last Updated: 
June 15, 2020

Harvest System ID

Harvest System ID: 
Microdata

Harvest Source ID

Harvest Source ID: 
11422
Disclaimer: 
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
Version Description: 
Version 01 (June 2020)
Publisher Name: 

Development Economics Data Group; The World Bank

Funding Name, Abbreviation, Role: 
United States Agency for International Development, World Bank
Study Type: 
Socio-Economic/Monitoring Survey [hh/sems]
Series Information: 
The World Bank is providing support to countries to help mitigate the spread and impact of the new coronavirus disease (COVID-19). One area of support is for data collection to inform evidence-based policies that may help mitigate the effects of this disease. Towards this end, the World Bank is leveraging the Living Standards Measurement Study - Integrated Survey on Agriculture (LSMS-ISA) program to implement high-frequency phone surveys on COVID-19 in 5 African countries - Nigeria, Ethiopia, Uganda, Tanzania, and Malawi. This effort is part of a broader first wave of World Bank-supported national longitudinal high frequency survey that can be used to help assess the economic and social implications of the COVID-19 pandemic on households and individuals.
Universe: 
The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.
Primary Investigator Name, Affiliation: 
World Bank
Sampling Procedure: 
The sample of the HFPS-HH is a subsample of the 2018/19 Ethiopia Socioeconomic Survey (ESS). The ESS is built on a nationally and regionally representative sample of households in Ethiopia. ESS 2018/19 interviewed 6,770 households in urban and rural areas. In the ESS interview, households were asked to provide phone numbers either their own or that of a reference household (i.e. friends or neighbors) so that they can be contacted in the follow-up ESS surveys should they move from their sampled location. At least one valid phone number was obtained for 5,374 households (4,626 owning a phone and 995 with a reference phone number). These households established the sampling frame for the HFPS-HH. To obtain representative strata at the national, urban, and rural level, the target sample size for the HFPS-HH is 3,300 households; 1,300 in rural and 2,000 households in urban areas. In rural areas, we attempt to call all phone numbers included in the ESS as only 1,413 households owned phones and another 771 households provided reference phone numbers. In urban areas, 3,213 households owned a phone and 224 households provided reference phone numbers. To account for non-response and attrition all the 5,374 households were called in round 1 of the HFPS-HH. The total number of completed interviews in round one is 3,249 households (978 in rural areas, 2,271 in urban areas).
Weighting: 
To obtain unbiased estimates from the sample, the information reported by households needs to be adjusted by a sampling weight (or raising factor) w_h. To construct the sampling weights, we follow the steps outlined in Himelein, K. (2014), which outlines eight steps, of which we follow six, to construct the sampling weights for the HFPS-HH: 1. Begin with base weights from the Ethiopia Socioeconomic Survey ESS 2018/19 for each household 2. Incorporate probability of sub-selection of round 1 unit for each of the phone survey households. We calculate the probability of selection for each of the 20 strata in the ESS (urban and rural in each of the 11 regions except for Addis Ababa where we only have an urban stratum) by creating the numerators as the number of completed phone interviews and the denominator as the number of households in the ESS for each stratum. 3. Pool the weights in Steps 1 and 2. 4. Derive attrition-adjusted weights for all individuals by running a logistic response propensity model based on characteristics of the household head (i.e. education, labor force status, demographic characteristics), characteristics of the household (consumption, assets, financial characteristics), and characteristics of the dwelling (house ownership, overcrowding). 5. Trim weights by replacing the top two percent of observations with the 98th percentile cut-off point; and 6. Post-stratify weights to known population totals to correct for the imbalances across our urban and rural sample. In doing so, we ensure that the distribution in the survey matches the distribution in the ESS. * Additional technical details and explanations on each of the steps briefly outlined above can be found in Himelein, K. (2014).
Questionnaires: 
The Ethiopia COVID-19 High Frequency Phone Survey of households questionnaire consists of the following 10 sections: - Interview Information - Household Roster - Knowledge Regarding the Spread of COVID-19 - Behaviour and Social Distancing - Access to Basic Services - Employment - Income Loss - Coping/Shocks - Food Security - Aid and Support/ Social Safety Nets
Data Collector(s) Name: 
Laterite BV
Data Editing: 
DATA CLEANING At the end of data collection, the raw dataset was cleaned by the Research team. This included formatting, and correcting results based on monitoring issues, enumerator feedback and survey changes. Data cleaning carried out is detailed below. Variable naming and labeling: • Variable names were changed to reflect the lowercase question name in the paper survey copy, and a word or two related to the question. • Variables were labeled with longer descriptions of their contents and the full question text was stored in Notes for each variable. • “Other, specify” variables were named similarly to their related question, with “_other” appended to the name. • Value labels were assigned where relevant, with options shown in English for all variables, unless preloaded from the roster in Amharic. Variable formatting: • Variables were formatted as their object type (string, integer, decimal, time, date, or datetime). • Multi-select variables were saved both in space-separated single-variables and as multiple binary variables showing the yes/no value of each possible response. • Time and date variables were stored as POSIX timestamp values and formatted to show Gregorian dates. • Location information was left in separate ID and Name variables, following the format of the incoming roster. IDs were formatted to include only the variable level digits, and not the higher-level prefixes (2-3 digits only.) • Full Household and Enumeration Area ID variables were given leading 0s to match incoming roster format. Observation and variable arrangement: • Only consented surveys were kept in the dataset, and all personal information and internal survey variables were dropped from the clean dataset. • Roster data is separated from the main data set and kept in long-form but can be merged on the key variable (key can also be used to merge with the raw data). • In the main dataset, ii4_resp_id and cs7_hhh_id are the roster IDs of the respondent and household head respectively, and can be merged with individual_id in the roster. • The variables were arranged in the same order as the paper instrument, with observations arranged according to their submission time. Backcheck data review: Results of the backcheck survey are compared against the originally captured survey results using the bcstats command in Stata. This function delivers a comparison of variables and identifies any discrepancies. Any discrepancies identified are then examined individually to determine if they are within reason.
Other Processing: 
The Ethiopia- COVID-19 High Frequency Phone Survey of Households, Baseline (Round 1) covered the following topics: - Household Roster - Knowledge Regarding the Spread of COVID-19 - Behaviour and Social Distancing - Access to Basic Services - Employment - Income Loss - Coping/Shocks - Food Security - Aid and Support/ Social Safety Nets
Access Authority Name, Affiliation, Email: 

World Bank

No Visualizations Available.

Use of the dataset must be acknowledged using a citation which would include: - the Identification of the Primary Investigator - the title of the survey (including country, acronym and year of implementation) - the survey reference number - the source and date of download World Bank. Ethiopia- COVID-19 High Frequency Phone Survey of Households, Baseline (Round 1) 2020. Dataset downloaded from www.microdata.worldbank.org on [date].

The potential impacts of the COVID-19 pandemic in Ethiopia are expected to be severe on Ethiopian households' welfare. To monitor these impacts on households, the team selected a subsample of households that had been interviewed for the Living Standards Measurement Study (LSMS) in 2019, covering urban and rural areas in all regions of Ethiopia. The 15-minute questionnaire covers a series of topics, such as knowledge of COVID and mitigation measures, access to routine healthcare as public health systems are increasingly under stress, access to educational activities during school closures, employment dynamics, household income and livelihood, income loss and coping strategies, and external assistance. The survey is implemented using Computer Assisted Telephone Interviewing, using a modular approach, which allows for modules to be dropped and/or added in different waves of the survey. Survey data collection started at the end of April 2020 and households are called back every three to four weeks for a total of seven survey rounds to track the impact of the pandemic as it unfolds and inform government action. This provides data to the government and development partners in near real-time, supporting an evidence-based response to the crisis. The sample of households was drawn from the sample of households interviewed in the 2018/2019 round of the Ethiopia Socioeconomic Survey (ESS). The extensive information collected in the ESS, less than one year prior to the pandemic, provides a rich set of background information on the COVID-19 High Frequency Phone Survey of households which can be leveraged to assess the differential impacts of the pandemic in the country.

FieldValue
Modified Date
2020-06-16
Release Date
Identifier
77ed7625-0f38-4d66-9506-2188ba45f7a1
License
License Not Specified
Contact Email
Public Access Level
Public
Rating: 
0
No votes yet
Acronym: 
COVID-19 HFPS-R1 2020
Type: 
Languages Supported: 
Access Authority Name, Affiliation, Email: 
World Bank
Disclaimer: 
The user of the data acknowledges that the original collector of the data, the authorized distributor of the data, and the relevant funding agency bear no responsibility for use of the data or for interpretations or inferences based upon such uses.
Weighting: 
To obtain unbiased estimates from the sample, the information reported by households needs to be adjusted by a sampling weight (or raising factor) w_h. To construct the sampling weights, we follow the steps outlined in Himelein, K. (2014), which outlines eight steps, of which we follow six, to construct the sampling weights for the HFPS-HH: 1. Begin with base weights from the Ethiopia Socioeconomic Survey ESS 2018/19 for each household 2. Incorporate probability of sub-selection of round 1 unit for each of the phone survey households. We calculate the probability of selection for each of the 20 strata in the ESS (urban and rural in each of the 11 regions except for Addis Ababa where we only have an urban stratum) by creating the numerators as the number of completed phone interviews and the denominator as the number of households in the ESS for each stratum. 3. Pool the weights in Steps 1 and 2. 4. Derive attrition-adjusted weights for all individuals by running a logistic response propensity model based on characteristics of the household head (i.e. education, labor force status, demographic characteristics), characteristics of the household (consumption, assets, financial characteristics), and characteristics of the dwelling (house ownership, overcrowding). 5. Trim weights by replacing the top two percent of observations with the 98th percentile cut-off point; and 6. Post-stratify weights to known population totals to correct for the imbalances across our urban and rural sample. In doing so, we ensure that the distribution in the survey matches the distribution in the ESS. * Additional technical details and explanations on each of the steps briefly outlined above can be found in Himelein, K. (2014).
Economy Coverage: 
Primary Investigator Name, Affiliation: 
World Bank
Publisher Name: 
Development Economics Data Group; The World Bank
Version Description: 
Version 01 (June 2020)
Subtitle: 
Baseline (Round 1)
Universe: 
The survey covered all de jure households excluding prisons, hospitals, military barracks, and school dormitories.
Geographical Coverage: 
Data Classification of a Dataset: 
Series Information: 
The World Bank is providing support to countries to help mitigate the spread and impact of the new coronavirus disease (COVID-19). One area of support is for data collection to inform evidence-based policies that may help mitigate the effects of this disease. Towards this end, the World Bank is leveraging the Living Standards Measurement Study - Integrated Survey on Agriculture (LSMS-ISA) program to implement high-frequency phone surveys on COVID-19 in 5 African countries - Nigeria, Ethiopia, Uganda, Tanzania, and Malawi. This effort is part of a broader first wave of World Bank-supported national longitudinal high frequency survey that can be used to help assess the economic and social implications of the COVID-19 pandemic on households and individuals.
Sampling Procedure: 
The sample of the HFPS-HH is a subsample of the 2018/19 Ethiopia Socioeconomic Survey (ESS). The ESS is built on a nationally and regionally representative sample of households in Ethiopia. ESS 2018/19 interviewed 6,770 households in urban and rural areas. In the ESS interview, households were asked to provide phone numbers either their own or that of a reference household (i.e. friends or neighbors) so that they can be contacted in the follow-up ESS surveys should they move from their sampled location. At least one valid phone number was obtained for 5,374 households (4,626 owning a phone and 995 with a reference phone number). These households established the sampling frame for the HFPS-HH. To obtain representative strata at the national, urban, and rural level, the target sample size for the HFPS-HH is 3,300 households; 1,300 in rural and 2,000 households in urban areas. In rural areas, we attempt to call all phone numbers included in the ESS as only 1,413 households owned phones and another 771 households provided reference phone numbers. In urban areas, 3,213 households owned a phone and 224 households provided reference phone numbers. To account for non-response and attrition all the 5,374 households were called in round 1 of the HFPS-HH. The total number of completed interviews in round one is 3,249 households (978 in rural areas, 2,271 in urban areas).
Release Date: 
Thursday, June 11, 2020
Last Updated Date: 
Monday, June 15, 2020
Questionnaires: 
The Ethiopia COVID-19 High Frequency Phone Survey of households questionnaire consists of the following 10 sections: - Interview Information - Household Roster - Knowledge Regarding the Spread of COVID-19 - Behaviour and Social Distancing - Access to Basic Services - Employment - Income Loss - Coping/Shocks - Food Security - Aid and Support/ Social Safety Nets
Data Editing: 
DATA CLEANING At the end of data collection, the raw dataset was cleaned by the Research team. This included formatting, and correcting results based on monitoring issues, enumerator feedback and survey changes. Data cleaning carried out is detailed below. Variable naming and labeling: • Variable names were changed to reflect the lowercase question name in the paper survey copy, and a word or two related to the question. • Variables were labeled with longer descriptions of their contents and the full question text was stored in Notes for each variable. • “Other, specify” variables were named similarly to their related question, with “_other” appended to the name. • Value labels were assigned where relevant, with options shown in English for all variables, unless preloaded from the roster in Amharic. Variable formatting: • Variables were formatted as their object type (string, integer, decimal, time, date, or datetime). • Multi-select variables were saved both in space-separated single-variables and as multiple binary variables showing the yes/no value of each possible response. • Time and date variables were stored as POSIX timestamp values and formatted to show Gregorian dates. • Location information was left in separate ID and Name variables, following the format of the incoming roster. IDs were formatted to include only the variable level digits, and not the higher-level prefixes (2-3 digits only.) • Full Household and Enumeration Area ID variables were given leading 0s to match incoming roster format. Observation and variable arrangement: • Only consented surveys were kept in the dataset, and all personal information and internal survey variables were dropped from the clean dataset. • Roster data is separated from the main data set and kept in long-form but can be merged on the key variable (key can also be used to merge with the raw data). • In the main dataset, ii4_resp_id and cs7_hhh_id are the roster IDs of the respondent and household head respectively, and can be merged with individual_id in the roster. • The variables were arranged in the same order as the paper instrument, with observations arranged according to their submission time. Backcheck data review: Results of the backcheck survey are compared against the originally captured survey results using the bcstats command in Stata. This function delivers a comparison of variables and identifies any discrepancies. Any discrepancies identified are then examined individually to determine if they are within reason.
Other Processing: 
The Ethiopia- COVID-19 High Frequency Phone Survey of Households, Baseline (Round 1) covered the following topics: - Household Roster - Knowledge Regarding the Spread of COVID-19 - Behaviour and Social Distancing - Access to Basic Services - Employment - Income Loss - Coping/Shocks - Food Security - Aid and Support/ Social Safety Nets
Harvest Source: 
Harvest System ID: 
11422
Citation Text: 
Use of the dataset must be acknowledged using a citation which would include: - the Identification of the Primary Investigator - the title of the survey (including country, acronym and year of implementation) - the survey reference number - the source and date of download World Bank. Ethiopia- COVID-19 High Frequency Phone Survey of Households, Baseline (Round 1) 2020. Dataset downloaded from www.microdata.worldbank.org on [date].
Modified date: 
18428
Study Type: 
Socio-Economic/Monitoring Survey [hh/sems]
Primary Dataset: 
Yes
Data Collector(s) Name: 

Laterite BV

Funding Name, Abbreviation, Role: 

United States Agency for International Development, World Bank

Data Access and Licensing

This dataset is classified as Public under the Access to Information Classification Policy. Users inside and outside the Bank can access this dataset.

This dataset is made available under the World Bank Microdata Research License

Share Metadata

The information on this page (the dataset metadata) is also available in these formats.

PRINT EMAIL JSON RDF