Impacts of reducing water collection times in rural Kenya: Meru ESM RCT

SND-ID: snd1294-1. Version: 1. DOI: https://doi.org/10.5878/qa1e-sq29

Is part of collection at SND: Environment for Development

Citation

Creator/Principal investigator(s)

Jane Kabubo-Mariara - Partnership for Economic Policy

Peter Kimuyu - Commission on Revenue Allocation, Government of Kenya

Joseph Cook - Washington State University, School of Economics

Research principal

University of Gothenburg - Environment for Development, School of Business, Economics and Law rorId

Principal's reference number

MS-105

Description

We measured momentary well-being using the Experience Sampling Method (ESM) among 220 water collectors in rural Meru County, Kenya over eight weeks. Subjects reported on affect and time use at four randomly-chosen times through the day (Monday through Saturday) on a custom-designed ODK survey app, deployed on a low-cost smartphone. Subjects completed a second ODK survey each weekday evening, reporting on school attendance, study time and chores performed for each school-aged child in the household. After several weeks of baseline data, half of households were randomly chosen to receive free delivery of water to their door for four weeks, reducing water collection times to (near) zero. In-person baseline, midline and endline surveys were conducted by enumerators.

The dataset “Meru ESM RCT.dta” contains (in Stata format) the merged data from the ESM exercise and the baseline, midline and endline surveys. The baseline, midline and endline survey were conducted once with each household, but each household completed multiple ESM surveys. This dataset contains 12,956 observations, so to recreat

... Show more..
We measured momentary well-being using the Experience Sampling Method (ESM) among 220 water collectors in rural Meru County, Kenya over eight weeks. Subjects reported on affect and time use at four randomly-chosen times through the day (Monday through Saturday) on a custom-designed ODK survey app, deployed on a low-cost smartphone. Subjects completed a second ODK survey each weekday evening, reporting on school attendance, study time and chores performed for each school-aged child in the household. After several weeks of baseline data, half of households were randomly chosen to receive free delivery of water to their door for four weeks, reducing water collection times to (near) zero. In-person baseline, midline and endline surveys were conducted by enumerators.

The dataset “Meru ESM RCT.dta” contains (in Stata format) the merged data from the ESM exercise and the baseline, midline and endline surveys. The baseline, midline and endline survey were conducted once with each household, but each household completed multiple ESM surveys. This dataset contains 12,956 observations, so to recreate the baseline, midline and endline datasets (one row per household) one would collapse the data on phoneid.
The baseline, midline and endline surveys contain some data and questions that were repeated across waves. To make variable names unique, a “_base”, “_mid” or “_end” is appended at the end of the variable name. For example, each survey contained the time that the interviewer opened the app and started the survey (start), as did the ESM survey completed by the subject. This dataset therefore contains four variables, start (the ESM surveys), start_base, start_mid, and start_end.
All data was collected in ODK apps. These apps are compiled based on data in Excel spreadsheets, including variable names, questions, and answer codes. These ODK excel spreadsheets thus also serve as data dictionaries.
The key unique identifier linking records is phoneid. This is an identifier created by the team, and is not a phone number, or SIM serial number that is any way identifiable.
With few exceptions, data files do not contain any variables generated ex-post by the researchers. The exception is a variable tracking treatment status. The original, randomly-assigned treatment status is encoded in treat. But because implementation of the treatment program was uneven in the first week (and particularly the first two days) due to logistical issues, the team created three time-varying treatment variables capturing three assumptions. trtsimple drops all treated households from the dataset during the first two days of treatment. trtconserv drops all treated households until the water delivery system was running as planned, dropping approximately 2 weeks of data. trtmain uses detailed information collected by the study team about the dates and locations where water delivery was operating as expected. In other words, if the team has data that treated households in a given location were delivered water as planned on a given day, we do not drop those treated households even if other treated households elsewhere did not receive water as planned. Show less..

Data contains personal data

No

Language

Method and outcome

Unit of analysis

Population

Households in rural Kenya without a private water connection at home

Time Method

Sampling procedure

Probability
See papers for more details.

Time period(s) investigated

2016-08-20 – 2016-10-08

Data format / data structure

Data collection
  • Mode of collection: Face-to-face interview: CAPI/CAMI
  • Time period(s) for data collection: 2015-08-01 – 2015-08-31
  • Source of the data: Research data
Geographic coverage

Geographic spread

Geographic location: Kenya

Geographic description: Rural Meru County

Lowest geographic unit

Constituency

Highest geographic unit

Province

Administrative information

Responsible department/unit

Environment for Development, School of Business, Economics and Law

Funding 1

  • Funding agency: Environment for Development Initiative

Funding 2

  • Funding agency: Sida (The Swedish International Development Cooperation Agency)

Ethics Review

Ref. 52167

Ethics approval from the University of Washington (USA) Institutional Review Board

Topic and keywords

Research area

Social sciences (Standard för svensk indelning av forskningsämnen 2011)

Economics (CESSDA Topic Classification)

Economic systems and development (CESSDA Topic Classification)

Social conditions and indicators (CESSDA Topic Classification)

Time use (CESSDA Topic Classification)

Psychology (CESSDA Topic Classification)

Publications

RFF-EfD Discussion Paper 18-07 (working paper)

If you have published anything based on these data, please notify us with a reference to your publication(s). If you are responsible for the catalogue entry, you can update the metadata/data description in DORIS.

Versions

Version 1. 2022-04-11

Version 1: 2022-04-11

DOI: https://doi.org/10.5878/qa1e-sq29

Contacts for questions about the data

Agustin Petroni

data@efd.gu.se

Joseph Cook

joe.cook@wsu.edu

Is part of collection at SND

Published: 2022-04-11