Abstract

This paper analyzes the magnitude and predictors of misreporting on intimate partner violence. Women in Nigeria were randomly assigned to answer questions using either an indirect method (list experiment) that gives respondents anonymity, or the standard, direct face-to-face method. Intimate partner violence rates were up to 35 percent greater when measured using the list method than the direct method. Misreporting was associated with indicators often targeted in empowerment and development programs, such as education and vulnerability. These results suggest that standard survey methods may generate significant underestimates of the prevalence of intimate partner violence, and biased correlations and treatment effect estimates.

1. Introduction

It is challenging to accurately measure sensitive attitudes, behaviors, and experiences such as political preferences, prejudice, risky sex, and intimate partner violence (IPV). Often, researchers and policy makers must rely on self-reported survey data, yet such data may not be accurate. For sensitive topics in particular, there are many reasons why survey respondents may be reluctant to tell the truth, including a fear of being judged, endangered, or legally penalized (Blair, Coppock, and Moor 2020). If misreporting bias is significant and systematic, researchers and policy makers risk over- or underestimating the prevalence and welfare costs of sensitive issues and drawing inaccurate conclusions about their causes and potential solutions.

This paper compares women’s self-reported experience of IPV under different survey methods. In a survey experiment in Nigeria, women were randomly assigned to answer questions about their experience of emotional, physical, and sexual IPV using one of two methods: an indirect method (list experiment) or the standard survey method (face-to-face questions asked by an enumerator). Compared to the direct face-to-face method, the list experiment provides additional confidentiality as respondents do not directly disclose their experience of violence. Instead, they give a response that allows researchers to calculate average prevalence across the whole sample, essentially making respondents’ answers anonymous (Blair and Imai 2012).

Results show that estimated IPV prevalence varies depending on the survey method used to measure it. The most widely used method, face-to-face interview, resulted in the lowest IPV rates, while the anonymous list method had higher rates.1 Women’s reported experience of emotional and physical IPV was 31 and 35 percent greater when measured using the list method compared to the direct method. However, we find no statistically significant difference between the list and face-to-face methods for sexual IPV. Anecdotal evidence and the similarly high rates of sexual IPV reported for both methods (26 and 28 percent) suggest this lack of difference for sexual violence could potentially be explained by a context and interview setting where reporting forced sexual intercourse within marriage was less stigmatized.

This paper finds some similar patterns of misreporting in a related experiment in Rwanda, where we randomly assigned women and men to answer violence questions using one of three methods: the indirect list experiment and two direct survey methods (face-to-face questions asked by an enumerator, or audio computer-assisted self-interview (ACASI) on an electronic tablet). The ACASI method affords more privacy than enumerator-administered methods as respondents answer questions themselves by listening to pre-recorded questions in a headset and then selecting their answer on a tablet. In Rwanda, as in Nigeria, we find women report more physical IPV when interviewed under the list experiment than the direct methods.2 In this paper, we primarily focus on the Nigeria experiment as it uses the more common list experiment design and IPV questions based on the DHS. The Rwanda experiment is discussed in the supplementary online appendix.

In addition to significant differences in reporting rates, this paper also finds that misreporting is systematic and correlated with characteristics that have been theorized as causes of IPV and targeted in development and female empowerment programs. The characteristics that correlate differently with IPV depending on the measurement method used include indicators of women’s education, vulnerability, and marital relationship quality. For example, literate Nigerian women are 8 percentage points less likely than illiterate women to experience emotional IPV when violence is measured using the face-to-face method, but 12 percentage points more likely if IPV is measured using the list method. This suggests that more- educated women underreport IPV when asked directly but that the list method’s anonymity may make them comfortable enough to report it. This accords with results from other contexts (Joseph et al. 2017; Agüero and Frisancho 2022).

Results suggest that the standard, direct methods currently used in most surveys risk generating significant underestimates of IPV prevalence. Further, social scientists risk estimating biased correlations and, if treatment is correlated with misreporting, estimating biased treatment effects and drawing inaccurate conclusions about IPV’s causes and the effectiveness of interventions to prevent it.3

This paper contributes to the growing social science literature studying alternative survey methods that may reduce misreporting (Karlan and Zinman 2012; Blattman et al. 2016; Blair, Coppock, and Moor 2020; Chuang et al. 2021), including on IPV measurement. Recent studies on IPV measurement have compared respondents’ experience of violence across the face-to-face and list experiment methods. They found mixed results, with one study in Peru finding the same rates of IPV between face-to-face and list methods (Agüero and Frisancho 2022) and others in Vietnam, India, and Burkina Faso finding a difference (Joseph et al. 2017; Bulte and Lensink 2019; Lépine, Treibich, and d’Exelle 2020).4 However, the latter three of these studies asked respondents about their IPV experience using two methods within the same survey about the same or similar, but slightly different, violence acts. This potentially limits comparability across methods as respondents may answer differently the second time they are asked the same question. To avoid this issue, we randomly assign respondents to answer questions using only one method rather than asking respondents two overlapping questions with different methods. This paper also contributes to the limited evidence on who misreports on sensitive questions, the implications of misreporting for bias, and the trade-offs between different survey methods that seek to address misreporting.

We contribute to recent debates about the list experiment’s accuracy and efficiency in generating prevalence estimates, and its relevance given evidence that misreporting under direct methods can be small. For example, Chuang et al. (2021) identify challenges with the consistency of the prevalence of sexual and health behaviors estimated using list experiments, finding that prevalence sometimes differed depending on the other questions included on the list. Other questions about the list experiment concern its trade-off between potential bias reduction from minimizing misreporting, and the efficiency of its estimates given it adds noise to the data (Lépine, Treibich, and d’Exelle 2020). Finally, a recent review of reporting on sensitive topics in political science found that the difference in prevalence estimates between list and direct methods is often only around 5 percentage points (Blair, Coppock, and Moor 2020). If also true for IPV, there may be a limited upside to using the method. However, we show that face-to-face underreporting on IPV is likely to be substantial in some contexts. This highlights the continued need to carefully design and test list experiments, while also identifying new methods or interviewing techniques that minimize misreporting, are consistent, precise, and feasible to implement in resource-constrained settings. While the ACASI method holds some promise, it produced broadly similar IPV estimates to the face-to-face method in our experiment in Rwanda.

This paper contributes to the nascent literature comparing IPV prevalence using multiple survey methods that offer respondents different levels of privacy and anonymity. It is also one of few papers to study IPV misreporting in Africa, and to demonstrate that misreporting is substantial in some contexts, varies by the type of violence, and is correlated with characteristics commonly targeted in women’s empowerment interventions.

2. Measuring Sensitive Topics

There is robust evidence that people underreport the truth about sensitive topics when they are directly asked in a survey. Such research compares self-reported survey answers to observed or independently verifiable outcomes such as tax records, receipt of welfare benefits, drug prescriptions for mental illness, and voting in an abortion law referendum (Gottschalk and Huynh 2010; Rosenfeld, Imai, and Shapiro 2016; Bharadwaj, Pai, and Suziedelyte 2017; Murray-Close and Heggeness 2018; Meyer and Mittag 2019).

However, reliable, unbiased observational data are rare for many sensitive topics, including IPV.5 This is particularly true in developing countries, where there is limited administrative data from the medical or legal services that IPV survivors may use. Non-survey data sources that address the risk of reporting bias, such as emergency and domestic violence hotline calls, hospital and police administrative records, and Google search data (Leslie and Wilson 2020; Ravindran and Shah 2020; Bullinger, Carr, and Packham 2021), were widely used to study the effect of COVID-19 lockdowns on IPV. However, they typically capture the more extreme cases of IPV and might still be biased in other ways, for example by the types of people selecting into services. For example, in Nigeria, only 1 percent of women who experienced physical or sexual IPV reported it to the police (DHS 2019). While administrative and other non-survey data sources can be useful for studying extreme IPV cases and to validate survey data in some contexts, there is still a need for accurate, representative survey data from lower-resource settings and for less extreme cases.

As a result of these non-survey data limitations and concerns about underreporting on face-to-face surveys, researchers have considered alternative survey methods to minimize misreporting bias. A popular alternative method, the list experiment broadly aims to address misreporting driven by shame, embarrassment, a preference for anonymity, or fear of repercussions if one’s answer becomes known.

2.1. The List Experiment

Researchers have used the list experiment to improve data accuracy when measuring sensitive behaviors and attitudes.6 The standard list experiment works as follows: respondents are randomly assigned to one of two groups to answer the list question. In the “non-sensitive” or “control” group, enumerators read respondents a set of three non-sensitive statements and ask the respondent to tell them how many but not which statements are true for the respondent, with possible answers between zero and three. Meanwhile, the “sensitive” or “treatment” group is read the same list of statements, with the addition of a fourth sensitive statement. They are given the same instructions, to say how many, between zero and four statements, are true. Given the sensitive and non-sensitive list groups are randomly assigned, the two groups are, in expectation, equivalent and the only difference is the inclusion of the sensitive statement. The prevalence of the sensitive item across the whole sample is thus the difference in the mean number of items reported by the two groups.

Unlike with direct methods, anonymity is ensured because the lists are designed in such a way that very few people in the list sensitive group have all four statements as either true or not true.7 This way, a respondent is not revealing their true answer to the sensitive item to the enumerator or anyone with data access. This could mitigate underreporting by respondents concerned about shame or consequences if someone were to find out their answers, or those who doubt the confidentiality of their data.

There are potential drawbacks to the list experiment. One challenge is that, by design, it cannot be used to identify individuals who respond affirmatively to the sensitive question of interest, potentially limiting its use where individuals need to be referred to support services. Also, with list experiment answers ranging from zero to four, it typically has greater variance than direct questioning methods (but also likely lower misreporting bias because of respondent anonymity). Thus list experiments may reduce bias at the cost of efficiency (though there are approaches such as the double list experiment that can reduce the method’s variance) (Blair and Imai 2012).8 Given these power concerns, the list method may not be ideal for use in impact evaluations as it may leave outcome analysis, and heterogeneity analysis, underpowered. Further, the method does not easily lend itself to further question complexity, for example follow-up questions to measure frequency and injuries from violence, or aggregation of several questions to create indicator variables of “any experience of violence.” Finally, Chuang et al. (2021) find evidence that prevalence estimates under the list experiment can be internally inconsistent, with rates that are sensitive to the other questions used in the list.9

However, given the list method’s potential to reduce misreporting, researchers have begun using it to measure IPV and other forms of gender-based violence. In several studies, IPV rates are compared between the face-to-face and list methods. The study most related to this paper, conducted in urban Lima, Peru, found no difference in overall IPV prevalence by survey method (Agüero and Frisancho 2022); studies in other contexts did find a difference. In Sri Lanka, Traunmüller, Kijewski, and Freitag (2019) found the self-reported experience of wartime sexual violence (not IPV) to be higher under the list method than face-to-face, and in India and Burkina Faso, Joseph et al. (2017) and Lépine, Treibich, and d’Exelle (2020), respectively, found higher rates of IPV under the list method than face-to-face.10

In these studies, respondents were either (a) randomly assigned to answer questions about the same experience of IPV using the direct face-to-face method or the list method (Agüero and Frisancho 2022), or (b) all respondents were asked about their IPV experience using both methods within the same survey—either with similar, but slightly different, question wording (Bulte and Lensink 2019) or the same question wording (Joseph et al. 2017; Lépine, Treibich, and d’Exelle 2020).11 One issue with approach (b) is that being asked the same question in two ways may bias responses to the second question. In our Nigeria experiment, we take approach (a), asking individuals each question only once to minimize this risk.

2.2. Misreporting and Truth

Tourangeau and Yan (2007) argue that for sensitive questions where responding in the affirmative is taboo, the survey method that finds the highest prevalence is closest to truth. In this paper, particularly given the stigma surrounding the issue of IPV, we assume that in most cases, the method with the highest prevalence is closest to truth and that most misreporting is underreporting.

Without objective, non-self-reported data, we cannot, ultimately, confirm which method is closest to truth. However, the alternative survey methods tested in this paper are designed to address motivations for underreporting when interviewed with the face-to-face method—shame and fear. These alternative methods are unlikely to introduce new reasons for people to report IPV that they have not experienced. This is particularly true for the list experiment, where answers are anonymous by design, hence individuals have no incentive to overreport given they cannot be followed up with to provide aid or program services.12 Therefore, it seems likely that the method with the highest reporting indicates the method that best addresses respondents’ reasons for misreporting and is closest to truth.

This assumption is supported by empirical evidence from the only study we identified that compares an observed sensitive behavior with self-reported behavior measured using direct and indirect methods including the list experiment. Rosenfeld, Imai, and Shapiro (2016) found that when they were asked directly, people underreported their past voting behavior on a sensitive abortion referendum in Mississippi. However, rates were higher and closer to truth when they were asked using the list experiment. No method yielded rates higher than the true “socially undesirable” outcome of voting against tightened abortion restrictions.

Overall, evidence on the measurement of sensitive topics suggests that misreporting appears to be highly issue and context dependent (Tourangeau and Yan 2007; Langhaug, Sherr, and Cowan 2010; Blair, Coppock, and Moor 2020), presumably affected by factors such as the interview setting, cultural norms, security and safety, enumerator skill, and confidence in data privacy. Therefore, this paper primarily relates to misreporting about IPV, and in contexts similar to those studied here.

3. Experimental Design

3.1. Context, Data, & Design

Countries in Sub-Saharan Africa have high rates of IPV compared to the global average. According to data gathered using the direct face-to-face method, 14 percent of Nigerian women report having experienced physical or sexual IPV in the previous 12 months (DHS 2019). This compares to 3 percent of women in the United States, 21 percent in Rwanda, 23 percent in Kenya, 18 percent in India and Bolivia, and 10 percent in South Africa (Global Burden of Disease 2017; DHS 2021).

Our study took place in one of Nigeria’s poorest and most conservative regions, Kebbi state in Northwest Nigeria. Kebbi has the country’s highest acceptance of wife-beating, with 67 percent of women believing a husband is justified in using physical violence if a wife burns the food. Yet it also has the country’s lowest reported rates of IPV (DHS 2019). Only 13, 7, and 1 percent of women report experiencing emotional, physical, and sexual IPV in the previous 12 months (compared to the national average of 26, 12, and 5, respectively). In this region, IPV is not criminalized. The penal code governing Northern Nigerian states does not outlaw marital rape or physical violence against a wife “so long as the woman is not seriously harmed” (Benebo, Schumann, and Vaezghasemi 2018).13

The survey experiments described in this paper were conducted in accordance with WHO guidelines on the safe and ethical collection of data on violence against women (WHO 2001, 2016).

In Nigeria, the measurement experiment was conducted during the endline survey of a cluster randomized control trial studying the effects of the Feed the Future Nigeria Livelihoods Project, a multi-component cash transfer and livelihoods program.14 The sample consisted of ultra-poor households, with approximately 15 percent of women self-reporting as literate, 30 percent in polygamous marriages, and 84 percent identifying as Muslim.

The eligibility criteria for the measurement experiment were that the respondent was female, over 17 years of age, and married and living in the same household as a male. In total, 2,817 women met these criteria and were randomly assigned to answer three experimental questions about their experience of IPV using either the face-to-face or list method. Assignment to the method groups was stratified by geographic and poverty-related strata, as well as treatment status in the randomized control trial, to ensure balance.

The three IPV questions used for the measurement experiment were either directly from or combined several violence questions from the DHS.15 The experimental questions were “In the past 12 months, has your husband

  1. said or done something to humiliate you in front of others?

  2. slapped you, pushed you, shaken you, or thrown something at you?

  3. physically forced you to have sexual intercourse with him when you did not want to?”

Respondents were interviewed one-on-one by a female enumerator in a private setting, usually in the household compound. Enumerators were instructed to pause if anyone came within earshot of the interview, not proceeding until privacy was ensured. The survey structure is outlined in table A.1.

3.2. The Measurement Experiment: Calculating Prevalence

We assess misreporting bias by comparing rates of IPV across the face-to-face and list experiment methods (with a third method, ACASI, also tested in Rwanda and discussed in the supplementary online appendix). Given that assignment to the survey method was random, we assume that the only difference in reported violence between the groups is due to the method used.

Under the list method, prevalence of the sensitive statement is calculated by taking the difference in the mean number of statements reported as true by the two groups (Blair and Imai 2012). To administer the list experiment to the non-sensitive group, respondents are read the same list, bar the fourth sensitive statement, and asked to report how many of the three statements apply to them.16

Prevalence under the list method was estimated using an OLS regression as
where ρ is prevalence, or the difference in the mean number of items reported in the groups, Yi is the number of statements on the list that are true for individual i, α is the mean number of items for the non-sensitive group, and Ti is assignment to the list treatment group.

In Nigeria, 1,405 women were randomly assigned to answer the sensitive questions using the list method group, and 1,412 using the face-to-face method.17 Power calculations indicated that the sample in Nigeria was large enough to detect a difference of approximately 6 percentage points between the groups, and in Rwanda of approximately 5 percentage points, with a significance level of 0.05, power of 0.8, and base prevalence of 20 percent.18

Overall, the groups were well balanced on key characteristics, as shown in table A.3. While several of the raw differences are statistically significant at conventional levels, the differences are small, and they are within the bounds of what might be expected given the number of variables tested for balance. No normalized differences are greater than 0.25 standard deviations, a suggested rule of thumb indicating good balance (Imbens and Rubin 2015). In additional specifications, we estimate prevalence including controls selected using post-double selection LASSO, and variables with baseline imbalance (Belloni, Chernozhukov, and Hansen 2014).19 These do not significantly change results.

4. Results

4.1. Method Matters: IPV Reporting Differs by Survey Method

This paper assesses whether there is a difference in reported rates of IPV across survey methods. A difference in prevalence implies misreporting under at least one of the methods.

Table 1 shows IPV prevalence by survey method in Nigeria. Column 3 shows the difference in prevalence between methods. The p-values for these prevalence difference tests were adjusted for multiple hypothesis testing. Sharpened q-values were computed using the approach proposed by Benjamini and Hochberg (1995) and Anderson (2008).20

Table 1.

Prevalence of Intimate Partner Violence by Survey Method in Nigeria

Difference between pairs of methods
(1)(2)(3)
Face-to-face meanList method prevalence (difference in means)List−F2F (3)−(1)
In the past 12 months, has your husband
 Q1: said or done something to humiliate you in front of others0.299***0.392***0.093**
(0.012)(0.031)[0.013]
N1,3942,817
 Q2: slapped you, pushed you, shaken you, or thrown something at you0.192***0.259***0.067*
(0.011)(0.035)[0.055]
N1,3942,817
 Q3: physically forced you to have sex when you did not want to0.281***0.261***−0.020
(0.012)(0.032)[0.551]
N1,3812,817
Difference between pairs of methods
(1)(2)(3)
Face-to-face meanList method prevalence (difference in means)List−F2F (3)−(1)
In the past 12 months, has your husband
 Q1: said or done something to humiliate you in front of others0.299***0.392***0.093**
(0.012)(0.031)[0.013]
N1,3942,817
 Q2: slapped you, pushed you, shaken you, or thrown something at you0.192***0.259***0.067*
(0.011)(0.035)[0.055]
N1,3942,817
 Q3: physically forced you to have sex when you did not want to0.281***0.261***−0.020
(0.012)(0.032)[0.551]
N1,3812,817

Source: Data from author’s experiment.

Note: The left column shows the violence variables. Columns 1 and 2 show prevalence by method. Robust standard errors are shown in parentheses. Column 3 shows differences in prevalence between the methods. For the test of difference in means, Benjamini and Hochberg (1995) adjusted q-values are displayed in square brackets, from a chi-squared test of the differences in means across groups. Regressions are adjusted for stratification. f2f, face to face. ***, **, and * indicate significance at the 1, 5, and 10 percent critical level.

Table 1.

Prevalence of Intimate Partner Violence by Survey Method in Nigeria

Difference between pairs of methods
(1)(2)(3)
Face-to-face meanList method prevalence (difference in means)List−F2F (3)−(1)
In the past 12 months, has your husband
 Q1: said or done something to humiliate you in front of others0.299***0.392***0.093**
(0.012)(0.031)[0.013]
N1,3942,817
 Q2: slapped you, pushed you, shaken you, or thrown something at you0.192***0.259***0.067*
(0.011)(0.035)[0.055]
N1,3942,817
 Q3: physically forced you to have sex when you did not want to0.281***0.261***−0.020
(0.012)(0.032)[0.551]
N1,3812,817
Difference between pairs of methods
(1)(2)(3)
Face-to-face meanList method prevalence (difference in means)List−F2F (3)−(1)
In the past 12 months, has your husband
 Q1: said or done something to humiliate you in front of others0.299***0.392***0.093**
(0.012)(0.031)[0.013]
N1,3942,817
 Q2: slapped you, pushed you, shaken you, or thrown something at you0.192***0.259***0.067*
(0.011)(0.035)[0.055]
N1,3942,817
 Q3: physically forced you to have sex when you did not want to0.281***0.261***−0.020
(0.012)(0.032)[0.551]
N1,3812,817

Source: Data from author’s experiment.

Note: The left column shows the violence variables. Columns 1 and 2 show prevalence by method. Robust standard errors are shown in parentheses. Column 3 shows differences in prevalence between the methods. For the test of difference in means, Benjamini and Hochberg (1995) adjusted q-values are displayed in square brackets, from a chi-squared test of the differences in means across groups. Regressions are adjusted for stratification. f2f, face to face. ***, **, and * indicate significance at the 1, 5, and 10 percent critical level.

Results show that the survey method used can substantially affect IPV reporting. Overall, the most widely used survey method, face-to-face interviewing, resulted in the lowest reported IPV rates, with the list method having higher rates. In Nigeria, emotional and physical IPV rates were 31 and 35 percent greater, respectively, when measured using the list method (emotional violence: 39.2 percent; physical violence: 25.9 percent) compared to face-to-face (emotional violence: 29.9 percent; physical violence: 19.2 percent). The differences between the methods were statistically significant at the 5 and 10 percent levels respectively. However, there was no statistically significant difference between direct and indirect rates of sexual IPV, with reported prevalence of 28.1 and 26.1 percent respectively.

The second experiment, conducted in Rwanda and discussed in the supplementary online appendix, shows a similar pattern of results. There, the third method tested, ACASI, produced similar or slightly higher rates of sexual and intimate partner violence experience (for women) or perpetration (for men) than the face-to-face method.

Our rates of directly reported IPV are much greater than equivalent rates from Nigeria’s DHS, shown in table 2. For example, in the DHS, only 15.2 percent of women say their partner said or did something to humiliate them in front of others; 9.2 percent of women say they have been slapped; and 4.8 percent say they have been pushed, shaken, or had something thrown at them in the previous 12 months by their partner (DHS 2019). Four percent say that in the past 12 months, they have been physically forced to have sexual intercourse with their partner when they did not want to. While the DHS is conducted in a different, representative sample and thus is not directly comparable to our targeted, highly economically vulnerable sample, the contrast in IPV rates with both our direct and list experiment estimates is striking. Our much greater face-to-face rates than those in the DHS may be due to our different sample, our very private interview setting, or our two-hour long survey covering many other non-sensitive issues that gives more plausible deniability to respondents and allows enumerators to build a strong rapport before asking IPV questions.

Table 2.

Comparison of Women’s Reported Experience of IPV in the Previous 12 Months by Method and Source (in Percent)

Survey experiment
Face-to-faceList experimentACASINationally representative DHS data
Nigeria
Emotional IPV
“Partner said or did something to humiliate you in front of others”29.939.215.2
Physical IPV
“Partner has slapped you, pushed you, shaken you, or thrown something at you”19.225.9“Partner has slapped you”: 9.2;“Partner has pushed you, shaken you, or thrown something at you”: 4.8
Sexual IPV
“Partner physically forced to have sexual intercourse with your partner when you did not want to”28.126.14
Rwanda
Physical IPV
“Partner pushed you or thrown you against something”9.320.68“Partner has pushed you, shaken you, or thrown something at you”: 9.8
Survey experiment
Face-to-faceList experimentACASINationally representative DHS data
Nigeria
Emotional IPV
“Partner said or did something to humiliate you in front of others”29.939.215.2
Physical IPV
“Partner has slapped you, pushed you, shaken you, or thrown something at you”19.225.9“Partner has slapped you”: 9.2;“Partner has pushed you, shaken you, or thrown something at you”: 4.8
Sexual IPV
“Partner physically forced to have sexual intercourse with your partner when you did not want to”28.126.14
Rwanda
Physical IPV
“Partner pushed you or thrown you against something”9.320.68“Partner has pushed you, shaken you, or thrown something at you”: 9.8

Source: DHS (2019), DHS (2021) and author’s own data.

Note: This table shows IPV prevalence from this survey experiment by method, and compares this to IPV rates reported face-to-face in nationally representative Demographic and Health Surveys in Nigeria and Rwanda. Reported in percent. DHS, Demographic and Heath Survey; ACASI, Audio computer assisted self-interview

Table 2.

Comparison of Women’s Reported Experience of IPV in the Previous 12 Months by Method and Source (in Percent)

Survey experiment
Face-to-faceList experimentACASINationally representative DHS data
Nigeria
Emotional IPV
“Partner said or did something to humiliate you in front of others”29.939.215.2
Physical IPV
“Partner has slapped you, pushed you, shaken you, or thrown something at you”19.225.9“Partner has slapped you”: 9.2;“Partner has pushed you, shaken you, or thrown something at you”: 4.8
Sexual IPV
“Partner physically forced to have sexual intercourse with your partner when you did not want to”28.126.14
Rwanda
Physical IPV
“Partner pushed you or thrown you against something”9.320.68“Partner has pushed you, shaken you, or thrown something at you”: 9.8
Survey experiment
Face-to-faceList experimentACASINationally representative DHS data
Nigeria
Emotional IPV
“Partner said or did something to humiliate you in front of others”29.939.215.2
Physical IPV
“Partner has slapped you, pushed you, shaken you, or thrown something at you”19.225.9“Partner has slapped you”: 9.2;“Partner has pushed you, shaken you, or thrown something at you”: 4.8
Sexual IPV
“Partner physically forced to have sexual intercourse with your partner when you did not want to”28.126.14
Rwanda
Physical IPV
“Partner pushed you or thrown you against something”9.320.68“Partner has pushed you, shaken you, or thrown something at you”: 9.8

Source: DHS (2019), DHS (2021) and author’s own data.

Note: This table shows IPV prevalence from this survey experiment by method, and compares this to IPV rates reported face-to-face in nationally representative Demographic and Health Surveys in Nigeria and Rwanda. Reported in percent. DHS, Demographic and Heath Survey; ACASI, Audio computer assisted self-interview

Table 3.

Correlations between IPV and Respondent Characteristics across Survey Methods

Q1: emotional violenceQ2: physical violenceQ3: sexual violenceMethod differences: p-values from tests of differences
(1)(2)(3)(4)(5)(6)(1)−(2)(3)−(4)(5)−(6)
ListF2FListF2FListF2FQ1Q2Q3
Variable
 Vulnerability index0.02−0.08***0.09−0.05***0.04−0.05**0.130.040.23
(0.06)(0.02)(0.07)(0.02)(0.07)(0.02)
 Unequal-village-gender-norms index0.050.08***0.09*0.05***0.08*0.05***0.550.440.55
(0.04)(0.02)(0.05)(0.02)(0.04)(0.02)
 Relationship-quality index−0.07−0.21***−0.02−0.21***−0.05−0.27***0.110.040.01
(0.08)(0.03)(0.09)(0.02)(0.08)(0.03)
 Public-speaking-confidence index−0.02−0.08***0.00−0.03***0.00−0.010.120.390.69
(0.04)(0.01)(0.04)(0.01)(0.04)(0.01)
 Progressive-gender-attitudes index−0.12*−0.13***−0.23***−0.09***−0.16**−0.16***0.920.101.00
(0.07)(0.03)(0.08)(0.03)(0.07)(0.03)
Bargaining power measures
 Wife’s age0.000.00**0.000.000.00−0.000.570.490.59
(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)
 Wife reads and writes0.12−0.08**0.00−0.12***0.11−0.040.030.190.13
(0.08)(0.03)(0.10)(0.02)(0.09)(0.03)
 Monogamous0.07−0.040.16**−0.030.12*−0.05*0.120.010.02
(0.06)(0.03)(0.07)(0.02)(0.07)(0.02)
 Polygamous: wife number (1–4)−0.030.020.060.030.13−0.020.560.760.13
(0.08)(0.04)(0.10)(0.03)(0.09)(0.03)
 Wife brought assets to marriage−0.04−0.18***0.04−0.13***−0.02−0.09***0.040.030.33
(0.06)(0.02)(0.07)(0.02)(0.07)(0.02)
Q1: emotional violenceQ2: physical violenceQ3: sexual violenceMethod differences: p-values from tests of differences
(1)(2)(3)(4)(5)(6)(1)−(2)(3)−(4)(5)−(6)
ListF2FListF2FListF2FQ1Q2Q3
Variable
 Vulnerability index0.02−0.08***0.09−0.05***0.04−0.05**0.130.040.23
(0.06)(0.02)(0.07)(0.02)(0.07)(0.02)
 Unequal-village-gender-norms index0.050.08***0.09*0.05***0.08*0.05***0.550.440.55
(0.04)(0.02)(0.05)(0.02)(0.04)(0.02)
 Relationship-quality index−0.07−0.21***−0.02−0.21***−0.05−0.27***0.110.040.01
(0.08)(0.03)(0.09)(0.02)(0.08)(0.03)
 Public-speaking-confidence index−0.02−0.08***0.00−0.03***0.00−0.010.120.390.69
(0.04)(0.01)(0.04)(0.01)(0.04)(0.01)
 Progressive-gender-attitudes index−0.12*−0.13***−0.23***−0.09***−0.16**−0.16***0.920.101.00
(0.07)(0.03)(0.08)(0.03)(0.07)(0.03)
Bargaining power measures
 Wife’s age0.000.00**0.000.000.00−0.000.570.490.59
(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)
 Wife reads and writes0.12−0.08**0.00−0.12***0.11−0.040.030.190.13
(0.08)(0.03)(0.10)(0.02)(0.09)(0.03)
 Monogamous0.07−0.040.16**−0.030.12*−0.05*0.120.010.02
(0.06)(0.03)(0.07)(0.02)(0.07)(0.02)
 Polygamous: wife number (1–4)−0.030.020.060.030.13−0.020.560.760.13
(0.08)(0.04)(0.10)(0.03)(0.09)(0.03)
 Wife brought assets to marriage−0.04−0.18***0.04−0.13***−0.02−0.09***0.040.030.33
(0.06)(0.02)(0.07)(0.02)(0.07)(0.02)

Source: Author’s analysis based on own data.

Note: Robust standard errors are shown in parentheses. Regressions are adjusted for sampling stratification. Columns 1–6 show the relevant covariate coefficients from separate regressions. Columns 7–9 show the p-values from a chi-squared test of the differences between these coefficients. The vulnerability index includes measures of food poverty and household economic insecurity, and wife’s stress. The unequal-village-gender-norms index includes beliefs about unequal gender attitudes held by others in the village. The relationship-quality index includes measures of trust, satisfaction, empathy, and communication in the marriage. The public-speaking-confidence index includes measures of willingness to speak up in public. The progressive-gender-attitudes index includes measures of personal views on the acceptability of IPV and on gender roles. IPV, Intimate partner violence; F2F, face to face. ***p < 0.01, **p < 0.05, *p < 0.1.

Table 3.

Correlations between IPV and Respondent Characteristics across Survey Methods

Q1: emotional violenceQ2: physical violenceQ3: sexual violenceMethod differences: p-values from tests of differences
(1)(2)(3)(4)(5)(6)(1)−(2)(3)−(4)(5)−(6)
ListF2FListF2FListF2FQ1Q2Q3
Variable
 Vulnerability index0.02−0.08***0.09−0.05***0.04−0.05**0.130.040.23
(0.06)(0.02)(0.07)(0.02)(0.07)(0.02)
 Unequal-village-gender-norms index0.050.08***0.09*0.05***0.08*0.05***0.550.440.55
(0.04)(0.02)(0.05)(0.02)(0.04)(0.02)
 Relationship-quality index−0.07−0.21***−0.02−0.21***−0.05−0.27***0.110.040.01
(0.08)(0.03)(0.09)(0.02)(0.08)(0.03)
 Public-speaking-confidence index−0.02−0.08***0.00−0.03***0.00−0.010.120.390.69
(0.04)(0.01)(0.04)(0.01)(0.04)(0.01)
 Progressive-gender-attitudes index−0.12*−0.13***−0.23***−0.09***−0.16**−0.16***0.920.101.00
(0.07)(0.03)(0.08)(0.03)(0.07)(0.03)
Bargaining power measures
 Wife’s age0.000.00**0.000.000.00−0.000.570.490.59
(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)
 Wife reads and writes0.12−0.08**0.00−0.12***0.11−0.040.030.190.13
(0.08)(0.03)(0.10)(0.02)(0.09)(0.03)
 Monogamous0.07−0.040.16**−0.030.12*−0.05*0.120.010.02
(0.06)(0.03)(0.07)(0.02)(0.07)(0.02)
 Polygamous: wife number (1–4)−0.030.020.060.030.13−0.020.560.760.13
(0.08)(0.04)(0.10)(0.03)(0.09)(0.03)
 Wife brought assets to marriage−0.04−0.18***0.04−0.13***−0.02−0.09***0.040.030.33
(0.06)(0.02)(0.07)(0.02)(0.07)(0.02)
Q1: emotional violenceQ2: physical violenceQ3: sexual violenceMethod differences: p-values from tests of differences
(1)(2)(3)(4)(5)(6)(1)−(2)(3)−(4)(5)−(6)
ListF2FListF2FListF2FQ1Q2Q3
Variable
 Vulnerability index0.02−0.08***0.09−0.05***0.04−0.05**0.130.040.23
(0.06)(0.02)(0.07)(0.02)(0.07)(0.02)
 Unequal-village-gender-norms index0.050.08***0.09*0.05***0.08*0.05***0.550.440.55
(0.04)(0.02)(0.05)(0.02)(0.04)(0.02)
 Relationship-quality index−0.07−0.21***−0.02−0.21***−0.05−0.27***0.110.040.01
(0.08)(0.03)(0.09)(0.02)(0.08)(0.03)
 Public-speaking-confidence index−0.02−0.08***0.00−0.03***0.00−0.010.120.390.69
(0.04)(0.01)(0.04)(0.01)(0.04)(0.01)
 Progressive-gender-attitudes index−0.12*−0.13***−0.23***−0.09***−0.16**−0.16***0.920.101.00
(0.07)(0.03)(0.08)(0.03)(0.07)(0.03)
Bargaining power measures
 Wife’s age0.000.00**0.000.000.00−0.000.570.490.59
(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)
 Wife reads and writes0.12−0.08**0.00−0.12***0.11−0.040.030.190.13
(0.08)(0.03)(0.10)(0.02)(0.09)(0.03)
 Monogamous0.07−0.040.16**−0.030.12*−0.05*0.120.010.02
(0.06)(0.03)(0.07)(0.02)(0.07)(0.02)
 Polygamous: wife number (1–4)−0.030.020.060.030.13−0.020.560.760.13
(0.08)(0.04)(0.10)(0.03)(0.09)(0.03)
 Wife brought assets to marriage−0.04−0.18***0.04−0.13***−0.02−0.09***0.040.030.33
(0.06)(0.02)(0.07)(0.02)(0.07)(0.02)

Source: Author’s analysis based on own data.

Note: Robust standard errors are shown in parentheses. Regressions are adjusted for sampling stratification. Columns 1–6 show the relevant covariate coefficients from separate regressions. Columns 7–9 show the p-values from a chi-squared test of the differences between these coefficients. The vulnerability index includes measures of food poverty and household economic insecurity, and wife’s stress. The unequal-village-gender-norms index includes beliefs about unequal gender attitudes held by others in the village. The relationship-quality index includes measures of trust, satisfaction, empathy, and communication in the marriage. The public-speaking-confidence index includes measures of willingness to speak up in public. The progressive-gender-attitudes index includes measures of personal views on the acceptability of IPV and on gender roles. IPV, Intimate partner violence; F2F, face to face. ***p < 0.01, **p < 0.05, *p < 0.1.

Our results are comparable to those from several related studies. Lépine, Treibich, and d’Exelle (2020) find significant face-to-face underreporting of physical IPV in rural Burkina Faso (16–20 percentage point difference between face-to-face and list method prevalence), as do Bulte and Lensink (2019) in Vietnam (10 percentage point difference), and Joseph et al. (2017) in India (9 percentage points). However, our results diverge from those of the one other study that also adopts our one-method-per-person approach. In Lima, Peru, Agüero and Frisancho (2022) found no overall difference between face-to-face and list method physical and sexual IPV prevalence. These divergent results indicate that reporting bias is, unsurprisingly, strongly affected by contextual factors. Differences in misreporting may be due to the social norms and degree of stigma women faced in each context (e.g. urban Lima versus rural Nigeria), or differences in the interview context (e.g. privacy of the interview setting or respondents’ faith in the privacy of their data). Significant reporting differences have also been found in other surveys, for example underreporting of condom use or sexual assault during war (Jamison et al. 2013; Traunmüller, Kijewski, and Freitag 2019), suggesting that sexual and health topics in particular may be prone to reporting bias; the direct method–list method differences appear to be much smaller in many political science studies (Blair, Coppock, and Moor 2020).

Overall, our results suggest that women’s experience of emotional and physical IPV may be substantially underreported when measured using direct methods. If the difference in prevalence between the list and direct methods is a proxy for fear, shame, or social desirability bias, then these results also suggest that questions about emotional and physical IPV were more sensitive than sexual IPV in our setting. This surprising result could potentially be explained by the context. In Nigeria, a quarter of the women interviewed using the list method and a quarter interviewed using the face-to-face method reported experiencing sexual IPV in the past year. Qualitative interviews and focus groups suggested that there may be less stigma associated with reporting sexual IPV than physical IPV. Participants in our predominantly Muslim study sites indicated that religious leaders cautioned male followers not to use physical violence against their wives, but that sexual IPV was viewed differently and was not considered violence. Further, DHS statistics show that women in this region have an extremely high acceptance of violence related to sex, in particular. In Kebbi state’s statistics on acceptance of wife beating, the worst in the country, 87 percent of women believe it is acceptable for a man to beat his wife if she refuses to have sex with him, while 67 percent believe the same if she burns the food (DHS 2019). This very widespread belief that husbands are entitled to sex from their wives suggests that sexual violence may not be particularly stigmatized in this context and therefore women may feel as comfortable reporting it face-to-face as with the list method. The more normalized reporting of sexual IPV may also be a coping mechanism in a context where this form of IPV is widespread and tolerated. A potential alternative explanation could be that some might consider sexual violence as resulting from a husband’s passion rather than anger, potentially making it less shameful to report than physical or emotional violence, which can be seen as disciplining errant wives (Benebo, Schumann, and Vaezghasemi 2018).

In the study setting, IPV, including marital rape, is not criminalized. It seems likely that the different legal, religious, and social treatment of emotional, physical, and sexual IPV may contribute to their relative reporting sensitivity.

4.2. Characteristics of Misreporting

In this section, we briefly consider whether misreporting is random, and if not, identify the characteristics that predict it. To do this, we assess correlations between respondent characteristics and IPV, as reported under each of the survey methods. When the IPV-characteristic coefficient differs across methods, this implies systematic misreporting under at least one of the methods that is associated with the characteristic. This analysis provides suggestive evidence on the type of person that might misreport, potential reasons for misreporting, and the risks of drawing biased inferences from such data.

4.2.1. Estimation: Characteristics Associated with Misreporting

For those assigned to the face-to-face group, we assess the characteristics of those reporting IPV by regressing women’s self-reported experience of IPV against the variable of interest, such as years of education, as
For the list method, we regress the number of list items reported by an individual against an indicator variable for whether they were in the list sensitive group, as well as the variable of interest and the interaction term. This regression model is
where Yi is the number of true statements for individual i, Ti is assignment to the sensitive list group, xi is the characteristic, and δ is the parameter of interest.21 If there is no (or consistent) misreporting across the methods, the relevant coefficients for each method should be equal. If there are differences, this implies that the relationship between IPV and the characteristic is sensitive to the measurement method, and we may be estimating biased descriptive and causal relationships.

4.2.2. Results: Respondent Characteristics Predict Misreporting

We analyze misreporting by respondent characteristics that fall into three overlapping categories. These characteristics include (a) those most likely to motivate respondents to misreport (to understand why people misreport IPV), (b) variables typically targeted in IPV prevention interventions (to evaluate risks of estimating biased treatment effects), and (c) characteristics frequently cited as IPV risk factors. The first two categories of characteristics include perceptions of village gender norms, poverty and vulnerability, marital relationship quality, and personal gender attitudes. Additional risk factors considered include women’s education and literacy, age, labor-force participation, and other indicators of women’s bargaining power, where theory and empirical evidence are mixed on the relationship with IPV (Eswaran and Malhotra 2011; Bobonis, González-Brenes, and Castro 2013; Heise and Kotsadam 2015; Buller et al. 2018; Erten and Keskin 2018; Haushofer et al. 2019).

Table 3 shows the relationships between IPV and respondent characteristics by survey method. Results show several differences in coefficients across the methods, implying that a number of characteristics are associated with systematic misreporting of IPV.22 These results are also shown graphically in fig. A.1. For ease of interpretation, in table A.4 we also show the differences in mean IPV prevalence for respondents above and below the median of the characteristic, or by their indicator variable characteristic. However, given we are likely underpowered to detect the differences in this table, results should be interpreted with caution.

This subsection presents the results of descriptive analysis and, as such, the usual caution should be applied when interpreting results given that omitted variables or other sources of bias may be driving observed correlations.23

Our results suggest that vulnerable women were more likely to report physical IPV under the list method than face-to-face. For emotional and sexual IPV, the same pattern existed but the differences were not statistically significant.

Greater marital relationship quality, measured using questions about satisfaction, empathy, and trust in marriage, was also associated with underreporting. We found that women who reported greater relationship quality were more likely to report IPV when interviewed using the list method than face-to-face; the method differences were statistically significant on two out of the three IPV questions considered. A potential reason for women who report being in a more trusting, supportive relationship to underreport IPV when asked directly is because the direct question forces her to confront an uncomfortable truth about her marriage, or she may feel disloyal by directly admitting her husband has been violent.

These results highlight a potential measurement bias issue for the growing number of policies being assessed on their IPV impact. For example, an intervention that improves relationship quality, as several IPV prevention programs seek to do, may have no impact on actual IPV rates. However, it may lead to a reduction in a wife’s willingness to directly report IPV for the reasons discussed above. This would bias treatment effect estimates, suggesting a reduction in IPV that was, in fact, an artefact of the survey method used.

We also examined whether common correlates of IPV remain stable across the method used. For some indicators of women’s intrahousehold bargaining power, we find some evidence of systematic misreporting of IPV, though no clear pattern across different indicators.

Concerningly, we observe some opposite-signed relationships, where the sign on the relationship between IPV and the respondent’s characteristic is positive for one method and negative for another. In Nigeria, these variables include wife’s vulnerability, literacy, and assets brought into the marriage. For example, we find suggestive evidence of a positive relationship between IPV and women’s education when IPV is measured directly, but a negative relationship when using the list method. We observe a similar relationship in Rwanda. This is also similar to results in Peru and India, where researchers found women’s education to be positively correlated with IPV when measured using the list method but negatively when measured face-to-face (Joseph et al. 2017; Agüero and Frisancho 2022). The opposite-signed IPV–education associations would imply opposing theories about women’s bargaining power as a cause of IPV.24

Overall, the sensitivity of the relationships between IPV and several bargaining power and other indicators to the survey method used suggests we should be cautious when using direct IPV measures to understand descriptive and causal relationships, as misreporting may bias correlations and treatment effect estimates. The anonymity of the list experiment may minimize this reporting bias, though at the cost of precision.

4.3. Robustness Checks

In this section we report the results of robustness checks on the list method, followed by checks on the overall analysis.

4.3.1. Checks on List Method Assumptions

There are two assumptions that underpin the validity of statistical analyses of list experiments, and which, if they hold, should result in an unbiased difference-in-means estimator (Blair and Imai 2012). The first assumption is that of “no design effect,” meaning that the inclusion of the sensitive item in the list does not affect answers to non-sensitive items. The second assumption is of “no liars,” or that the answers given for the sensitive item are true.

Blair and Imai (2012) propose a likelihood ratio test to assess the probability that the first assumption is violated. The test is based on the assumption that the inclusion of a sensitive statement should not change the sum of affirmative answers to the non-sensitive statements, and that cumulative proportions of different respondent types should be non-negative (Bulte and Lensink 2019). If one of the proportions is negative, then the no-design-effect assumption has likely been violated. The rejection of the null hypothesis of no design effect using Blair and Imai’s statistical test would suggest there was a design effect. The test returns a Bonferroni-corrected minimum p-value of the test (Blair and Imai 2011).25

In Nigeria, the test returned a minimum p-value of 1, suggesting there was no design effect for the three list questions.26

There is no explicit test for the second “no liars” assumption. However, the most obvious threat to this assumption is when a respondent’s anonymity for their answer to the sensitive statement is compromised. This “ceiling effect” exists when a respondent’s true answer is affirmative for all of the statements, thus their affirmative response to the sensitive statement will no longer be hidden. The proportion of respondents answering with the maximum number of items (three and four for the non-sensitive and sensitive list groups, respectively) is within the range of other studies.27 The presence of a ceiling effect likely indicates downwards-biased list estimates because other respondents would be likely to have understated their true number of affirmative answers to conceal their sensitive answer (Blair and Imai 2012).

4.3.2. Checks on Overall Approach

The two key assumptions underpinning the unbiasedness of the list experiment estimator have parallels for the experiment as a whole.

First, we checked balance on survey questions asked of all respondents after the experiment questions. If respondents were differentially affected by one survey method, perhaps indicating a design effect, we may observe them answering subsequent survey questions differently. We found very few statistically significant differences in answers to subsequent survey questions despite the many tests for significance, and any differences were small.

Second, we analyzed the main list method prevalence regressions with enumerator fixed effects, controls for pre-experiment imbalance, and controls selected using post-double-selection LASSO (Belloni, Chernozhukov, and Hansen 2014). Estimates remained consistent, with few differences in coefficients or significance, as shown in table A.7.

Third, although a general “no liars” assumption is untestable, we compared refusal rates across methods to indicate the potential relative risk of misreporting. We find very low refusal rates, the highest being 0.64 percent for the face-to-face sexual violence question. Refusal rates are shown in table A.8.

Additional indications that these results are robust are supported by the broadly similar patterns of misreporting by survey method observed over different violence types, countries, and list method designs, and in Rwanda, different genders, and perpetration and victimization questions.

5. Conclusion

This paper provided some of the first evidence that the prevalence of intimate partner violence can vary substantially depending on the survey method used to measure it. By randomizing individuals to answer using only one method, we avoid some pitfalls from earlier studies. Our results suggest that in some contexts, standard survey methods are likely to result in significant underestimates of the prevalence and welfare costs of IPV. The most widely used method, face-to-face interview, resulted in the lowest prevalence estimates, followed by the more private ACASI method (tested in Rwanda). Finally, the list method, which allows respondents to report anonymously, resulted in the highest prevalence. In Nigeria, women’s experience of physical IPV was 35 percent greater when measured using the list method than the direct method, and 180 percent greater than physical IPV rates reported in Nigeria’s DHS.

This paper also showed that misreporting was systematic and was correlated with indicators that theory suggests are risk factors for IPV and those often targeted in women’s empowerment programs, including vulnerability, marital quality, and education.

As other studies have found, direct measures of IPV can produce biased treatment effect estimates, for example, finding programs caused a reduction in IPV if measured face-to-face, but an increase if measured with the list experiment (Bulte and Lensink 2019). Given the growing number of studies measuring program impacts on IPV, our results highlight the potential risk of collecting measurement-error-prone data in generating unbiased correlations and treatment effect estimates.28

Our finding of significant and systematic underreporting for some types of IPV in our rural African contexts contrasts with findings from Lima, Peru, where researchers found no overall difference between face-to-face and list method prevalence (Agüero and Frisancho 2022), and with research on sensitive topics in political science, which typically find lower levels of misreporting (Blair, Coppock, and Moor 2020). However, our findings are consistent with IPV misreporting observed in rural Burkina Faso, Vietnam, and India (Joseph et al. 2017; Bulte and Lensink 2019; Lépine, Treibich, and d’Exelle 2020), and in other sensitive health and sexual behavior surveys (Jamison et al. 2013; Traunmüller, Kijewski, and Freitag 2019). Overall, our results indicate that misreporting bias is, unsurprisingly, strongly affected by contextual factors, but can be substantial, and varies by type of violence.

How then should we measure IPV, given our results suggest that IPV is likely to be underreported when measured using the status quo face-to-face method? Policy makers and social scientists measuring sensitive topics could use similar randomized survey experiments to compare IPV reporting rates between the list experiment and direct methods to identify the magnitude of stigma, the types of people most likely to misreport, and the risk of generating biased treatment effects. For example, social scientists researching the impact of interventions on IPV might use such an experiment to sign potential reporting bias in their treatment effect estimates. Also, given DHS’s large sample sizes and use in multi-country analysis, perhaps such surveys should measure several IPV questions using the list method as well as the face-to-face method. This would allow data users to assess the likely extent of reporting bias by context, allowing them to make adjustments to inferences about true levels and correlates of IPV.29

While the list experiment may reduce reporting bias, the method presents some analytical and administration challenges that may undermine its suitability for measuring IPV in some circumstances, and suggests the need for research into alternative approaches that also reduce misreporting. The larger variance associated with list experiment prevalence estimates makes it difficult to use as an outcome measure in smaller studies. Further, it is more time consuming to administer and requires patient, well-trained enumeration. Finally, there are ongoing debates about the method’s internal consistency and optimal design.

We suggest that social scientists measuring IPV consider using both direct questions and the list experiment to test the likely magnitude of underreporting; use the double list experiment to both reduce variance in prevalence estimates, and to apply the internal consistency test proposed by Chuang et al. (2021); and test alternative methods and survey administration conditions that may minimize reporting bias (for example, testing the effect of enumerators more clearly explaining data confidentiality and ensuring interview privacy, or of incentivizing and monitoring empathetic enumeration). Further research could also explore the feasibility and reliability of alternative survey methods such as the ballot method, or of other non-self-reported indicators of IPV, such as brain imaging or stress hormones.

6. Data Availability

The data underlying this article were provided by the World Bank and Government of Rwanda by permission. Data will be shared on request to the author with permission of the World Bank and Government of Rwanda.

Appendix

 

Nigeria: Relationships between IPV and Respondent Characteristics across Survey Methods
Figure A.1.

Nigeria: Relationships between IPV and Respondent Characteristics across Survey Methods

Source: Author's analysis from author's data.

Note: The figure shows correlation coefficients between IPV and respondent characteristic, by survey method.

Table A.1.

Nigeria: Survey Structure

Five direct IPV questions
Consent form
Household roster—demographics
Business & income
Labor
Expenditure
Assets & housing
Nutrition & food security
Plot & crop roster
Happiness, shocks, & family history
Savings & loans
Gender attitudes & relationship quality
Three experimental IPV questions
Face-to-faceList method
Three non-sensitive sets of list questionsThree sensitive sets of list questions
Women’s empowerment in agriculture
Three direct experimental questions
Five direct IPV questions
Anthropometric measures
Five direct IPV questions
Consent form
Household roster—demographics
Business & income
Labor
Expenditure
Assets & housing
Nutrition & food security
Plot & crop roster
Happiness, shocks, & family history
Savings & loans
Gender attitudes & relationship quality
Three experimental IPV questions
Face-to-faceList method
Three non-sensitive sets of list questionsThree sensitive sets of list questions
Women’s empowerment in agriculture
Three direct experimental questions
Five direct IPV questions
Anthropometric measures

Source: Author’s survey.

Note: This table shows the survey structure in Nigeria. IPV, Intimate partner violence.

Table A.1.

Nigeria: Survey Structure

Five direct IPV questions
Consent form
Household roster—demographics
Business & income
Labor
Expenditure
Assets & housing
Nutrition & food security
Plot & crop roster
Happiness, shocks, & family history
Savings & loans
Gender attitudes & relationship quality
Three experimental IPV questions
Face-to-faceList method
Three non-sensitive sets of list questionsThree sensitive sets of list questions
Women’s empowerment in agriculture
Three direct experimental questions
Five direct IPV questions
Anthropometric measures
Five direct IPV questions
Consent form
Household roster—demographics
Business & income
Labor
Expenditure
Assets & housing
Nutrition & food security
Plot & crop roster
Happiness, shocks, & family history
Savings & loans
Gender attitudes & relationship quality
Three experimental IPV questions
Face-to-faceList method
Three non-sensitive sets of list questionsThree sensitive sets of list questions
Women’s empowerment in agriculture
Three direct experimental questions
Five direct IPV questions
Anthropometric measures

Source: Author’s survey.

Note: This table shows the survey structure in Nigeria. IPV, Intimate partner violence.

Table A.2.

Non-sensitive Questions

Nigeria - HOW MANY of the following 3 things have happened to you in the past 12 months?
Q1
 You purchased a fan
 You went to Abuja with a family relative
 You participated in a marriage celebration in a neighborhood household
Q2
 You gave a gift to another woman who gave birth in the village
 You talked to a health worker about your health
 You talked to the village leader
Q3
 You participated in a name giving ceremony in a neighborhood household
 You went to Kano
 You took care of a sick family member who was unable to care for themselves
Rwanda - Has this happened to you in the past 12 months/ ever?
Women Q1
 Have you taken care of a sick family member in the past 12 months?
 Have you gone to a health worker and talked about your health in the past 12 months?
 Have you forgotten to return a borrowed item in the past 12 months?
Women Q2
 Have you ever borrowed a neighbor’s tool for farm work?
 Have you ever traveled to another village for a week or more for work?
 Have you ever used a phone to transfer mobile phone credit to a friend?
Men Q1
 Have you gone to a health worker and talked about your health in the past 12 months?
 Have you forgotten to return a borrowed item in the past 12 months?
 Have you talked to a friend about their health troubles in the past 12 months?
Men Q2
 Have you borrowed a neighbor’s tool for farm work in the past 12 months?
 Have you traveled to another village for work for a week or more in the past 12 months?
 Have you used a phone to transfer mobile phone credit to a friend in the past 12 months?
Nigeria - HOW MANY of the following 3 things have happened to you in the past 12 months?
Q1
 You purchased a fan
 You went to Abuja with a family relative
 You participated in a marriage celebration in a neighborhood household
Q2
 You gave a gift to another woman who gave birth in the village
 You talked to a health worker about your health
 You talked to the village leader
Q3
 You participated in a name giving ceremony in a neighborhood household
 You went to Kano
 You took care of a sick family member who was unable to care for themselves
Rwanda - Has this happened to you in the past 12 months/ ever?
Women Q1
 Have you taken care of a sick family member in the past 12 months?
 Have you gone to a health worker and talked about your health in the past 12 months?
 Have you forgotten to return a borrowed item in the past 12 months?
Women Q2
 Have you ever borrowed a neighbor’s tool for farm work?
 Have you ever traveled to another village for a week or more for work?
 Have you ever used a phone to transfer mobile phone credit to a friend?
Men Q1
 Have you gone to a health worker and talked about your health in the past 12 months?
 Have you forgotten to return a borrowed item in the past 12 months?
 Have you talked to a friend about their health troubles in the past 12 months?
Men Q2
 Have you borrowed a neighbor’s tool for farm work in the past 12 months?
 Have you traveled to another village for work for a week or more in the past 12 months?
 Have you used a phone to transfer mobile phone credit to a friend in the past 12 months?

Source: Author’s survey.

Note: This table lists the non-sensitive questions used in the list experiments in Nigeria and Rwanda.

Table A.2.

Non-sensitive Questions

Nigeria - HOW MANY of the following 3 things have happened to you in the past 12 months?
Q1
 You purchased a fan
 You went to Abuja with a family relative
 You participated in a marriage celebration in a neighborhood household
Q2
 You gave a gift to another woman who gave birth in the village
 You talked to a health worker about your health
 You talked to the village leader
Q3
 You participated in a name giving ceremony in a neighborhood household
 You went to Kano
 You took care of a sick family member who was unable to care for themselves
Rwanda - Has this happened to you in the past 12 months/ ever?
Women Q1
 Have you taken care of a sick family member in the past 12 months?
 Have you gone to a health worker and talked about your health in the past 12 months?
 Have you forgotten to return a borrowed item in the past 12 months?
Women Q2
 Have you ever borrowed a neighbor’s tool for farm work?
 Have you ever traveled to another village for a week or more for work?
 Have you ever used a phone to transfer mobile phone credit to a friend?
Men Q1
 Have you gone to a health worker and talked about your health in the past 12 months?
 Have you forgotten to return a borrowed item in the past 12 months?
 Have you talked to a friend about their health troubles in the past 12 months?
Men Q2
 Have you borrowed a neighbor’s tool for farm work in the past 12 months?
 Have you traveled to another village for work for a week or more in the past 12 months?
 Have you used a phone to transfer mobile phone credit to a friend in the past 12 months?
Nigeria - HOW MANY of the following 3 things have happened to you in the past 12 months?
Q1
 You purchased a fan
 You went to Abuja with a family relative
 You participated in a marriage celebration in a neighborhood household
Q2
 You gave a gift to another woman who gave birth in the village
 You talked to a health worker about your health
 You talked to the village leader
Q3
 You participated in a name giving ceremony in a neighborhood household
 You went to Kano
 You took care of a sick family member who was unable to care for themselves
Rwanda - Has this happened to you in the past 12 months/ ever?
Women Q1
 Have you taken care of a sick family member in the past 12 months?
 Have you gone to a health worker and talked about your health in the past 12 months?
 Have you forgotten to return a borrowed item in the past 12 months?
Women Q2
 Have you ever borrowed a neighbor’s tool for farm work?
 Have you ever traveled to another village for a week or more for work?
 Have you ever used a phone to transfer mobile phone credit to a friend?
Men Q1
 Have you gone to a health worker and talked about your health in the past 12 months?
 Have you forgotten to return a borrowed item in the past 12 months?
 Have you talked to a friend about their health troubles in the past 12 months?
Men Q2
 Have you borrowed a neighbor’s tool for farm work in the past 12 months?
 Have you traveled to another village for work for a week or more in the past 12 months?
 Have you used a phone to transfer mobile phone credit to a friend in the past 12 months?

Source: Author’s survey.

Note: This table lists the non-sensitive questions used in the list experiments in Nigeria and Rwanda.

Table A.3.

Nigeria: Balance across Two Survey Method Groups

(1)(2)
Face-to-faceList sensitive group
VariableNMean/SENMean/SENormalized difference (1)−(2)F-test for joint orthogonality
Wife can read and write1,4120.1591,4050.1320.080.04**
[0.010][0.009]
Number of children1,4124.6741,4054.722−0.020.60
[0.067][0.065]
Age1,41235.1181,40535.0110.010.80
[0.298][0.288]
In polygamous marriage1,4120.4141,4050.3960.040.35
[0.013][0.013]
In paid work last week1,4120.0041,4050.010−0.070.07*
[0.002][0.003]
Number of stoves household owns1,4120.0241,4050.0010.0770.041**
[0.011][0.001]
Household experienced accidents/disasters past year1,4123.0001,4053.070−0.060.09*
[0.030][0.029]
Current value of household assets (Naira)1,41222,689.1221,40521,154.0320.050.18
[846.079][755.367]
Husband’s age when first married1,41022.9941,40324.243−0.060.09*
[0.159][0.728]
Number of gender equitable attitudes held1,4122.5051,4052.528−0.040.30
[0.016][0.016]
Locus of control1,41210.7941,40510.6310.060.14
[0.077][0.078]
(1)(2)
Face-to-faceList sensitive group
VariableNMean/SENMean/SENormalized difference (1)−(2)F-test for joint orthogonality
Wife can read and write1,4120.1591,4050.1320.080.04**
[0.010][0.009]
Number of children1,4124.6741,4054.722−0.020.60
[0.067][0.065]
Age1,41235.1181,40535.0110.010.80
[0.298][0.288]
In polygamous marriage1,4120.4141,4050.3960.040.35
[0.013][0.013]
In paid work last week1,4120.0041,4050.010−0.070.07*
[0.002][0.003]
Number of stoves household owns1,4120.0241,4050.0010.0770.041**
[0.011][0.001]
Household experienced accidents/disasters past year1,4123.0001,4053.070−0.060.09*
[0.030][0.029]
Current value of household assets (Naira)1,41222,689.1221,40521,154.0320.050.18
[846.079][755.367]
Husband’s age when first married1,41022.9941,40324.243−0.060.09*
[0.159][0.728]
Number of gender equitable attitudes held1,4122.5051,4052.528−0.040.30
[0.016][0.016]
Locus of control1,41210.7941,40510.6310.060.14
[0.077][0.078]

Source: Data from author’s experiment.

Note: This table shows balance on key respondent characteristics in Nigeria across those randomly assigned to the different survey groups. Brackets show standard errors. Strata fixed effects are included. The value displayed for F-tests are F-statistics for the differences in means across groups. ***, **, and * indicate significance at the 1, 5, and 10 percent critical level.

Table A.3.

Nigeria: Balance across Two Survey Method Groups

(1)(2)
Face-to-faceList sensitive group
VariableNMean/SENMean/SENormalized difference (1)−(2)F-test for joint orthogonality
Wife can read and write1,4120.1591,4050.1320.080.04**
[0.010][0.009]
Number of children1,4124.6741,4054.722−0.020.60
[0.067][0.065]
Age1,41235.1181,40535.0110.010.80
[0.298][0.288]
In polygamous marriage1,4120.4141,4050.3960.040.35
[0.013][0.013]
In paid work last week1,4120.0041,4050.010−0.070.07*
[0.002][0.003]
Number of stoves household owns1,4120.0241,4050.0010.0770.041**
[0.011][0.001]
Household experienced accidents/disasters past year1,4123.0001,4053.070−0.060.09*
[0.030][0.029]
Current value of household assets (Naira)1,41222,689.1221,40521,154.0320.050.18
[846.079][755.367]
Husband’s age when first married1,41022.9941,40324.243−0.060.09*
[0.159][0.728]
Number of gender equitable attitudes held1,4122.5051,4052.528−0.040.30
[0.016][0.016]
Locus of control1,41210.7941,40510.6310.060.14
[0.077][0.078]
(1)(2)
Face-to-faceList sensitive group
VariableNMean/SENMean/SENormalized difference (1)−(2)F-test for joint orthogonality
Wife can read and write1,4120.1591,4050.1320.080.04**
[0.010][0.009]
Number of children1,4124.6741,4054.722−0.020.60
[0.067][0.065]
Age1,41235.1181,40535.0110.010.80
[0.298][0.288]
In polygamous marriage1,4120.4141,4050.3960.040.35
[0.013][0.013]
In paid work last week1,4120.0041,4050.010−0.070.07*
[0.002][0.003]
Number of stoves household owns1,4120.0241,4050.0010.0770.041**
[0.011][0.001]
Household experienced accidents/disasters past year1,4123.0001,4053.070−0.060.09*
[0.030][0.029]
Current value of household assets (Naira)1,41222,689.1221,40521,154.0320.050.18
[846.079][755.367]
Husband’s age when first married1,41022.9941,40324.243−0.060.09*
[0.159][0.728]
Number of gender equitable attitudes held1,4122.5051,4052.528−0.040.30
[0.016][0.016]
Locus of control1,41210.7941,40510.6310.060.14
[0.077][0.078]

Source: Data from author’s experiment.

Note: This table shows balance on key respondent characteristics in Nigeria across those randomly assigned to the different survey groups. Brackets show standard errors. Strata fixed effects are included. The value displayed for F-tests are F-statistics for the differences in means across groups. ***, **, and * indicate significance at the 1, 5, and 10 percent critical level.

Table A.4.

Nigeria: Mean IPV Prevalence across Methods by Respondent Characteristics

Emotional IPVPhysical IPVSexual IPV
Below medianAbove medianBelow medianAbove medianBelow medianAbove median
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
ListF2FListF2FListF2FListF2FListF2FListF2F
Variable
 Vulnerability index0.430.330.360.260.270.200.260.190.270.300.250.26
(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.05)(0.01)(0.04)(0.02)(0.05)(0.02)
 Unequal-village-gender-norms index0.400.240.390.360.240.150.290.230.240.270.280.30
(0.04)(0.02)(0.04)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)(0.05)(0.02)
 Relationship-quality index0.440.390.360.220.290.270.240.130.280.390.250.19
(0.04)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.01)
 Progressive-gender-attitudes index0.410.350.380.250.300.240.230.140.280.350.240.21
(0.04)(0.02)(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)
 Wife’s age0.360.270.430.320.210.180.310.210.230.280.290.27
(0.04)(0.02)(0.04)(0.02)(0.05)(0.01)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)
Indicator variablesWife illiterateWife literateWife illiterateWife literateWife illiterateWife literate
 Wife reads and writes0.380.310.480.240.270.210.240.090.250.290.330.25
(0.03)(0.01)(0.08)(0.00)(0.04)(0.01)(0.09)(0.00)(0.04)(0.01)(0.08)(0.00)
Least confidentMost confidentLeast confidentMost confidentLeast confidentMost confident
 Confidence in speaking up in village0.410.340.380.210.250.210.290.150.270.290.250.27
(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.06)(0.02)(0.04)(0.01)(0.05)(0.02)
PolygamousMonogamousPolygamousMonogamousPolygamousMonogamous
 Marriage type0.360.320.430.280.170.210.330.180.190.310.310.26
(0.05)(0.02)(0.04)(0.02)(0.06)(0.02)(0.05)(0.01)(0.05)(0.02)(0.04)(0.02)
No assets to marriageBrought assetsNo assets to marriageBrought assetsNo assets to marriageBrought assets
 Wife brought assets to marriage0.420.400.380.220.250.270.270.140.280.330.250.24
(0.00)(0.02)(0.04)(0.01)(0.00)(0.02)(0.04)(0.01)(0.00)(0.02)(0.04)(0.02)
Emotional IPVPhysical IPVSexual IPV
Below medianAbove medianBelow medianAbove medianBelow medianAbove median
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
ListF2FListF2FListF2FListF2FListF2FListF2F
Variable
 Vulnerability index0.430.330.360.260.270.200.260.190.270.300.250.26
(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.05)(0.01)(0.04)(0.02)(0.05)(0.02)
 Unequal-village-gender-norms index0.400.240.390.360.240.150.290.230.240.270.280.30
(0.04)(0.02)(0.04)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)(0.05)(0.02)
 Relationship-quality index0.440.390.360.220.290.270.240.130.280.390.250.19
(0.04)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.01)
 Progressive-gender-attitudes index0.410.350.380.250.300.240.230.140.280.350.240.21
(0.04)(0.02)(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)
 Wife’s age0.360.270.430.320.210.180.310.210.230.280.290.27
(0.04)(0.02)(0.04)(0.02)(0.05)(0.01)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)
Indicator variablesWife illiterateWife literateWife illiterateWife literateWife illiterateWife literate
 Wife reads and writes0.380.310.480.240.270.210.240.090.250.290.330.25
(0.03)(0.01)(0.08)(0.00)(0.04)(0.01)(0.09)(0.00)(0.04)(0.01)(0.08)(0.00)
Least confidentMost confidentLeast confidentMost confidentLeast confidentMost confident
 Confidence in speaking up in village0.410.340.380.210.250.210.290.150.270.290.250.27
(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.06)(0.02)(0.04)(0.01)(0.05)(0.02)
PolygamousMonogamousPolygamousMonogamousPolygamousMonogamous
 Marriage type0.360.320.430.280.170.210.330.180.190.310.310.26
(0.05)(0.02)(0.04)(0.02)(0.06)(0.02)(0.05)(0.01)(0.05)(0.02)(0.04)(0.02)
No assets to marriageBrought assetsNo assets to marriageBrought assetsNo assets to marriageBrought assets
 Wife brought assets to marriage0.420.400.380.220.250.270.270.140.280.330.250.24
(0.00)(0.02)(0.04)(0.01)(0.00)(0.02)(0.04)(0.01)(0.00)(0.02)(0.04)(0.02)

Source: Author’s analysis based on own data.

Note: This table shows the difference in IPV prevalence by a number of respondent characteristics, for each of the three IPV questions. The left column lists the variables correlated with IPV. Columns 1–12 show the mean IPV prevalence by respondent characteristics (above or below median for continuous variables, or by binary characteristics). Coefficients in columns 1, 3, 5, 7, 9, and 11 were estimated using a regression on the number of list items reported. Standard errors are shown in parentheses. IPV, Intimate partner violence; F2F, face to face.

Table A.4.

Nigeria: Mean IPV Prevalence across Methods by Respondent Characteristics

Emotional IPVPhysical IPVSexual IPV
Below medianAbove medianBelow medianAbove medianBelow medianAbove median
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
ListF2FListF2FListF2FListF2FListF2FListF2F
Variable
 Vulnerability index0.430.330.360.260.270.200.260.190.270.300.250.26
(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.05)(0.01)(0.04)(0.02)(0.05)(0.02)
 Unequal-village-gender-norms index0.400.240.390.360.240.150.290.230.240.270.280.30
(0.04)(0.02)(0.04)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)(0.05)(0.02)
 Relationship-quality index0.440.390.360.220.290.270.240.130.280.390.250.19
(0.04)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.01)
 Progressive-gender-attitudes index0.410.350.380.250.300.240.230.140.280.350.240.21
(0.04)(0.02)(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)
 Wife’s age0.360.270.430.320.210.180.310.210.230.280.290.27
(0.04)(0.02)(0.04)(0.02)(0.05)(0.01)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)
Indicator variablesWife illiterateWife literateWife illiterateWife literateWife illiterateWife literate
 Wife reads and writes0.380.310.480.240.270.210.240.090.250.290.330.25
(0.03)(0.01)(0.08)(0.00)(0.04)(0.01)(0.09)(0.00)(0.04)(0.01)(0.08)(0.00)
Least confidentMost confidentLeast confidentMost confidentLeast confidentMost confident
 Confidence in speaking up in village0.410.340.380.210.250.210.290.150.270.290.250.27
(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.06)(0.02)(0.04)(0.01)(0.05)(0.02)
PolygamousMonogamousPolygamousMonogamousPolygamousMonogamous
 Marriage type0.360.320.430.280.170.210.330.180.190.310.310.26
(0.05)(0.02)(0.04)(0.02)(0.06)(0.02)(0.05)(0.01)(0.05)(0.02)(0.04)(0.02)
No assets to marriageBrought assetsNo assets to marriageBrought assetsNo assets to marriageBrought assets
 Wife brought assets to marriage0.420.400.380.220.250.270.270.140.280.330.250.24
(0.00)(0.02)(0.04)(0.01)(0.00)(0.02)(0.04)(0.01)(0.00)(0.02)(0.04)(0.02)
Emotional IPVPhysical IPVSexual IPV
Below medianAbove medianBelow medianAbove medianBelow medianAbove median
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
ListF2FListF2FListF2FListF2FListF2FListF2F
Variable
 Vulnerability index0.430.330.360.260.270.200.260.190.270.300.250.26
(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.05)(0.01)(0.04)(0.02)(0.05)(0.02)
 Unequal-village-gender-norms index0.400.240.390.360.240.150.290.230.240.270.280.30
(0.04)(0.02)(0.04)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)(0.05)(0.02)
 Relationship-quality index0.440.390.360.220.290.270.240.130.280.390.250.19
(0.04)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.01)
 Progressive-gender-attitudes index0.410.350.380.250.300.240.230.140.280.350.240.21
(0.04)(0.02)(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)
 Wife’s age0.360.270.430.320.210.180.310.210.230.280.290.27
(0.04)(0.02)(0.04)(0.02)(0.05)(0.01)(0.05)(0.01)(0.05)(0.02)(0.05)(0.02)
Indicator variablesWife illiterateWife literateWife illiterateWife literateWife illiterateWife literate
 Wife reads and writes0.380.310.480.240.270.210.240.090.250.290.330.25
(0.03)(0.01)(0.08)(0.00)(0.04)(0.01)(0.09)(0.00)(0.04)(0.01)(0.08)(0.00)
Least confidentMost confidentLeast confidentMost confidentLeast confidentMost confident
 Confidence in speaking up in village0.410.340.380.210.250.210.290.150.270.290.250.27
(0.04)(0.02)(0.05)(0.02)(0.05)(0.01)(0.06)(0.02)(0.04)(0.01)(0.05)(0.02)
PolygamousMonogamousPolygamousMonogamousPolygamousMonogamous
 Marriage type0.360.320.430.280.170.210.330.180.190.310.310.26
(0.05)(0.02)(0.04)(0.02)(0.06)(0.02)(0.05)(0.01)(0.05)(0.02)(0.04)(0.02)
No assets to marriageBrought assetsNo assets to marriageBrought assetsNo assets to marriageBrought assets
 Wife brought assets to marriage0.420.400.380.220.250.270.270.140.280.330.250.24
(0.00)(0.02)(0.04)(0.01)(0.00)(0.02)(0.04)(0.01)(0.00)(0.02)(0.04)(0.02)

Source: Author’s analysis based on own data.

Note: This table shows the difference in IPV prevalence by a number of respondent characteristics, for each of the three IPV questions. The left column lists the variables correlated with IPV. Columns 1–12 show the mean IPV prevalence by respondent characteristics (above or below median for continuous variables, or by binary characteristics). Coefficients in columns 1, 3, 5, 7, 9, and 11 were estimated using a regression on the number of list items reported. Standard errors are shown in parentheses. IPV, Intimate partner violence; F2F, face to face.

Table A.5.

Nigeria: No-Design-Effects Test and Proportions Responding with Each Option in List Experiment

Nigeria (Percentage of respondents)
Q1Q2Q3
ResponseNon-sensitiveSensitiveNon-sensitiveSensitiveNon-sensitiveSensitive
013.818.912.548.6111.4711.25
159.9940.6447.9538.0150.1438.72
219.6237.1524.8635.5932.9333.67
36.599.5414.6612.385.4513.17
43.775.413.2
No-design-effect p-value:111
Nigeria (Percentage of respondents)
Q1Q2Q3
ResponseNon-sensitiveSensitiveNon-sensitiveSensitiveNon-sensitiveSensitive
013.818.912.548.6111.4711.25
159.9940.6447.9538.0150.1438.72
219.6237.1524.8635.5932.9333.67
36.599.5414.6612.385.4513.17
43.775.413.2
No-design-effect p-value:111

Source: Author's analysis from author's data.

Note: The table shows Bonferroni-corrected minimum p-values from the no-design-effects test for the list experiment. Null hypothesis is of no design effect. Computed using the List package in R (Blair and Imai 2011).

Table A.5.

Nigeria: No-Design-Effects Test and Proportions Responding with Each Option in List Experiment

Nigeria (Percentage of respondents)
Q1Q2Q3
ResponseNon-sensitiveSensitiveNon-sensitiveSensitiveNon-sensitiveSensitive
013.818.912.548.6111.4711.25
159.9940.6447.9538.0150.1438.72
219.6237.1524.8635.5932.9333.67
36.599.5414.6612.385.4513.17
43.775.413.2
No-design-effect p-value:111
Nigeria (Percentage of respondents)
Q1Q2Q3
ResponseNon-sensitiveSensitiveNon-sensitiveSensitiveNon-sensitiveSensitive
013.818.912.548.6111.4711.25
159.9940.6447.9538.0150.1438.72
219.6237.1524.8635.5932.9333.67
36.599.5414.6612.385.4513.17
43.775.413.2
No-design-effect p-value:111

Source: Author's analysis from author's data.

Note: The table shows Bonferroni-corrected minimum p-values from the no-design-effects test for the list experiment. Null hypothesis is of no design effect. Computed using the List package in R (Blair and Imai 2011).

Table A.6.

Simulated Upper Bound on Physical IPV List Estimate Prevalence from Possible Ceiling Effect.

ResponseNon-sensitiveFrequencySensitiveFrequencyUpper bound sensitive
012.541778.611210
147.9567738.0153437.37
224.8635135.5950035.59
314.6620712.3817412.38
45.417614.66
Mean1.411.6692.04
Prevalence0.2590.630
ResponseNon-sensitiveFrequencySensitiveFrequencyUpper bound sensitive
012.541778.611210
147.9567738.0153437.37
224.8635135.5950035.59
314.6620712.3817412.38
45.417614.66
Mean1.411.6692.04
Prevalence0.2590.630

Source: Author’s analysis based on own data.

Note: This table shows an estimated upper bound on possible underreporting bias from a ceiling effect for the Nigeria physical IPV question using strong assumptions about how people would have reported had there not been a ceiling effect. I assume that the true share of respondents reporting all four items as true in the list sensitive group would resemble the share in the non-sensitive group’s top category (14.66), and that those deterred by the ceiling effect chose the lowest number of items instead of the highest. Essentially, I assume the highest share of respondents were affected by the ceiling effect and that they all reported 0 and 1 instead of a true 4. Under these extreme assumptions, physical IPV prevalence under the list method would be 63 percent instead of 25.9 percent.

Table A.6.

Simulated Upper Bound on Physical IPV List Estimate Prevalence from Possible Ceiling Effect.

ResponseNon-sensitiveFrequencySensitiveFrequencyUpper bound sensitive
012.541778.611210
147.9567738.0153437.37
224.8635135.5950035.59
314.6620712.3817412.38
45.417614.66
Mean1.411.6692.04
Prevalence0.2590.630
ResponseNon-sensitiveFrequencySensitiveFrequencyUpper bound sensitive
012.541778.611210
147.9567738.0153437.37
224.8635135.5950035.59
314.6620712.3817412.38
45.417614.66
Mean1.411.6692.04
Prevalence0.2590.630

Source: Author’s analysis based on own data.

Note: This table shows an estimated upper bound on possible underreporting bias from a ceiling effect for the Nigeria physical IPV question using strong assumptions about how people would have reported had there not been a ceiling effect. I assume that the true share of respondents reporting all four items as true in the list sensitive group would resemble the share in the non-sensitive group’s top category (14.66), and that those deterred by the ceiling effect chose the lowest number of items instead of the highest. Essentially, I assume the highest share of respondents were affected by the ceiling effect and that they all reported 0 and 1 instead of a true 4. Under these extreme assumptions, physical IPV prevalence under the list method would be 63 percent instead of 25.9 percent.

Table A.7.

Nigeria: Robustness Checks with Controls and Enumerator Fixed Effects

(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
Q1: Past year emotional IPVQ2: Past year physical IPVQ3: Past year sexual IPV
List prevalence0.392***0.393***0.399***0.413***0.259***0.254***0.256***0.268***0.261***0.257***0.263***0.275***
(0.031)(0.032)(0.031)(0.026)(0.035)(0.035)(0.035)(0.028)(0.032)(0.032)(0.032)(0.025)
Observations2,8172,7962,8132,8172,8172,7962,8132,8172,8172,7962,8132,817
PDS LASSO controlsxxx
Controls from baseline imbalancexxx
Enumerator fixed effectsxxx
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
Q1: Past year emotional IPVQ2: Past year physical IPVQ3: Past year sexual IPV
List prevalence0.392***0.393***0.399***0.413***0.259***0.254***0.256***0.268***0.261***0.257***0.263***0.275***
(0.031)(0.032)(0.031)(0.026)(0.035)(0.035)(0.035)(0.028)(0.032)(0.032)(0.032)(0.025)
Observations2,8172,7962,8132,8172,8172,7962,8132,8172,8172,7962,8132,817
PDS LASSO controlsxxx
Controls from baseline imbalancexxx
Enumerator fixed effectsxxx

Source: Author’s analysis based on own data.

Note: The table shows IPV prevalence estimates under different specifications. Regressions include strata fixed effects. Robust standard errors in parentheses. ***p < 0.01, **p < 0.05, *p < 0.1.

Table A.7.

Nigeria: Robustness Checks with Controls and Enumerator Fixed Effects

(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
Q1: Past year emotional IPVQ2: Past year physical IPVQ3: Past year sexual IPV
List prevalence0.392***0.393***0.399***0.413***0.259***0.254***0.256***0.268***0.261***0.257***0.263***0.275***
(0.031)(0.032)(0.031)(0.026)(0.035)(0.035)(0.035)(0.028)(0.032)(0.032)(0.032)(0.025)
Observations2,8172,7962,8132,8172,8172,7962,8132,8172,8172,7962,8132,817
PDS LASSO controlsxxx
Controls from baseline imbalancexxx
Enumerator fixed effectsxxx
(1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)
Q1: Past year emotional IPVQ2: Past year physical IPVQ3: Past year sexual IPV
List prevalence0.392***0.393***0.399***0.413***0.259***0.254***0.256***0.268***0.261***0.257***0.263***0.275***
(0.031)(0.032)(0.031)(0.026)(0.035)(0.035)(0.035)(0.028)(0.032)(0.032)(0.032)(0.025)
Observations2,8172,7962,8132,8172,8172,7962,8132,8172,8172,7962,8132,817
PDS LASSO controlsxxx
Controls from baseline imbalancexxx
Enumerator fixed effectsxxx

Source: Author’s analysis based on own data.

Note: The table shows IPV prevalence estimates under different specifications. Regressions include strata fixed effects. Robust standard errors in parentheses. ***p < 0.01, **p < 0.05, *p < 0.1.

Footnotes

1

In more than 60 developing countries, Demographic and Health Surveys (DHS) have included a standardized domestic violence module where enumerators directly ask women about their experience of a set of specific emotional, physical, and sexual abuse episodes by an intimate partner. Using the face-to-face method, the WHO estimates that approximately one in three women worldwide has experienced IPV in her lifetime (García-Moreno et al. 2013). IPV prevalence varies widely, though rates are generally highest in Africa (Devries et al. 2013; Global Burden of Disease 2017). Researchers collecting IPV survey data take steps to minimize the risk of underreporting, for example by instituting ethics protocols about privacy during interviews, asking multiple questions to give respondents several opportunities to disclose violence, and using specially trained female enumerators to conduct women’s interviews (WHO 2016, 2001; Heise and Hossain 2017).

2

In Rwanda we measured, using the three methods, women’s experience of physical IPV and non-partner sexual violence and men’s reported use of controlling behavior and emotional IPV.

3

A further motivation for assessing IPV measurement methods is that the status quo of face-to-face questions places a high emotional burden on both respondents and enumerators. WHO guidelines thus recommend particular protocols for IPV research, for example ensuring respondents have access to referral services (WHO 2016). If an alternative method was identified that was less biased, logistically easier, and more ethical than the status quo, then it would likely be adopted by more researchers and contribute to new research and evidence on IPV (Peterman et al. 2017).

4

A related study by Agüero et al. (2020) in rural Peru compares women’s direct self-reported experience of IPV with a listing of IPV survivors by community leaders. They found that leaders underreport IPV compared to women’s self-reports.

5

Tourangeau and Yan (2007) define sensitive questions as those a respondent deems intrusive, where they fear the threat of disclosure and the possible consequences of giving a truthful response, or where there are social norms that imply there are socially desirable and undesirable responses.

6

For example, Miller (1984) developed the list method to study illicit drug use. Others have used it to study sensitive topics including prejudice in the United States (Kuklinski, Cobb, and Gilens 1997), political preferences in Lebanon (Corstange 2009), support for militant groups in Afghanistan (Blair, Imai, and Lyall 2014), risky sexual behavior (Jamison et al. 2013; Chuang et al. 2021), support for female genital cutting (De Cao and Lutz 2018), vote buying (Gonzalez-Ocantos et al. 2012), and use of microfinance loans (Karlan and Zinman 2012).

7

This can be done through careful piloting and selecting a set of statements that are all plausible but where the co-occurrence of all being true or untrue for any one person is unlikely. See Glynn (2013) and Corstange (2009) for a discussion and suggestions.

8

Droitcour et al. (1991) proposed the double list experiment, where all respondents participate in two list experiments with different non-sensitive items but the same sensitive item. The combined estimate across the two experiments has approximately half the variance of the standard list experiment. Other IPV list experiment trade-offs are discussed in the supplementary online appendix.

9

The authors implemented several double list experiments to measure sexual behavior in Cote d’Ivoire. They found that prevalence sometimes differed between the two lists, potentially because different lists violated key list experiment assumptions.

10

Peterman et al. (2017) measured physical violence prevalence using the list method in Zambia. They found rates of IPV similar to those reported in the Zambian DHS (conducted face-to-face), though the samples are not directly comparable.

11

For example, in Bulte and Lensink’s (2019) Vietnam study, the sensitive list method group responded to a list that included the sensitive statement “I am regularly hit by my spouse,” and then all respondents later in the survey were also asked, “How often did your husband push, slap, beat or hit you during the last 6 months,” with response options never, rarely, sometimes, often, very often, or refuse. With this design, the questions may not be directly comparable given potential differences in interpretation and biased responses to the second question. Similarly, Joseph et al. (2017) first asked all respondents using the list method, with the sensitive statement being “At least one woman member of my household has faced physical aggression from her husband anytime during her life,” followed by all respondents directly being asked, “Has at least one woman member of your household faced physical aggression from her husband anytime during her life?’

12

However, it is also plausible that by presenting the item of interest in a list of other items, the list method affects the way people interpret the severity of violence and/or differentially affects recall, introducing a different type of reporting bias.

13

Cases of domestic violence are hardly ever brought to trial as law and justice agents typically treat IPV as a family matter, and police often fail to respond if they consider cases to be within cultural norms (Benebo, Schumann, and Vaezghasemi 2018). For the very few violence survivors choosing to report or leave their husband (DHS 2019), limited official resources are available.

14

Data were collected from May–July 2018, with the program ending 12 months prior (Bastian, Goldstein, and Papineni 2017). The study and experiment were pre-registered on the American Economic Association trial registry.

15

Married female respondents were first asked about their experience of IPV using the list method (asked either the sensitive or non-sensitive list, depending on her randomly assigned group), followed by the three direct face-to-face IPV questions if she was in the list non-sensitive group.

16

To ensure innumerate respondents could answer list questions, and to help all respondents keep track of their answers, they were advised to put their fist behind their back or under their hijab out of sight of the enumerator and to straighten a finger each time a true statement was read out. Once all statements had been read out, respondents showed the enumerator their hand to count the number of straightened fingers. Some respondents also put their hands behind their backs and transferred a stone from one hand to the other each time a true statement was read out.

The non-sensitive questions that were asked in the list experiments are shown in table A.2. They were selected from a larger pool of piloted questions following design advice discussed in Glynn (2013), and informed by questions used by Peterman et al. (2017) and Agüero and Frisancho (2022).

17

Differences in prevalence estimates across the methods were calculated using the “Suest” command in Stata, used to compare “seemingly unrelated estimations.” Suest combines the stored results from the different estimations into one parameter vector and simultaneous covariance matrix, allowing the testing of cross-model hypotheses.

18

See Blair, Coppock, and Moor (2020) for a comprehensive discussion on list experiment power considerations.

19

Post-double selection LASSO controls for covariates selected by LASSO to predict the outcome. It allows for the parsimonious and consistent selection of controls from a large set of potential variables. It estimates two LASSO regressions, first with the outcome as dependent variable and control variables as regressors, second estimating a LASSO with the treatment as the dependent variable and again the controls as regressors. This second LASSO regression essentially selects unbalanced variables as controls. The final choice of control variables in the OLS regression of the outcome on the treatment indicator includes the controls selected from both LASSO regressions. We use the Stata command “pdslasso” to implement this procedure.

20

The q-values control for the false discovery rate for the three tests for difference in means that we calculate in Nigeria. All regression estimates were calculated with heteroskedastic-consistent robust standard errors.

22

Results are essentially unchanged when we include control variables in the list estimate regressions, including polygamous marriage, region, husband’s alcohol consumption, religion, wife earning a wage income, household recently faced a negative income shock, and wife’s age at marriage.

23

For conceptually linked variables (for example, questions about relationship quality such as trust and empathy in the marriage), we follow Anderson (2008) and generate weighted standardized summary indices as the most efficient weighted average of the set of outcomes to minimize the false discovery rate. The index weights outcomes using the inverse of their variance–covariance matrix. For other indicators of interest where there is less theoretical coherence between the variables, we report coefficients on the raw variable.

Given that this is exploratory analysis only, we do not adjust the p-values of the tests of equivalence for multiple hypothesis testing, though when we do compute q-values to control for the false discovery rate, none of the list–face-to-face differences remain statistically significant at conventional levels. This suggests results should be interpreted with caution.

24

However, we cannot rule out that literate women understood the list experiment better than illiterate women, and thus how much weight to place on these results.

25

Chuang et al. (2021) found that having relatively sensitive “non-sensitive” statements on the list appears to mask the sensitive item of interest, and results in lower variance and higher prevalence estimates. We used relatively innocuous non-sensitive list items, which, if results from the Chuang et al. measurement experiment on risky sexual behavior hold, make it plausible that our list experiment estimates may be downwards biased.

26

The associated test p-values and proportions for Nigeria are shown in table A.5. In Rwanda, where the list non-sensitive group was directly asked each non-sensitive question, there is statistically significant evidence of a design effect for three out of the four questions, suggesting that group assignment affected answers to the list questions. There was no design effect for the first men’s question.

27

The proportion of respondents in the non-sensitive list group reporting three items as true is never greater than 14.7 percent (for the women’s physical violence question), and the proportion of respondents in the sensitive list group reporting the full number of items is never greater than 5.4 percent (for the women’s physical violence question). Table A.6 simulates a possible upper bound on the physical IPV list estimate if there had been no ceiling effect and under extreme assumptions of underreporting with the sensitive list.

28

For example, several randomized control trials of IPV prevention or gender-norms-related interventions have found small or null effects on IPV while having impacts on related outcomes such as gender equitable attitudes or perceptions of norms (Vaillant et al. 2020). It is plausible that such gender norms or couples-focused interventions had an impact on IPV, for good or bad, but that biased reporting producing biased treatment effects may not show this.

29

DHS data on IPV cover more than 60 developing countries and are cited in more than 5,000 papers on Google Scholar. If DHS IPV data are measured with substantial, systematic, and context-specific bias as this paper suggests is likely, this has significant implications for the use of such data.

Notes

Claire Cullen is postdoctoral research fellow from the Blavatnik School of Government, University of Oxford; her email address is [email protected]. The impact evaluations through which these data were collected were conducted by the World Bank Africa Region Gender Innovation Lab and were funded by the World Bank’s Nordic Trust Fund, the Swiss Development Cooperation, and the Government of Rwanda, and in Nigeria by USAID and the Umbrella Facility for Gender Equality, a World Bank Group multi-donor trust fund. The views presented in this paper are those of the author and do not represent those of the World Bank or its member countries. I gratefully acknowledge support from the Blavatnik School of Government, Fund for Women Graduates, and Graduate Women International. I would like to thank anonymous reviewers for their helpful feedback, as well as Julien Labonne, Stefan Dercon, Arthur Alik-Lagrange, Noam Angrist, Dara Kay Cohen, Cheryl Doss, Patricia Fernandes, Markus Goldstein, Harald Hinkel, Eliana La Ferrara, Summer Lindsey, Anandi Mani, Sreelakshmi Papineni, Rachael Pierotti, Simon Quinn, Diana Contreras Suarez, Alex Twahirwa, Julia Vaillant, Marc Witte, and seminar participants at the World Bank Africa Region Gender Innovation Lab, Oxford CSAE, IFPRI, UC Berkeley, Yale EGEN, Australian Gender Economics Workshop, Kigali Impact Hub, and SVRI for helpful comments and suggestions. I would also like to thank the respondents for generously giving their time, and the staff at Laterite Ltd. in Rwanda and TNS Global in Nigeria, and Victoria Isika, Jean Paul Ngabonziza, Maritetou Sanogo, and Oluwatoyin Zakariya for their superb research assistance and data collection work. All remaining errors are my own. This study was approved by the Oxford University Central University Research Ethics Committee, HML IRB, and the Rwandan National Ethics Committee. This paper was previously circulated under the title “Method Matters: Underreporting of Intimate Partner Violence in Nigeria and Rwanda.” A supplementary online appendix is available with this article at The World Bank Economic Review website.

References

Agüero
J.
,
Aldana
Ú.
,
Field
E.
,
Frisancho
V.
,
Romero
J.
.
2020
. “
Is Community-Based Targeting Effective in Identifying Intimate Partner Violence?
AEA Papers and Proceedings
110
(
May 2020
):
605
09
.

Agüero
J. M.
,
Frisancho
V.
.
2022
. “
Measuring Violence against Women with Experimental Methods
.”
Economic Development and Cultural Change
70
(
4
):
1565
90
.

Anderson
M. L.
2008
. “
Multiple Inference and Gender Differences in the Effects of Early Intervention: A Reevaluation of the Abecedarian, Perry Preschool, and Early Training Projects
.”
Journal of the American Statistical Association
103
(
484
):
1481
95
.

Bastian
G. G.
,
Goldstein
M. P.
,
Papineni
S.
.
2017
. “
Are Cash Transfers Better Chunky or Smooth?: Evidence from an Impact Evaluation of a Cash Transfer Program in Northern Nigeria
.”
Technical report
,
World Bank Gender Innovation Lab Policy Brief, Washington, DC
.

Belloni
A.
,
Chernozhukov
V.
,
Hansen
C.
.
2014
. “
Inference on Treatment Effects after Selection among High-Dimensional Controls
.”
Review of Economic Studies
81
(
2
):
608
50
.

Benebo
F. O.
,
Schumann
B.
,
Vaezghasemi
M.
.
2018
. “
Intimate Partner Violence against Women in Nigeria: A Multilevel Study Investigating the Effect of Women’s Status and Community Norms
.”
BMC Women’s Health
18
(
1
):
1
17
.

Benjamini
Y.
,
Hochberg
Y.
.
1995
. “
Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing
.”
Journal of the Royal Statistical Society: Series B (Methodological)
57
(
1
):
289
300
.

Bharadwaj
P.
,
Pai
M. M.
,
Suziedelyte
A.
.
2017
. “
Mental Health Stigma
.”
Economics Letters
159
:
57
60
.

Blair
G.
,
Coppock
A.
,
Moor
M.
2020
. “
When to Worry about Sensitivity Bias: A Social Reference Theory and Evidence from 30 Years of List Experiments
.”
American Political Science Review
114
(
4
):
1297
1315
.

Blair
G.
,
Imai
K.
.
2011
. “
List: Statistical Methods for the Item Count Technique and List Experiment. Comprehensive R Archive NetworK (CRAN)
.”

Blair
G.
,
Imai
K.
.
2012
. “
Statistical Analysis of List Experiments
.”
Political Analysis
20
(
1
):
47
77
.

Blair
G.
,
Imai
K.
,
Lyall
J.
.
2014, Oct.
Comparing and Combining List and Endorsement Experiments: Evidence from Afghanistan
.”
American Journal of Political Science
58
(
4
):
1043
63
.

Blattman
C.
,
Jamison
J.
,
Koroknay-Palicz
T.
,
Rodrigues
K.
,
Sheridan
M.
.
2016
. “
Measuring the Measurement Error: A Method to Qualitatively Validate Survey Data
.”
Journal of Development Economics
120
(
May 2016
):
99
112
.

Bobonis
G. J.
,
González-Brenes
M.
,
Castro
R.
.
2013
. “
Public Transfers and Domestic Violence: The Roles of Private Information and Spousal Control
.”
American Economic Journal: Economic Policy
5
(
51
):
179
205
.

Buller
A. M.
,
Peterman
A.
,
Ranganathan
M.
,
Bleile
A.
,
Hidrobo
M.
,
Heise
L.
.
2018
. “
A Mixed-Method Review of Cash Transfers and Intimate Partner Violence in Low- and Middle-Income Countries
.”
World Bank Research Observer
33
(
2
):
218
58
.

Bullinger
L. R.
,
Carr
J. B.
,
Packham
A.
.
2021
. “
Covid-19 and Crime: Effects of Stay-At-Home Orders on Domestic Violence
.”
American Journal of Health Economics
7
(
3
):
249
80
.

Bulte
E.
,
Lensink
R.
.
2019, Jun
. “
Women’s Empowerment and Domestic Abuse: Experimental Evidence from Vietnam
.”
European Economic Review
115
(
June 2019
):
172
91
.

Chuang
E.
,
Dupas
P.
,
Huillery
E.
,
Seban
J.
.
2021
. “
Sex, Lies, and Measurement: Consistency Tests for Indirect Response Survey Methods
.”
Journal of Development Economics
148
(
January 2021
):
102582
.

Corstange
D
.
2009
. “
Sensitive Questions, Truthful Answers? Modeling the List Experiment with LISTIT
.”
Political Analysis
17
(
1
):
45
63
.

Coutts
E.
,
Jann
B.
.
2011
. “
Sensitive Questions in Online Surveys: Experimental Results for the Randomized Response Technique (RRT) and the Unmatched Count Technique (UCT)
.”
Sociological Methods & Research
40
(
1
):
169
93
.

De Cao
E.
,
Lutz
C.
.
2018
. “
Sensitive Survey Questions: Measuring Attitudes Regarding Female Genital Cutting through a List Experiment
.”
Oxford Bulletin of Economics and Statistics
80
(
5
):
871
92
.

Devries
K. M.
,
Mak
J. Y.
,
García-Moreno
C.
,
Petzold
M.
,
Child
J. C.
,
Falder
G.
,
Lim
S.
et al.
2013
. “
The Global Prevalence of Intimate Partner Violence against Women
.”
Science
340
(
6140
):
1527
28
.

DHS
.
2019
. “
Nigeria Demographic and Health Survey
.”
Technical report, National Population Commission (NPC) Nigeria and ICF International, Abuja, Nigeria, and Rockville, Maryland, USA
.

DHS
.
2021
. “
Rwanda Demographic and Health Survey 2019-2020
.”
Technical report, National Institute of Statistics of Rwanda (NISR), Ministry of Health (MOH), and ICF International, Kigali and Maryland
.

Droitcour
J.
,
Caspar
R. A.
,
Hubbard
M. L.
,
Parsley
T. L.
,
Visscher
W.
,
Ezzati
T. M.
.
1991
. “
The Item Count Technique As A Method of Indirect Questioning: A Review of its Development and a Case Study Application
.” In
Biemer
P.
,
Groves
R.
, eds.
Measurement Errors in Surveys
,
New York, NY
:
John Wiley & Sons
.

Erten
B.
,
Keskin
P.
.
2018, Jan
. “
For Better or for Worse?: Education and the Prevalence of Domestic Violence in Turkey
.”
American Economic Journal: Applied Economics
10
(
1
):
64
105
.

Eswaran
M.
,
Malhotra
N.
.
2011
. “
Domestic Violence and Women’s Autonomy in Developing Countries: Theory and Evidence
.”
Canadian Journal of Economics/Revue canadienne d’économique
44
(
4
):
1222
63
.

Falb
K.
,
Tanner
S.
,
Asghar
K.
,
Souidi
S.
,
Mierzwa
S.
,
Assazenew
A.
,
Bakomere
T.
et al.
2016
. “
Implementation of Audio-Computer Assisted Self-Interview (ACASI) among Adolescent Girls in Humanitarian Settings: Feasibility, Acceptability, and Lessons Learned
.”
Conflict and Health
10
(
1
):
32
.

Fay
R. E.
,
Turner
C. F.
,
Klassen
A. D.
,
Gagnon
J. H.
,
Pleck
J. H.
,
Sonenstein
F. L.
.
1998, Jan
. “
Prevalence and Patterns of Same-Gender Sexual Contact among Men
.”
Science
280
(
5365
):
867
873
.

Fincher
D.
,
VanderEnde
K.
,
Colbert
K.
,
Houry
D.
,
Smith
L. S.
,
Yount
K. M.
.
2015
. “
Effect of Face-to-Face Interview versus Computer-Assisted Self-Interview on Disclosure of Intimate Partner Violence among African American Women in WIC Clinics
.”
Journal of Interpersonal Violence
30
(
5
):
818
38
.

Flavin
P.
,
Keane
M.
.
2009
. “
How Angry am I? Let Me Count the Ways: Question Format Bias in List Experiments.” Unpublished Manuscript, 1–17
.

García-Moreno
C.
,
Pallitto
C.
,
Devries
K.
,
Stöckl
H.
,
Watts
C.
,
Abrahams
N.
.
2013
.
“Global and Regional Estimates of Violence Against Women: Prevalence and Health Effects of Intimate Partner Violence and Non-Partner Sexual Violence.”
World Health Organization
.

Global Burden of Disease
.
2017
.
“Global Burden of Disease Study 2016: Health-related Sustainable Development Goals (SDG) Indicators 1990-2030.”
Seattle, United States
:
Institute for Health Metrics and Evaluation (IHME)
.

Glynn
A. N.
.
2013
. “
What Can We Learn with Statistical Truth Serum?: Design and Analysis of the List Experiment
.”
Public Opinion Quarterly
77
(
S1
):
159
72
.

Gonzalez-Ocantos
E.
,
de Jonge
C. K.
,
Meléndez
C.
,
Osorio
J.
,
Nickerson
D. W.
.
2012
. “
Vote Buying and Social Desirability Bias: Experimental Evidence from Nicaragua
.”
American Journal of Political Science
56
(
1
):
202
17
.

Gottschalk
P.
,
Huynh
M.
.
2010
. “
Are Earnings Inequality and Mobility Overstated? The Impact of Nonclassical Measurement Error
.”
Review of Economics and Statistics
92
(
2
):
302
15
.

Haushofer
J.
,
Ringdal
C.
,
Shapiro
J. P.
,
Wang
X. Y.
.
2019
.
“Income changes and intimate partner violence: Evidence from unconditional cash transfers in Kenya (No. w25627).”
National Bureau of Economic Research
.

Heise
L.
,
Hossain
M.
.
2017
.
“Measuring Intimate Partner Violence.”
STRIVE
,
London
.

Heise
L.
,
Kotsadam
A.
.
2015
. “
Cross-National and Multilevel Correlates of Partner Violence: An Analysis of Data from Population-Based Surveys
.”
Lancet Global Health
3
(
6
):
332
40
.

Holbrook
A. L.
,
Krosnick
J. A.
.
2010
. “
Measuring Voter Turnout by Using the Randomized Response Technique: Evidence Calling into Question the Method’s Validity
.”
Public Opinion Quarterly
74
(
2
):
328
43
.

Imbens
G.
,
Rubin
D. B.
.
2015
.
Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction
.
Cambridge University Press
.

Jamison
J. C.
,
Karlan
D.
,
Raffler
P.
.
2013
.
“Mixed Method Evaluation of A Passive Mhealth Sexual Information Texting Service in Uganda (No. w19107).”
National Bureau of Economic Research
.

Jarlais
D. C. D.
,
Paone
D.
,
Milliken
J.
,
Turner
C. F.
,
Miller
H.
,
Gribble
J.
,
Shi
Q.
,
Hagan
H.
,
Friedman
S. R.
.
1999, May
. “
Audio-Computer Interviewing to Measure Risk Behaviour for HIV among Injecting Drug Users: A Quasi-Randomised Trial
.”
Lancet
353
(
9165
):
1657
1661
.

Jaya
M. J. Hindin
,
Ahmed
S.
.
2008, Jan
. “
Differences in Young People’s Reports of Sexual Behaviors According to Interview Methodology: A Randomized Trial in India
.”
American Journal of Public Health
98
(
1
):
169
74
.

Joseph
G.
,
Javaid
S. U.
,
Andres
L. A.
,
Chellaraj
G.
,
Solotaroff
J. L.
,
Rajan
S. I.
.
2017
. “
Underreporting of Gender-Based Violence in Kerala, India: An Application of the List Randomization Method
.”
World Bank Policy Research Working Paper No. 8044
.

Karlan
D. S.
,
Zinman
J.
.
2012
. “
List Randomization for Sensitive Behavior: An Application for Measuring Use of Loan Proceeds
.”
Journal of Development Economics
98
(
1
):
71
75
.

Krupka
E.
,
Weber
R.
.
2013
. “
Identifying Social Norms Using Coordination Games: Why Does Dictator Game Sharing Vary?
Journal of the European Economic Association
11
(
3
):
495
524
.

Kuklinski
J. H.
,
Cobb
M. D.
,
Gilens
M.
.
1997
. “
Racial Attitudes and the New South
.”
Journal of Politics
59
(
2
):
323
49
.

Langhaug
L. F.
,
Sherr
L.
,
Cowan
F. M.
.
2010, Mar
. “
How to Improve the Validity of Sexual Behaviour Reporting: Systematic Review of Questionnaire Delivery Modes in Developing Countries
.”
Tropical Medicine & International Health
15
(
3
):
362
81
.

Lépine
A.
,
Treibich
C.
,
d’Exelle
B.
.
2020
. “
Nothing but the Truth: Consistency and Efficiency of the List Experiment Method for the Measurement of Sensitive Health Behaviours
.”
Social Science & Medicine
266
(
December 2020
):
113326
.

Leslie
E.
,
Wilson
R.
.
2020
. “
Sheltering in Place and Domestic Violence: Evidence from Calls for Service during COVID-19
.”
Journal of Public Economics
189
(
September 2020
):
104241
.

Mensch
B. S.
,
Hewett
P. C.
,
Gregory
R.
,
Helleringer
S.
.
2008
. “
Sexual Behavior and STI/HIV Status among Adolescents in Rural Malawi: An Evaluation of the Effect of Interview Mode on Reporting
.”
Studies in Family Planning
39
(
4
):
321
34
.

Meyer
B. D.
,
Mittag
N.
.
2019
. “
Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness, and Holes in the Safety Net
.”
American Economic Journal: Applied Economics
11
(
2
):
176
204
.

Miller
J. D.
.
1984
.
A New Survey Technique for Studying Deviant Behavior
.
Unpublished doctoral dissertation, George Washington University
.

Murray-Close
M.
,
Heggeness
M.
.
2018
. “
Manning up and Womaning Down: How Husbands and Wives Report their Earnings when She Earns More
.”
US Census Bureau Social, Economic, and Housing Statistics Division Working Paper, 20
.

Peterman
A.
,
Palermo
T. M.
,
Handa
S.
,
Seidenfeld
D.
.
2017
. “
List Randomization for Soliciting Experience of Intimate Partner Violence: Application to the Evaluation of Zambia’s Unconditional Child Grant Program
.”
Health Economics
27
(
3
):
622
28
.

Phillips
A. E.
,
Gomez
G. B.
,
Boily
M.-C.
,
Garnett
G. P.
.
2010, Dec
. “
A Systematic Review and Meta-Analysis of Quantitative Interviewing Tools to Investigate Self-Reported HIV and STI Associated Behaviours in Low- and Middle-Income Countries
.”
International Journal of Epidemiology
39
(
6
):
1541
55
.

Ravindran
S.
,
Shah
M.
.
2020
.
“Unintended Consequences of Lockdowns: COVID-19 and the Shadow Pandemic (No. w27562)”
National Bureau of Economic Research
.

Rosenfeld
B.
,
Imai
K.
,
Shapiro
J. N.
.
2016, Jul
. “
An Empirical Validation Study of Popular Survey Methodologies for Sensitive Questions
.”
American Journal of Political Science
60
(
3
):
783
802
.

Tourangeau
R.
,
Yan
T.
.
2007
. “
Sensitive Questions in Surveys
.”
Psychological Bulletin
133
(
5
):
859
83
.

Traunmüller
R.
,
Kijewski
S.
,
Freitag
M.
.
2019
. “
The Silent Victims of Sexual Violence during War: Evidence from a List Experiment in Sri Lanka
.”
Journal of Conflict Resolution
63
(
9
):
2015
42
.

Vaillant
J.
,
Koussoubé
E.
,
Roth
D.
,
Pierotti
R.
,
Hossain
M.
,
Falb
K. L.
2020
. “
Engaging Men to Transform Inequitable Gender Attitudes and Prevent Intimate Partner Violence: A Cluster Randomised Controlled Trial in North and South Kivu, Democratic Republic of Congo
.”
BMJ Global Health
,
5
(
5
),
e002223
.

WHO
.
2001
.
“Putting Women’s Safety First: Ethical and Safety Recommendations for Research on Domestic Violence Against Women.”
World Health Organization
,
Geneva
.

WHO
.
2016
.
“Ethical and Safety Recommendations for Intervention Research on Violence Against Women.”
World Health Organization
,
Geneva
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.