>people inflate their self-reports scores generally when they are being given treatment?
Yup, that’s what I meant.
>Is there one or more studies you can point me to so I can read up on this, or is this a hypothetical concern?
I’m afraid I don’t know this literature on blinding very well but a couple of pointers:
(i) StrongMinds notes “social desirability bias” as a major limitation of their Phase Two impact evaluation, and suggest collecting objective measures to supplement their analysis:
“Develop the means to negate this bias, either by determining a corrective percentage factor to apply or using some other innovative means, such as utilizing saliva cortisol stress testing. By testing the stress levels of depressed participants (proxy for depression), StrongMinds could theoretically determine whether they are being truthful when they indicate in their PHQ-9 responses that they are not depressed.” https://strongminds.org/wp-content/uploads/2013/07/StrongMinds-Phase-Two-Impact-Evaluation-Report-July-2015-FINAL.pdf
(ii) GiveWell’s discussion of the difference between blinded and non-blinded trials on water quality interventions when outcomes were self-reported [I work for GiveWell but didn’t have any role in that work and everything I post on this forum is in a personal capacity unless otherwise noted]
May be best to just chat about this in person but I’ll try to put it another way.
Say a single RCT of a cash transfer program in a particular region of Kenya doubled people’s consumption for a year, but had no apparent effect on life satisfaction. What should we believe about the likely effect of a future cash transfer program on life satisfaction? (taking it as an assumption for the moment that the wider evidence suggests that increases in consumption generally lead to increases in life satisfaction).
Possibility 1: there’s something about cash transfer programs which mean they don’t increase life satisfaction as much as other ways to increase consumption.
Possibility 2: this result was a fluke of context; there was something about that region at that time which meant increases in consumption didn’t translate to increases in reported life satisfaction, and we wouldn’t expect that to be true elsewhere (given the wider evidence base).
If Possibility 2 is true, then it would be more accurate to predict the effect of a future cash transfer program on life satisfaction by using the RCT effect of cash on consumption, and then extrapolating from the wider evidence base to the likely effect on life satisfaction. If possibility 1 is true, then we should simply take the measured effect from the RCT on life satisfaction as our prediction.
One way of distinguishing between possibility 1 and possibility 2 would be to look at the inter-study variance in the effects of similar programs on life satisfaction. If there’s high variance, that should update us to possibility 2. If there’s low variance, that should update us to possibility 1.
I haven’t seen this problem discussed before (although I haven’t looked very hard). It seems interesting and important to me.
On (1)
>people inflate their self-reports scores generally when they are being given treatment?
Yup, that’s what I meant.
>Is there one or more studies you can point me to so I can read up on this, or is this a hypothetical concern?
I’m afraid I don’t know this literature on blinding very well but a couple of pointers:
(i) StrongMinds notes “social desirability bias” as a major limitation of their Phase Two impact evaluation, and suggest collecting objective measures to supplement their analysis:
“Develop the means to negate this bias, either by determining a corrective percentage factor to apply or using some other innovative means, such as utilizing saliva cortisol stress testing. By testing the stress levels of depressed participants (proxy for depression), StrongMinds could theoretically determine whether they are being truthful when they indicate in their PHQ-9 responses that they are not depressed.” https://strongminds.org/wp-content/uploads/2013/07/StrongMinds-Phase-Two-Impact-Evaluation-Report-July-2015-FINAL.pdf
(ii) GiveWell’s discussion of the difference between blinded and non-blinded trials on water quality interventions when outcomes were self-reported [I work for GiveWell but didn’t have any role in that work and everything I post on this forum is in a personal capacity unless otherwise noted]
https://blog.givewell.org/2016/05/03/reservations-water-quality-interventions/
On (2)
May be best to just chat about this in person but I’ll try to put it another way.
Say a single RCT of a cash transfer program in a particular region of Kenya doubled people’s consumption for a year, but had no apparent effect on life satisfaction. What should we believe about the likely effect of a future cash transfer program on life satisfaction? (taking it as an assumption for the moment that the wider evidence suggests that increases in consumption generally lead to increases in life satisfaction).
Possibility 1: there’s something about cash transfer programs which mean they don’t increase life satisfaction as much as other ways to increase consumption.
Possibility 2: this result was a fluke of context; there was something about that region at that time which meant increases in consumption didn’t translate to increases in reported life satisfaction, and we wouldn’t expect that to be true elsewhere (given the wider evidence base).
If Possibility 2 is true, then it would be more accurate to predict the effect of a future cash transfer program on life satisfaction by using the RCT effect of cash on consumption, and then extrapolating from the wider evidence base to the likely effect on life satisfaction. If possibility 1 is true, then we should simply take the measured effect from the RCT on life satisfaction as our prediction.
One way of distinguishing between possibility 1 and possibility 2 would be to look at the inter-study variance in the effects of similar programs on life satisfaction. If there’s high variance, that should update us to possibility 2. If there’s low variance, that should update us to possibility 1.
I haven’t seen this problem discussed before (although I haven’t looked very hard). It seems interesting and important to me.