[Tomasik has updated his essay to address some of these issues]
Brian Tomasik’s website, utilitarian-essays.com, contains many thoughtful pieces he has written over the years from the perspective of a utilitarian who is deeply concerned with wild animal suffering. His work has been a great resource for what is now called the effective altruism community, and I have a lot of respect for his unflinching acceptance and exploration of our large obligations conditional on the moral importance of all animals.
I want to briefly take issue with a small but important part of Brian’s recent essay “Charity cost effectiveness in an uncertain world”. He discusses the difficult problem facing consequentialists who care about the future, especially the far future, on account of how difficult it is to predict the many varied flow-through effects of our actions. In several places, he suggests that this uncertainty will tend to wash out the enormous differences in effectiveness attributed to various charities (and highlighted by effective altruists) when measured by direct impact (e.g. lives saved per dollar).
…When we consider flow-through effects of our actions, the seemingly vast gaps in cost-effectiveness among charities are humbled to more modest differences, and we begin to find more worth in the diversity of activities that different people are pursuing…
…For example, insofar as a charity encourages cooperation, philosophical reflection, and meta-thinking about how to best reduce suffering in the future — even if only by accident — it has valuable flow-through effects, and it’s unlikely these can be beaten by many orders of magnitude by something else…
…I don’t expect some charities to be astronomically better than others…
Although I agree on the importance of the uncertain implications of flow-through effects, I disagree with the suggestion that this should generally be expected to even out differences in effectiveness. Here’s a toy example of how this reasoning can fail: Consider two charities, X and Y, that have exactly the same cost and take exactly the same actions. So that we can reasonably assume linearity, suppose their action is some small contribution to a continuous parameter, e.g. scrubbing a tiny percentage of CO2 from the atmosphere, or giving money to poor Kenyans when many other and larger charities already have similar projects. By symmetry, X and Y should have the same expected impact on the future. Now imagine that one of the charities, X, starts burning 99% of its money and using the remaining 1% to accomplish just 1% of the effect of the other charity, Y.
Obviously, charity X is now two orders of magnitude less efficient than charity Y in terms of direct, object-level impact. But is this difference smoothed out when we more broadly consider the uncertainty of flow-through effects? Of course not. There is still a gap of a factor of 100. Unless you think the sign of the expected effect of the action changes, you should not think the effectiveness of Y relative to X becomes any smaller upon consideration of flow-through effects. And even if you have some uncertainty about the sign, you must still assign an expected utility to a unit of action, and you will still see the same proportional discrepancy between X and Y. (If the sign is negative, you contribute to neither.)
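To make the arithmetic explicit, here is a minimal sketch of the toy example with made-up numbers, under the assumption that flow-through effects scale with the amount of object-level action actually taken:

```python
# A toy model of charities X and Y. All numbers are invented for illustration.
budget = 1_000_000            # dollars donated to each charity
impact_per_dollar = 0.01      # direct, object-level impact per dollar of action
flow_through_multiplier = 50  # long-run impact per unit of direct impact (uncertain, but shared)

def expected_impact(fraction_spent_on_action):
    direct = budget * fraction_spent_on_action * impact_per_dollar
    # Key assumption: flow-through effects scale with the action actually taken,
    # since X and Y perform the same kind of action and differ only in quantity.
    return direct * (1 + flow_through_multiplier)

impact_Y = expected_impact(1.00)  # Y spends everything on the action
impact_X = expected_impact(0.01)  # X burns 99% of its money

print(impact_Y / impact_X)  # 100.0 -- the gap survives flow-through effects
```

Whatever value you plug in for the (highly uncertain) flow-through multiplier, the factor-of-100 gap between Y and X is untouched.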
Of course, as Tomasik suggests in the second quoted sentence, tangential effects of a charity on the people who work for it or interact with it may be very important and could even out the expected impacts. But they could also magnify the differences. You simply cannot escape the necessity of choosing a measure (in the sense of measure theory) over which otherwise indistinguishable charities have equal expected impact. The measure could be “number of dollars spent” (which is the measure I think Tomasik is implicitly using), but it could just as well be “number of people employed”, “number of charities” (defined by legal incorporation), “amount of news coverage”, “number of DALYs saved”, “number of schools built”, “amount of time the charity exists”, or “number of donors”.
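As a concrete illustration (with entirely invented figures) of how the choice of measure sets the default comparison, suppose all we know about two charities is how much they spend and how many people they employ:

```python
# A minimal sketch of how the choice of measure fixes the "default" comparison
# between otherwise indistinguishable charities. Figures are hypothetical.
charities = {
    # name: (dollars_spent, employees)
    "A": (10_000_000, 10),
    "B": (1_000_000, 100),
}

# Measure 1: assume equal expected impact per dollar spent.
print({name: dollars for name, (dollars, employees) in charities.items()})
# -> A is presumed 10x more impactful than B.

# Measure 2: assume equal expected impact per employee.
print({name: employees for name, (dollars, employees) in charities.items()})
# -> B is presumed 10x more impactful than A.
```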
Assuming a general tendency for uncertainty to smooth out differences in expectation is basically the same two-envelope/scope-insensitivity mistake as assuming that two charities you’ve never heard of, but whose very different methods are described to you, have similar expected impact by default. In other words: How much more (or less) impact would saving 3,000 sea lions out of millions have compared to teaching one child to swim? What if I had asked you about 300 sea lions and 10 children? Since you cannot consistently answer “they have equal expected impact, given my ignorance” to both questions — even when you know nothing about swimming or sea lions — it’s clear that mere uncertainty cannot lead to a universal smoothing.
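To spell out the inconsistency with explicit (and arbitrary) numbers, assuming impact is linear in sea lions saved and in children taught:

```python
# The sea-lion example with arbitrary numbers, assuming impact is linear in the
# number of sea lions saved and in the number of children taught to swim.
value_per_sea_lion = 1.0                 # arbitrary unit

# Answering "equal expected impact" to the first question (3,000 sea lions vs.
# 1 child) pins down the implied value of teaching a child to swim:
value_per_child = 3000 * value_per_sea_lion

# The second question (300 sea lions vs. 10 children) is then already determined:
print(300 * value_per_sea_lion)   # 300.0
print(10 * value_per_child)       # 30000.0 -- a 100x gap, not "equal by default"
```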
Lastly, I want to draw out one other statement by Tomasik about research:
But it’s not clear that directly studying future suffering is many orders of magnitude more important. The tools and insights developed in one science tend to transfer to others. And in general, it’s important to pursue a diversity of projects, to discover things you never knew you never knew. Given the choice between 1000 papers on future suffering and 1000 on dung beetles, versus 1001 papers on future suffering and 0 on dung beetles, I would choose the former.
Here, it seems to me like he’s not properly distinguishing between two ideas. I think he is arguing against the claim that some types of research are thousands of times more important than others. A precise statement of that claim is that the marginal impact of a research dollar in some fields is thousands of times larger than in others. But the example Tomasik gives is really about the diminishing returns of research, and the possibility that these might smooth out the initially vastly different marginal impacts. Now, in agreement with his earlier reference to broad market efficiency, we do expect a smart and flexible allocation of resources toward research avenues with sharply diminishing marginal returns to even out the marginal value of investment in various fields over time. I certainly think this happens.
But this only works insofar as the research market is efficient, and there are good reasons to believe that (a) new discoveries act as shocks, which can dramatically change (i.e. by many orders of magnitude) the marginal value of different fields, and (b) the research market has inefficiencies resulting in large and long-lasting discrepancies in marginal value between fields. (Reasons for (b) include the fact that scientists and funding agencies are often beset by biases, are self-interested, are slow to adapt, and don’t think about the far future.) Therefore, it’s very likely that dramatic differences in the marginal value of research fields currently exist and will persist into the future. In particular, I suspect insights about the potential importance of artificial intelligence have not been appreciated by most practicing academics and their funders, so I think it’s certainly possible (but by no means obvious) that the marginal research dollar in this field could be orders of magnitude more important than in most others.
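As a toy sketch of both points, here is a model with invented numbers and a crude greedy-allocation loop standing in for the research market: efficient funding tends to equalize marginal values across fields, but a discovery shock can reopen a gap of orders of magnitude until funders catch up.

```python
# A toy model (all numbers invented) of marginal research value under an
# efficient market, and of a discovery "shock" that reopens a large gap.
import numpy as np

def marginal_value(dollars, importance):
    # Diminishing returns: total value ~ importance * log(1 + dollars),
    # so marginal value ~ importance / (1 + dollars).
    return importance / (1.0 + dollars)

importance = np.array([1.0, 1000.0])  # field B starts out 1000x more valuable per unit
budget = np.array([0.0, 0.0])

# Greedy allocation of small grants to whichever field currently has the higher
# marginal value -- a crude stand-in for an efficient research market.
for _ in range(10_000):
    budget[np.argmax(marginal_value(budget, importance))] += 1.0

print(marginal_value(budget, importance))  # roughly equal after allocation

# A new discovery acts as a shock: field A suddenly becomes 100x more important.
importance[0] *= 100.0
print(marginal_value(budget, importance))  # marginal values now differ by ~100x again
```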
EDIT:
Eliezer Yudkowsky provides this pithy summary:
I think it’s kinda disingenuous to use the title ‘Charities Don’t Differ Astronomically’ to defend the thesis that some charities can be a billion times more effective than another, but are unlikely to be 10^30 times more effective than a positive-but-useless charity because the second charity, assuming it is positive, probably has a greater-than-1-in-10^30 accidental and unintentional effect on promoting the goals of the first charity, unless of course that effect is negative. ‘Charities Differ In Impact By A Factor of Billions, Not Decillions, Due To Bad Charities’ Unintentional Flow-Through Effects on Astronomical Stakes, Which May Be Negative’ would be a more accurate title here. More importantly that doesn’t refute any of the main policy points in the astronomical-stakes argument.
Brian Tomasik responded:

Thanks, Jess! I added a new section to the piece to answer your main question (http://utilitarian-essays.com/robustness-against-uncertainty.html#why-even-out). Readers should check that out before finishing this comment.
In response, Jess said:
“With regard to the argument: As you suspect, I do imagine that most of the flow-through effects come from one or two places. However, even if we put that aside I think an expectation of smoothing is still not stable under rescaling. In other words, if you make the claim that two charities should be within an order of magnitude or so of each other when all flow-through effects are accounted for–*without* putting any other conditions on the charities–then I can always modify the scenario by making the lesser charity 10 times further less effective by just having it burn 90% of its money (which is something that many charities effectively do). Without further qualifications your principle can’t distinguish between these two situations (that is, before and after money burning), and so it can’t apply to both of them.”
I replied:
“Sure, but I’m not convinced most charities do that. :P I buy that there can be some important differences in productivity between people (e.g., some academics produce papers with 10 times more citations than average or something), but it’s not clear how much marginal dollars can boost that. If the person is so good, why is she so underfunded for so long? Certainly amazing opportunities could exist, but I wouldn’t expect them to endure for a long time, and they should be relatively hard to find.”
Jess also said:
“If you are willing to grant that many charities may be 1-2 orders of magnitude less effective but not 3-4, well, I think you’ve essentially already given up. Two orders of magnitude means 1%, which rounds to 0%. The take home message of the order-of-magnitude story is the same: because the spread in effectiveness is so large, the only thing that really matters is finding the very best ones. People who give to the lower tier charities have negligible impact.”
I replied:
“It’s still a big shift from the story that many x-risk people tell about x-risk charities being 10^30 times better than others.”
> It’s still a big shift from the story that many x-risk people tell about x-risk charities being 10^30 times better than others.
Ahh, OK, I think I see what you’re saying here, but then it’s rather modest compared to the broad statements you made in the original essay. It definitely seems true that any claim about (say) an AI-research charity having 10^30 times the impact of a developing-world charity must be based only on object-level estimates. In this case, the vast difference simply arises from weighing a few hundred lives saved in Africa against the 10^many scattered over the far future. Once you take into account flow-through effects, it’s highly improbable that the influence of a developing-world charity on the distant future, even if very slight, wouldn’t grossly outweigh its effects on present-day Africa and so narrow the 10^30 gap. To avoid this, its effects would have to be zeroed toward neutral with microscopic accuracy. But I could still see a gap of, say, 10^6 if the goodness of the future is largely determined by a hard AI take-off scenario that the AI-research charity can influence.
All that is a very different mechanism from the dung-beetle-vs.-AI smoothing, which, as I said, is probably best understood through broad market efficiency.
I’m glad we’re closer to understanding each other. :)
Tomasik replied:

The essay said charities are unlikely to differ by “many orders of magnitude,” by which I had in mind something like 4-6 orders. I suspect that most charities are within 1-2 orders of each other, though some special projects may be somewhat more than that.
So the claim is not that all charities are basically on equal footing, and indeed, some may even be negative. On the other hand, when you consider flow-through effects, it becomes a lot less obvious which charities are best. An intervention with broad-scale improvements to society’s wisdom could theoretically beat an object-level project. Of course, many EAs already recognize this in the realm of meta-charities and internal rates of return.