On Evaluating Arguments from Consensus

8 May 2014

I have often been asked how we should evaluate arguments from consensus. That’s where someone says “the consensus of experts is that P, therefore we should agree P is true.” On the one hand, this looks like an Argument from Authority, a recognized fallacy. On the other hand, we commonly think it should add weight to a conclusion that the relevant experts endorse it. Science itself is based on this assumption. As is religion, lest a religionist think they can defeat science by rejecting all appeals to authority–because such a tack would defeat all religion as well, even your own judgment, since if all appeals to authority are invalid, so is every appeal to yourself as an authority (on your religion, or even on your own life and experience).

And yet, it is often enough the case that a consensus of experts is wrong (as proved even by the fact that the scientific consensus has frequently changed, as has the consensus in any other domain of expertise, from history to motorboat repair). And our brains are cognitively biased to over-trust those we accept as authorities (the Asch effect), putting us at significant risk of false belief if we are not sufficiently critical of our relying on an expert. It’s only more complicated when we have warring experts and have to choose between them, even though we are not experts ourselves.

So what do we do?

The best treatments of the Argument from Authority as a fallacy are at Princeton, FallacyFiles, Wikipedia, and Nizkor (all of which have valuable insight worth reading up on) although that last (and many other treatments online) incorrectly state that the Argument from Authority is only a fallacy when the authority appealed to is not legitimate (e.g. “this sort of reasoning is fallacious only when the person is not a legitimate authority in a particular context”). That’s incorrect because the Argument from Authority is a non sequitur in deductive logic: even the most capable and relevant expert authority on the subject of P can be mistaken. Therefore it cannot deductively follow that P is true merely because they say P is true. Wikipedia gets this right.

But that just means authority can only have logical merit in an inductive argument, which in reality means a probabilistic argument: endorsement of P by certain authorities will normally increase the probability that P is true. The question is by how much, and when…and why. I already cover this question, and the underlying mathematical logic of it (including the role of Condorcet’s Jury Theorem) in Proving History (index, “consensus,” p. 334, and “Why History Requires Expertise,” pp. 17-20). The linked webpages above cover it further, adding for example criteria for discerning the weight of an expert opinion (and I discuss the philosophy and epistemology of relying on expert opinion in Sense and Goodness without God, “The Method of Expert Testimony,” II.3.6, pp. 58-59, which should be read in the context of “Finding the Good Method,” II.3.1, pp. 51-53).

Here I will add more observations. But for a fuller understanding, you should read my previous writings on the subject (cited above).

Whose Opinion Should We Count?

Laypeople need to be able to evaluate the argumentative and evidential value of “expert opinion,” and that includes a “consensus” of expert opinion. They need to be able to tell when that has value and when it does not, and how much value it has, without themselves being experts. Indeed, even if they are experts, they often need to make these evaluations without themselves having to re-do all the research and study that that consensus is based on, as otherwise we would be demanding an absurd scale of inefficiency in the expert group, by nixing the ability to divide their labor, and instead requiring every expert to reproduce all the work of every other expert, a patent impossibility.

But a consensus has zero argumentative value when the individual scholars comprising that consensus have neither (a) examined the strongest case against that consensus nor (b) examined enough of it to be able to identify and articulate significant errors of fact or logic in it. So it is fallacious (indeed, a conspicuously unreliable practice) to just cite the consensus on anything, without first ascertaining whose opinions within that consensus actually count. The most reliable population to heed is that which consists of all qualified experts (those who have requisite expertise in the subject being appealed to, e.g. climate science, evolutionary biology, economics, the historicity of Jesus) who have met either condition (a) or (b), and therefore exclude from consideration all such experts who meet neither condition.

Notably, when questioning the historicity of Jesus, this means excluding from consideration nearly all historians of Jesus. Because almost none have met either condition (a) or (b). And this is even apart from other reasons we should discount them, which I enumerate in chapters 1 and 5 of Proving History, where I show that historians of Jesus have all been generating their conclusions from demonstrably invalid methods, and worse, have accordingly generated countless contradictory conclusions from the same body of evidence. As I state there, unless differences are admitted to be a matter of opinion rather than fact (index, “disagreement,” p. 335), “When everyone picks up the same method, applies it to the same facts, and gets a different result, we can be certain that that method is invalid and should be abandoned” (p. 14). And yet this is exactly what we observe has happened in Jesus studies. Therefore the “expert consensus” on the historicity of Jesus cannot be appealed to, because it is useless. Unlike the consensus of historians on almost any other subject. (Although please heed my past remarks on this; as well as my discussion of what this means regarding the burden of evidence in Proving History, “Axiom 6,” pp. 29-30.)

The second cull comes from eliminating from the pool of experts to count, those who articulate their reasons for their conclusion and those reasons are self-evidently illogical (you can directly observe their conclusion is arrived at by a fallacious step of reasoning) or false (you can reliably confirm that a statement of fact they made is false). Cranks, of course, will “believe” they see fallacies and falsehoods in an expert argument, when really there are neither, but I can only give advice to the sane. If you are a crank, you are beyond rational argument. Hopefully most of my readers are not cranks, but have taken the trouble to avoid excess delusionality and become competent evaluators of facts and logic. Or if you haven’t done that yet, please do.

This is where laypeople in the historicity debate can start to get a handle on why they should no longer trust the consensus of experts in Jesus studies. You can thus see why, so far, Bart Ehrman’s opinion is to be discounted, likewise Maurice Casey’s, Akin & Horn’s, Crossan & MacDonald’s, even, astonishingly, that of Goodacre and Bermejo-Rubio. There is something else driving their opinions, something other than a careful and objective examination of the facts. In some cases, I think it’s just institutional error (they are repeating things other experts told them, that they did not know were false) or institutional inertia (it’s just easier to not think about challenging the past consensus), in others, something more (Ehrman I suspect is too arrogant to admit his mistakes and thus has fallen victim to the escalation of commitment bias; Casey I suspect is simply insane). Even Bermejo-Rubio, whose mistakes are all subtle errors of logic (because an expertise in logic is unfortunately lacking from the training of most historians), I think is ultimately really a victim of both institutional inertia and commitment bias.

Counting Experts Who Actually Checked

We needn’t expect experts to always check. Many challenges to authority are vacuous and would be a monumental waste of time to examine. Thus, for the very purpose of efficiency, experts apply reasonable criteria for judging which challenges are worth examining and which ignoring. I discuss important aspects of this in my critique of the delusional mythicist Joseph Atwill. In general, a challenge needs to be presented in an efficient and competent manner (with strong and thorough citations of evidence, and clear and sound logic). Any challenge that fails even that rudimentary standard can safely be ignored. Because if the challenge is valid, there should be no reason why it can’t be presented correctly that way. That it is not being presented that way is a huge red flag. It most often means it can’t be. Because it’s false.

Of course, a challenge presented in an efficient and competent manner can be dismissed very quickly: if an expert looks at it and immediately spots fatal errors (of logic or fact). As I did in the case of Atwill. That meets the (b) requirement noted above. And no further inefficient waste of time is necessary. Unless the challenge can be re-presented without those errors (and the errors are acknowledged, apologized for, and explained…because one needs to explain why they were trusting a conclusion on false facts and fallacious reasoning, indeed how they can be competent to make a challenge at all if they can’t even get basic facts right, because failing to address this, or even an outright refusal to, is an indicator of delusionality, and experts have no obligation to engage with the insane).

But that noted, the strongest consensus argument exists when condition (a) is fully satisfied by every member of the expert community whose opinion you are counting, and you are counting more than a handful, and every one of them subsequently agree, and no other expert who disagrees satisfies even condition (b). Here an “expert” must be someone with a Ph.D. or equivalent in a field significantly encompassing or overlapping the subject in question. And we only count experts (in the counted group or the other group) who do not make their case on a self-evident fallacy or falsehood. But meeting all those conditions is rare. Usually you get something a little short of that. For example, 95% agree, and 5% remain recalcitrant. Or some experts who haven’t met condition (a) but do meet condition (b) gainsay the opinion of the experts meeting condition (a). Or some of the experts in either group rely on a self-evident fallacy or falsehood but not enough to obviously invalidate their conclusion.

This can make evaluating a consensus difficult.

The “strongest consensus” condition should add a strong weight to a conclusion. Everything below that, less weight, by some degree. In reality, few experts actually examine consensus-challenging arguments; most simply ignore them, rendering their opinion on them of less value, unless you can confirm such a challenge already has little prima facie merit (e.g. it does not pass the minimum requirement stated above). And a strong enough consensus argument can still exist when more than a handful of experts satisfy condition (b), and they outnumber experts gainsaying them by a substantial amount. If less than a handful of experts do this, then we can still have a valid argument from authority (i.e. their expertise can still weigh in favor of their being right), it just isn’t an argument from consensus.

To achieve the minimum satisfaction of (b), a representative sample of the consensus body (I would have to say at least three experts whose opinions can be established as independent of each other and whose subjective biases are not strongly aligned) must examine the strongest case at least enough to have found significant (rather than trivial, minor, or non-essential) errors in it (of either fact or logic) if any there be, and prove those errors exist (by presenting the requisite evidence or analysis, and showing that they actually correctly understand the argument they are rebutting and representing it honestly and accurately), or state that they found none (in which case this consensus would be affirming the challenge is correct, and thus the consensus should change).

Of course, a challenger should have the opportunity to expose any errors or dishonesty in the consensus case against them, and other experts should have the opportunity to join the counted consensus group (by meeting condition (b) in the manner above) so as to replicate or challenge their findings. But this is what constructive dialogue in an expert community is for. Eventually, it will become clear to all non-delusional participants that the challenger can’t meet the objections, or the consensus must change. And this does not entail a black-or-white result. The change to the consensus may simply be to admit that the truth about P cannot be presently known, contrary to the previous consensus which assumed P was true. And this can be accomplished by a challenger arguing P is false.

When the number of experts in a field entering the counted group in this manner becomes very large (e.g. at least twenty and ideally a hundred or even a thousand or more) and their collective opinion is substantially consistent (e.g. at least 95% agree) we will again have a strong argument from consensus. But we still have a valid argument from consensus when there is, say, 67% agreement among 10 experts (who have met condition (b), and no one else has). It will just be a weak one. But enough to warrant some degree of agnosticism among non-experts regarding the disputed claim P.

But Still Only Inductive

And yet for all that, it is still possible for the consensus to be incorrect. Indeed, this is still possible even when condition (a) is fully satisfied by hundreds of experts, the most ideal consensus normally achievable. But what we have at that point is a consensus that is increasingly less likely to be wrong. The strongest consensus argument has the lowest probability of being incorrect, while the weakest consensus argument has the highest probability of being incorrect–without that probability being so high as to nullify its value. Weaker consensus arguments are simply invalid.

In Bayesian terms, if we have confirmed that the consensus has achieved at least the minimum requirements to be of any value (per the examples above), then what we would say the probability was of that consensus being wrong in its conclusion that P is true would be the prior probability that P is false (the converse of which would then be the prior probability that P is true). Thus to overcome such a consensus, evidence must be presented that is so much more improbable on any other theory than such a consensus being wrong, that the prior probability against its being wrong is overcome (and overcome by enough to say it is more probable that the consensus is wrong).

This is one reason why anyone challenging a consensus bears the burden of evidence. But note that this again only applies to a soundly established consensus, per the procedures outlined above. Sometimes a consensus of experts is not soundly established, as in Proving History I have shown is the case in Jesus studies. And merely polling experts does not generate a valid consensus in any case (except against specious challenges, i.e. those, as explained above, that don’t even merit an expert’s attention), because all experts won’t have met even condition (b) for any challenge to the consensus, and so their opinion on the matter cannot be reliable even though they are an expert.

But this applies quite broadly. For example, an expert can meet condition (b) even when a challenge is not examined by that expert, but depends on a fact that the expert independently confirms is false, sometimes even without knowing they are addressing a challenge. For example, a valid argument from consensus holds when a layperson checks a standard expert reference book in a field and finds that something a crank is saying is false–even though the expert opinion in that reference wasn’t addressing their challenge or even aware of it. Because an expert confirming a false premise in a challenge meets condition (b), and a standard reference will have been vetted by more than a handful of experts. As another example, often someone will approach me and ask about some crank theory or other, which I haven’t heard of before, and I’ll ask them what facts these cranks hang their case on. If those facts are demonstrably false, and I can show they are false, I don’t need to examine the case further. I have already met condition (b). (Unless the person who asked me about it isn’t correctly reporting what the challenger argued, but then they’ve failed to meet the minimum requirement of warranting an expert’s time and attention.)

Overall, arguments from consensus, if valid at all, only increase the probability of an examined claim P. They don’t guarantee P is true. But a strong argument from consensus can greatly increase the probability of P. For an example of a way for laypersons to evaluate an academic dispute regarding what the consensus should be, see Galatians 1:19, Ancient Grammar, and How to Evaluate Expert Testimony.

Rebutting a Consensus Response to a Challenge

Often a challenger doesn’t accept the consensus rejection of their challenge. Often the challenger is delusional. But let’s suppose we know the challenger’s work well enough to doubt that. How can a layperson evaluate the matter when we have some appreciable expert consensus meeting condition (b) in rejecting a challenge, but the challenger calls foul?

Here we return to checking the experts rejecting P even after meeting condition (b), to see if their rejection is based on logically sound argument: no relevant fallacies, no relevant falsehoods. A non-delusional challenger who cries foul will have identified relevant fallacies or falsehoods in the expert consensus rejection of P. If they don’t, even given the opportunity, then you should probably start changing your opinion about that challenger’s delusionality.

Warranting reasons to conclude a consensus rejection of P is invalid include identifying an obvious fallacy in the expert’s reasoning or the repeated assertion of false facts. The latter is especially damning: if the experts comprising a cited consensus keep citing fact X as a reason for their opinion, and fact X is demonstrably false, then that consensus is worthless. I wrote about a classic example of this in Christian apologetics, where a consensus that there was an empty tomb was based on the belief that women were not trusted as witnesses in antiquity (a factually false claim, as well as a fallacy, since the women aren’t cited as witnesses in any early Christian source: see Habermas and the Devious Trick). Another example I point out in Hitler Homer Bible Christ (n. 9 p. 342): many scholars cited as the consensus in favor of the Testimonium Flavianum in Josephus base their opinion on the claim that an Arabic fragment derives from an earlier text than was employed by Eusebius, not knowing that that has been proved to be false (it derives from Eusebius).

The opinions of such scholars are to be discarded. No matter how expert they may otherwise be, they cannot be counted toward a valid expert consensus on that matter.

Generally, when it is proved (with honest and accurate representations of their arguments) that their opinion (their conclusion) is invalid or unsound (i.e. based on fallacious logic or factually false claims, or both), then a consensus of experts has failed to satisfy the conditions required for a valid argument from consensus. Such a consensus is to be rejected.

Concluding Points

Of course, showing an argument from consensus has no value does not establish the consensus is wrong. It only establishes that the existence of that consensus itself has no value for determining whether that consensus is true. The Argument from Fallacy is still a fallacy: showing that an argument from consensus is fallacious (because that consensus has no argumentative value) does not entail the challenge to that consensus is correct. It only eliminates one argument against that challenge. The challenger still bears the burden of showing that the challenge is also true.

A question that does arise, however, is what to conclude when a consensus of experts does not change even though it persists in arriving at its conclusion invalidly or unsoundly even after being shown that it is doing so. An expert community that behaves this way is discredited. It’s opinions then cease to hold any evidential or argumentative value. This is why fundamentalist experts cannot be counted in any argument from consensus. That requires showing that fundamentalists do indeed persist in sticking by false or fallacious reasoning. But once you’ve done that, such experts should simply be bracketed out of consideration as pseudo-scholars, and only the remaining body of experts considered relevant when citing consensus.

Of course, a discredited body of experts will continue to deny that they have been discredited. Thus the burden will always remain on the outside observer to decide which is the case. Have those experts been discredited, or is the claim that they have been discredited baseless? This can be difficult, but is not beyond the ability of a non-expert, since the only expertise required is that of being able to evaluate the logical validity of either side’s arguments.

It is less common that both sides will continue to claim the facts are different from what the other side claims, but even when that happens, it reduces again to a problem in simple logic: examine on what basis one side claims the facts are X and on what basis the other side claims the facts are ~X. At some point you will be able to identify one side or the other is arriving at that claim through invalid logic–or else you will be able to personally verify one side or the other is incorrect (e.g. if a weatherman says it is raining outside and you can directly observe yourself that it is not). Thus actual expertise is not needed to vet the relative reliability of experts. Except expertise in reasoning, which everyone should endeavor to have.

Finally, it is important to note the logical significance of a divided consensus. It is almost never the case that an expert population agrees 100% on every issue (every single member of that expert community agreeing). Yet if there is disagreement, this calls into question the validity of that expert community’s claims to expertise (since if their methods and standards, by which they qualify as experts in the first place, are so unreliable that they cannot generate consistent results, then it can be questioned whether their expertise has any value in the matter at all). How can two experts, using the same methods on the same facts, get different results? There are several causal hypotheses with enough frequency in practice to be plausible enough to test:

The disagreement is admitted by all sides to be unresolvable on present evidence (e.g. all experts agree that their disagreement is a matter of opinion that cannot be conclusively resolved on present evidence, and they are more stating what they feel to be most probable given their limited data). In this case, the disagreement is insignificant to the function of an argument from consensus, as long as such an argument is being used to establish the general point and not any disputed particulars.
The disagreement is on minor nuances and not substantive matters (e.g. all experts might agree with a more general statement of the matter and only disagree on small details that are not conclusively provable on present evidence). In this case, the disagreement is insignificant to the function of an argument from consensus, as long as such an argument is being used to establish the general point and not any disputed particulars. (This is not the case if the disputed particulars are not minor, but in fact shouldn’t be in dispute if these experts are using valid methods. Then our situation is one of either of the following.)
The disagreement is caused by failures to meet condition (a) or (b), as discussed above. In which case experts are diminishing the value of their authority by affirming opinions as proved that in fact they have not responsibly vetted (i.e. they have not satisfied conditions (a) or (b), which they ought to know invalidates the strength of their opinion in the matter). Such experts should be advised not to do that, and to responsibly vet their own opinions first. Their opinion cannot be cited as part of the consensus until they do that. Counting such opinions is the most common error in making an argument from consensus. It is too often simply assumed that every qualified expert will know all about P, because P falls under subject S and they are experts in subject S. That is almost never true. Unless you can demonstrate meeting at least condition (b), in at least the broadest sense (as illustrated above), then being an expert on S does not in fact make you an expert on P. All you can affirm as an expert in that case is that you’ve never heard of any reason to believe P is true (or false), and probably would have if it were. But that is an extremely weak argument, and should always be acknowledged as such.
The disagreement is caused by subjective biases on one side or the other. In which case, it will be possible to identify which side is violating expert objectivity (by seeing which side most often errs on key facts or logic), and then bracket their opinions out of the pool of experts, no longer to be counted as relevant. Often that means the only valid consensus that remains is that of the other side. This is where we are now in the dispute between scientists and creationists.

That last rule is perhaps the most useful.

Whenever you see two bodies of experts disagreeing with each other, and you are not an expert in the same subject, first identify which of the four categories that disagreement falls under:

[1] noncommittal disagreement
[2] trivial disagreement
[3] uninformed disagreement
[4] biased disagreement

The first two are unimportant and you needn’t trouble yourself over it (experts who admit their disagreement is a matter of opinion, and experts who disagree over things you concede are trivial, are not disagreeing in any manner that poses a problem for the layperson). The second two are important, but of those, the first allows you to determine which side to trust by simply looking at which side has bothered to check the claim they are talking about (i.e. have met at least condition (b) with respect to any challenge to their opinion). If one of them hasn’t, but is just arguing from the armchair, while the other side has examined the best case against them, and appears to have answered it without self-evident fallacy or falsehood, then you know which side is most likely right (depending on how many experts are in their camp: a conclusion must be regarded as tentative until the number of experts conceding it is large).

And you can do all that without having to be an expert yourself.

Which leaves the last scenario: Where both sides appear to have at least tried to meet condition (b), are disagreeing about something that isn’t trivial, and are insisting it’s not an arguable matter of opinion. What you do then is try to test the credibility of both sides. Locate the genuine experts on either side (ignore amateurs taking up their banner; you only want to vet the qualified experts here) and check their references and diagram their logic, until you start finding mistakes. You must necessarily find some, because only one thing can be true, so if two people disagree whether some claim P is true and are sure they are right, one of them must have made a mistake in their reasoning somewhere: either relying on a premise that is false, or on an argument that is fallacious.

Generally, eventually, you will find one side to be disproportionately more dishonest about the facts (citing bogus sources, or misrepresenting what those sources say or demonstrate, or not even citing sources for their claims at all, or any evidence you can independently verify) or illogical in its reasoning (and basic competency in detecting fallacies is all you need here, a competency everyone should have, or certainly labor to develop if they don’t).

Then you will know which side’s opinion you can safely discount.

-:-

Appendix: I’ll be collecting good examples here of challenging an expert consensus, which illustrate the required components for this to have merit: there has to be at least some experts who have successfully made the same criticisms under peer review, and the rest has to be factually correct and logically valid, particularly a demonstration by non-fallacious evidence-based reasoning that the consensus is ill-founded.

Recent examples:

Past examples:

To explore, conversely, problems with peer review as a process see The Korean “Comfort Women” Dust-Up and the Function of Peer Review in History and pertinent articles linked there. And to explore crucial fact-checking principles related to evaluating an expert consensus see A Primer on Actually Doing Your Own Research and A Vital Primer on Media Literacy.

36 Comments

Jim Reed on May 8, 2014 at 5:41 am

I guess this means we should reject the opinions of religious experts because the fact that they are religious experts means they have less understanding of religion and God than a skeptic.
Reply
- Richard Carrier on May 9, 2014 at 9:12 am
  
  Can you elaborate on your reasoning?
  Reply
aggressivePerfector on May 8, 2014 at 8:27 am

Nice work. Perhaps worth emphasizing beyond what you already said is the importance of independence. If expert B’s opinion is a replica of expert A’s opinion, then B also replicates A’s mistakes, and A and B do not constitute a robust consensus. Mathematically, we can still analyze the evidence in the case of non-independence (using e.g. auto-regression or spectral analysis) – there is no territory into which Bayes’ theorem cannot step – but the evidence is weakened considerably by non-independence.

Of particular use in this arena would be a development of simple procedures for assessing degrees of independence, and their likely impacts.

This is not to claim any shortcomings in your analysis, but to point out a possible challenging project for somebody in the future.

*****

Here’s a Thursday super bonus riddle:

I read a book on logic (true story) in which the reader is invited to consider the following,

As Descartes said, appeals to authority are never really reliable.

The author of the book later identified this as self refuting, but he was wrong. Perhaps your readers can see why.
Reply
- Richard Carrier on May 9, 2014 at 9:17 am
  
  Very good point. I do discuss the importance of independence in my discussion of Condorcet’s jury theorem in Proving History. But it’s also worth pointing out as you do that even dependence can be modeled mathematically (it generally just weakens support for a conclusion; often to zero, as when one expert simply repeats what another says uncritically).
  
  (On the riddle. Nice. Clue: the statement was not written simply “appeals to authority are never really reliable.” Although having heard the latter trope from Christian apologists too many times, I also have to caution people against falling for that as well, even though this has nothing to do with your riddle, but what it makes fun of: the covert smuggling in of the word “never” should never be allowed to slide. A statement like “appeals to authority are often unreliable” does not allow the claim of self-contradiction.)
  Reply
- aggressivePerfector on May 9, 2014 at 12:20 pm
  
  In follow up to my own comment, I remembered that there has already been some fascinating work on the topic of analyzing interdependence of researchers. A great paper, How citation distortions create unfounded authority used graph theoretical analysis (essentially the same algorithm Google uses to rank web pages) to examine the citation network associated with a particular hypothesis, that a protein called amyloid accumulates in the brains of subjects with Alzheimer’s disease. The consensus on this hypothesis had long been firmly affirmative. Nearly all papers discussing the topic, however, were found to derive their conclusion from only 4 review papers, all from the same group. The reviews concerned only 4 experimental studies, which supported the hypothesis, and ignored 6 studies that did not support it.
  
  Obviously, not all forms of interdependence leave such a convenient paper trail, and the dependency graph, once known still has to be converted to a probability assignment somehow, but I’d say this inspirational paper makes a fine start.
  Reply
- Will on May 10, 2014 at 2:03 am
  
  So here is my amateurish take on the riddle – “appeals to authority are never really reliable.”
  
  As Richard suggests, “never” is too absolute for an inductive matter. It seems to assume as certain that a.) a universal consensus of authorities (in the relevant field) cannot be established since a disagreement among authorities would in itself prove that they were not all correct. Or that b.) such a consensus, if established, cannot be correct in their agreed position.
  
  I guess something hinges on the interpretation of “really reliable” too. A Bayesian would have to concede some level of uncertainty in any empirical matter (hence the problem with “never“). The term “reliable” itself would necessitate a probabilistic judgement. To what degree does our verifiable experience justify this assertion (i.e. about the truth of a matter coinciding with a universal consensus of experts)? Even if 100% of our past experiences showed that a consensus of authorities was correct in every known case, our epistemic uncertainty (as justified by our background knowledge of human fallibility) would prevent us from actually allowing a full 100% certainty (since we could always be wrong about some of those cases without knowing it). However, if we understand “really reliable” more in terms of a prescriptive methodological point, then it kinda makes sense that we should never ASSUME that a consensus of authorities will ALWAYS be correct in their agreed position. If we did make that assumption we would be using the mirror image fallacy of that expressed in the riddle…both fail to account for an inevitable degree, no matter how small, of epistemic uncertainty. So a universal generalization about the reliability of a consensus of experts has to always be below 100% (no matter which side of the spectrum of priors is suggested by one’s background knowledge of such consensuses). So if reliability is treated as synonymous with “correctness”, then obviously our epistemic uncertainty will make the riddle false. But if we understand “really reliable” to mean “completely trustworthy”, then the riddle seems coherent as a prescriptive methodological assertion rather than a descriptive point about an established fact.
  
  Hopefully that made some kind of sense… lol
  Reply
  - Richard Carrier on May 12, 2014 at 10:51 am
    
    Note that what you are talking about is a mathematical problem already solved by Condorcet. It’s called Condorcet’s Jury Theorem. I discuss how it works and applies to this question in Proving History (see index).
- Will on May 12, 2014 at 1:49 pm
  
  ah yes thanks.. just read the PH section. much more clear.
  Reply
Michael Chase Walker on May 8, 2014 at 9:25 am

Great article and timely!
Reply
- Richard Carrier on May 9, 2014 at 9:18 am
  
  Sorry, I’ve been in a cave for a while. Timely how?
  Reply
peterbollwerk on May 8, 2014 at 2:11 pm

Thanks very much for this. It is an excellent resource for critical thinking advocates like myself. My wife is a high school English teacher who is always looking for resources to help her kids improve their persuasive essay writing skills. I will be sure to point her in this direction (and to the critical thinking section of your website).

Unfortunately for our society, evaluating evidence and sources are not easy, even for folks like us who try really hard to be objective. It is exhausting sometimes to have discussions with people who do not have good critical thinking skills, and I’m by no means an expert at it. I don’t know how you have the patience and endurance to do what you do. But I admire what you do very much. I hope you make progress in persuading other historians to use these methods.
Reply
Susannah on May 8, 2014 at 3:11 pm

… I can only give advice to the sane.

I had to laugh, but this is dead on! We would save ourselves so much time and effort, in so many fields, if we only remembered to follow this principle.
Reply
Randall Johnson on May 8, 2014 at 8:18 pm

I separate objective from subjective first. A consensus on climate change carries a lot of weight; a consensus on Jesus carries very little.

Just how many “experts” on the Jesus story are there who have not spent a lifetime drunk on the Kool-Aid? And what becomes of their perception of the value their lives if it all goes up in smoke and mirrors? Can you imagine a clerical scholar 65 years of age suddenly saying, “Oops?” A whole lifetime wasted?

Even Bart Ehrman, who became agnostic through studying scripture, flinched when it came to taking the next logical step. The tortured logic he employed to try to prove there was at least some substance behind a prior lifetime of belief tells me all I need to know about “consensus authority” when applied to faith.
Reply
pofarmer on May 10, 2014 at 3:47 am

Dr. Carrier, I have sort of an unrelated question, if you don’t mind. I heard it said last night that the High Priests in Jerusalem were sending out Saul/Paul to arrest and perscute Christians. Historically, is this something that the Romans would have typically allowed or tolerated? Thanks.
Reply
- Richard Carrier on May 12, 2014 at 11:11 am
  
  Yes. Possibly.
  
  By imperial decree the Jews were allowed their own laws (in fact even Romans had to obey some of them in Judea; otherwise, only Jews were subject to them). This was because they allied themselves with the winning side in the civil war (supporting Julius Caesar and then Augustus, 50s to 30s BC). That was no longer the case after the Jewish War (66-70 AD). I discuss this in some detail, with references, in my chapter on burial law in The Empty Tomb.
  
  There is some dispute whether this agreement was already being altered before the war. For example, the Talmud and G.John claim the Jews had lost the right to execute without imperial permission 40 years before the war, but that number, in the Talmud, is suspiciously theological, and the evidence of Jewish trials and executions in the 30s and 40s is more than extensive enough to disprove such a legend, although they may have gotten the date wrong (or Roman permission may have been so easy to get it never even got mentioned). Josephus makes no mention of this development, which is a strike against it, although he may have been inclined to conceal this. But his story of Ananus executing James refers only to his assembling the court without imperial permission, and it’s unclear which step in that process was a violation, e.g. Josephus may mean that Ananus had not yet received the endorsement of the Roman authorities to be high priest or chair of the Sanhedrin, and in any event the implication is that a court assembled with imperial permission would be, even then, authorized in issuing death sentences.
  
  So, in answer to your question, Roman law allowed Jewish authorities to arrest, try, and execute blasphemers (for example), but Roman citizens would have been exempt, and also citizens of other non-Jewish polities, and probably any Gentiles, period. For instance, a Gentile could appeal to any court (maybe even a Jewish court) with the defense that they are not even a Jew. More certainly, citizens of Damascus may have had the power to appeal to the Damascene authorities and Damascene law to exempt themselves from Jewish arrest warrants, although (a) Jewish inhabitants, like many others, in Damascus, did not necessarily have Damascene citizenship (you did not have it merely by living there, unlike in the US), and (b) any Jew who used that tactic would likely be shunned as betraying Jewish law and could no longer associate with fellow Jews, unless their fellows agreed the warrant was unjust, etc., so it would be a complex political question, and not a cut-and-dried matter of law. The Romans, meanwhile, wouldn’t care, as long as Roman citizens weren’t being arrested, and the Jewish court didn’t overstep its bounds.
  Reply
abcxyz on May 10, 2014 at 10:15 am

I think there’s another important issue with regard to historical Jesus studies that needs to be addressed, but never is.

The fact is that New Testament scholars are not historians. They don’t have PhD’s in history. They don’t even have undergraduate degrees in history. A New Testament PhD is a hybrid literature/theology degree. It is not a history degree. New Testament PhD’s are granted by theological seminaries and religious studies departments, not history departments. New Testament scholars are theologians and literature experts, not historians.
Reply
- Richard Carrier on May 12, 2014 at 10:44 am
  
  That’s a good point. I’ve noted it myself before. Although NT Studies does include some training as a historian, and some who complete their studies in that field are good historians, so it’s not a cut-and-dried issue. But sometimes their ignorance of how historians work is astonishing.
  
  The irony is that NT Studies guys try to claim historians are unqualified to discuss the historicity of Jesus. No, I am not kidding.
  Reply
Tom Higgs on May 10, 2014 at 11:43 am

Hello Dr. Carrier,

A few questions:

1. How are you able to keep track of all that you have written and read? That is, when you said what and where?

2. How do you take notes about a book and then transfer that to written form in presenting your arguments? How long does it take you to transcribe all your marginalia to a formatted essay etc?

3. What are some software that you use to assist in your workflow:
Examples I am thinking of – Mind Mapping , Argument Mapping, Academic Suite like Note Bene, Evernote etc

4. Do you utilize any memory systems (Roman room, Journey method etc)

5. What goes into the preparation of a post of yours and how long is that process?

6. Do you have a criteria for whether to purchase a book or borrow from a library?

Thanks
Reply
- Richard Carrier on May 12, 2014 at 10:38 am
  
  1. I often don’t. For the rest, my brain, and a searchable computer file system.
  
  2. Too many different ways to list, too many different variables to state a rule.
  
  3. I find all of those kinds of things more time-consuming that not using them (such software is so labor intensive to maintain and organize it’s generally not worth it; I use my own brain, it’s vastly faster, and comes pre-programmed). But I do use EndNote as a catalog to my personal home library (since I forget where my books all are, I have so many of them).
  
  4. No.
  
  5. Varies too much to answer usefully.
  
  6. Basically, if it’s affordable and I am certain I shall need to consult it often or read the whole thing and annotate it all through, then I buy it. Otherwise, I borrow. I borrow hundreds of times more than I buy (maybe thousands, depending on how you count).
  Reply
- Geoff on May 24, 2014 at 7:29 pm
  
  Mendeley is a free reference manager similar to Endnote. I use it and find it useful
  Reply
Tom Higgs on May 10, 2014 at 11:52 am

Further,

Apologies if this question is not specific to the topic (something I have often thought about also) of the post. I thought you may not have sufficient time to answer via email and others may wonder about these very same things about you and so could also prove beneficial to others.

I wonder if there is a way to start quantifying the consensus of scholars on any given subject in a database.

Take the Historicity of Jesus for example:

There is in a single location the research and statement of each scholar.

“I think Jesus is historical because of such and such evidence. This evidence I(the scholar) can be found in xyz work(s)…”

The interested layperson could then examine that evidence for themselves.
Reply
- Richard Carrier on May 12, 2014 at 10:33 am
  
  That’s certainly possible, and would be very valuable.
  
  But no one has the money to invest in making that happen. It would be very expensive to coordinate and motivate.
  
  The industry itself could take on that task–the way PhilPapers Surveys has sort of begun to do for philosophy…although all they are polling are opinions, not reasons, and that badly, e.g. they present polled philosophers with too many false dichotomies and they don’t define any terms. And if philosophy can’t even get that shit together right, biblical studies isn’t likely to.
  Reply
Mobius on May 10, 2014 at 12:14 pm

How many times have I heard “But so many historians think Jesus was real” as an argument that Jesus was real.

While personally, I think it likely that there was a wandering religious teacher (or perhaps several) that provided a tenuous basis on the Jesus myths, I do think you have some very thought provoking arguments. I find it certain that if such a person existed, he was mythologized (with plagiarized myths no less) in a way similar to the mythologized stories we hear of Daniel Boone…who killed a bear with his bare hands.
Reply
Denish on May 13, 2014 at 3:18 pm

I just watched the YouTube video of the debate you had with Zeba Crook. Even though I am a Christian I think your case was extremely interesting. Especially the Philo – Paul comparison. I truly believe that early Christians did make use of some pre-existing categories for expressing their theological thinking about Jesus. So I couldn’t think of it as in anyway threatening for a historicist position. I think Crook didn’t clearly understood your argument from the fact that Paul excluded James from the rank of Apostles. As I had done some recent research on this subject for a book I am writing on the Church Polity, I found this as a very interesting spin from your part. The minimalist approach Zeba Crook adopted was good enough to show that an evolution from [Historical Person] to [Theologized Second Person of Trinity] is far more probable than an evolution from [Myth originated with visions] to [Historicization] to [Theologized Second Person of Trinity] in the absence of any conclusive evidence to prove that Mark’s motivation or reason behind writing his gospel was just to set a ministry model for missionaries. We need some strong evidence to show that the Authors of Gospels and their early readers considered gospels to be just “extended parables”. Showing that some modern scholars could interpret them as parables without taking into consideration all the details is not sufficient to overcome the unbelievability of the extremely complex model that you are proposing when we compare that with the much simpler explanation of the same facts that you are appealing to as found in the case constructed by Crook. Assuming this is the case that you will be making in your up coming book on this subject I think it is safe to predict that major objection you will get from Conservative Christian side will be the “unjustified presuppositions that you are bringing to the text”. Anyway I am looking forward to the release of your book.
Reply
- Richard Carrier on May 14, 2014 at 2:25 pm
  
  Unfortunately I’m about to embark on a long period of travel, so I won’t get to blog about that video for awhile. I’ll do so as soon as I can find time (amidst all the other dozens of things that need blogging). I plan to write up a commentary on it.
  
  Yes, Crook didn’t answer the arguments that took him most by surprise (they wouldn’t have if he’d chosen a different order of presenting, or if it hadn’t just been finals so he would have had time to read my book before the debate, instead of administering and grading tests and papers, the latter limitation quite beyond his control, and the former had pluses and minuses no matter which way he went).
  
  As to whether his model makes more sense than mine, I quite disagree. The evidence goes the other way, as I showed in the debate. Crook was mistaken in claiming it went from Mark’s earthly Jesus to John’s pre-existent Jesus. Paul’s Jesus is already pre-existent and even more supernatural than John’s. And Paul predates Mark by decades. Meanwhile, Mark is demonstrably fiction (I even showed that with Crook’s own examples).
  
  But certainly, to be sure, one needs to weigh and compare a lot more evidence and models than we had time to cover. Such is the case with any debate (something Christians often don’t understand in their love of debates; there were some Christians in the audience, but I think they were as annoyed by Crook’s destruction of their faith as mine, so that was win-win for both of us).
  
  And as you note, we need to see lots of examples and analysis of the structure and content of the Gospels (esp. Mark) to see how they are creating their fictions and why. That I do accomplish in chapter 10 of my book.
  Reply
Paul Thomas on May 14, 2014 at 12:45 am

Hi Richard

Why do you think Mark has the demons (or unclean spirits) recognise Jesus even though no one else knows who he is? I ask this in context of the Ascension of Isaiah story where Jesus descends through the several layers of heaven in disguise and only reveals himself when in lower heavens, leading to surprise from the demons inhabiting that domain. Is this a Markan nod to the Ascension story (even though in Mark he doesn’t seem to reveal himself and is in disguise)?

I’m fascinated about the relationship between these two sources, if indeed your mythicist theory is correct.

Thanks
Reply
- Richard Carrier on May 14, 2014 at 12:25 pm
  
  It would be a reversal of the Ascension story (where the demons don’t know who he is). But I can see what you have in mind, placing the earthlings in the position of the demons (as one would expect someone to do who terrestrialized the original Ascension narrative) could be such a nod. Although the parallel breaks down when Jesus doesn’t reveal himself and shame and subjugate his executors (as he does in the original Ascension). One would also have to suppose making the demons then aware is a literary device. Which it certainly is. But how does it fit the Ascension model?
  
  There may be closer parallels with the Enoch literature than the Ascension. And that may have been a factor in any case.
  
  Still, I think a better fit is MacDonald’s explanation, which I summarize in OHJ:
  
  Mark’s strange theme of the ‘messianic secret’ (which Jesus always insists upon, even though almost no one ever complies) makes no sense as history, or really even as theology or apologetic, yet makes perfect sense as reflecting the theme of Odysseus in disguise among the suitors in his palace who were maliciously courting his wife. Like Jesus, Odysseus endeavors, even when occasionally recognized, to maintain that disguise until he can get his revenge on those suitors (the sinners who would usurp his place to slake their greed) who have inhabited his house–analogous to the priests and Pharisees inhabiting the temple (God’s house), who are likewise corrupt sinners mired in hypocrisy and greed, and likewise courting the same woman: the church.
  Reply
Pierce R. Butler on May 14, 2014 at 8:17 am

“When everyone picks up the same method, applies it to the same facts, and gets a different result, we can be certain that that method is invalid and should be abandoned” (p. 14). And yet this is exactly what we observe has happened in Jesus studies.

Yet one generation ago, and certainly two or more ago, anyone using this measure would have reached the opposite conclusion.

I propose a ~~rule~~ analogy from nutrition: We need a little bit of salt at all times, and an abundance of it when the task at hand makes us hot ‘n’ sweaty.
Reply
- Richard Carrier on May 14, 2014 at 12:17 pm
  
  Sorry, I don’t get your point.
  Reply
- Pierce R. Butler on May 14, 2014 at 4:36 pm
  
  Merely that “well-established facts” can wobble and fail at times.
  
  Examples range from the reputation of Columbus to scientific claims such as Mercury orbiting in tidal lock with the sun, dinosaurs having sub-brains in their lower spines, Martian canals, and other phlogistons of your choice.
  
  By “hot ‘n’ sweaty” I meant to allude to high degrees of passion in certain controversies, and the proportionate need for applied skepticism.
  Reply
Giuseppe on May 19, 2014 at 2:24 am

Hi Richard,
Reading at Infidels.org your review of The Homeric Epics and the Gospel of Mark, by Dennis R. MacDonald, I find this point:

My own hypothesis is that Mark ended the Gospel thus in order to set up a pretext for why little of his particular story had been heard in the Christian community until he wrote it down.

1) Then Mark is selling deliberately his story has Recorded History?

2) your hypothesis is still valid and possible if I assume that the young man at tomb exhorts the woman to command a by now degraded Peter (a Judeo-Christian icon and then rival for the pauline Mark) to go in Galilea of Gentiles because the Risen Jesus is allegory of new Temple and/or Novus Israel (the true church) now present not more in Judaea but into the Diaspora?

If so, then MacDonald is partially correct when he argues wrongly that the exhortation of young man to escape in Galilea is a kind of apologetical ”defensive military tactic” ex evento post-70, because it may be another clue that for Mark the Risen Christ is allegory of Israel that survives to the destruction of Jerusalem in 70.

Thanks for the replies,
Giuseppe
Reply
- Richard Carrier on May 20, 2014 at 9:00 am
  
  I’ve evolved in my thinking now that I understand what Mark is doing literarily. I wasn’t convinced then that Mark could possibly be doing something else. But for my evolved view and it’s basis, you’ll have to await my new book On the Historicity of Jesus. Expected June or July.
  Reply
Giuseppe on May 21, 2014 at 12:56 am

Excuse my impatience. I don’t see the moment to take it!
Reply
ESTA ANN AMMERMAN on June 10, 2014 at 8:38 am

I am looking forward to your live discussion about this article on Inspiring Doubt with Greg Brahe, Wednesday Evening at 7 pm EDT or EST.

I am especially interested in hearing about point 4 in your conclusion in this article: [4] biased disagreement.

Since we all are subject to confirmation bias, I find it difficult to even reason about objectivity giving consideration to your explanation on how to best handle it in the conclusion. 🙂 I’ll be watching. Thank you.

https://www.facebook.com/events/1448687315371416/?ref_newsfeed_story_type=regular
Reply
brianpansky on June 26, 2014 at 11:04 am

So, I’m curious about the relevant people to consensus about the historicity of Jesus. You mention Bart Ehrman, Maurice Casey, Akin & Horn, Crossan & MacDonald, Goodacre and Bermejo-Rubio.

Is there anyone else whose review of the matter will hold weight in a consensus?
Reply
- Richard Carrier on June 26, 2014 at 11:21 am
  
  Hundreds of people.
  Reply