TORTINI

For your delectation and delight, desultory dicta on the law of delicts.

Epistemic Nihilism and Ancient Documents

December 28th, 2016

This year, the Judicial Conference Advisory Committee on Evidence Rules proposed abrogation of the current “ancient documents” exception to the rule against hearsay, Rule 803(16). The proposal would, if adopted, become effective on December 1, 2017. See James A. King & Kirsten Fraser, “Say Goodbye to the ‘Ancient Documents’ Rule,A.B.A. Trial Evidence (Feb. 17, 2016).

Under the Federal Rules of Evidence, an old document, one over 20 years old, found in a place where one would expect to find it is treated as “authentic.” The finder of fact, judge or jury, may accept the document for what it purports to be simply because of its age and the manner of its discovery. Rule 901(b)(8). The Federal Rules, however, go further and permit the document, authenticated as a so-called “ancient document,” to be accepted for the truth of the statements it contains. Rule 803(16).

The rationale offered by the Deans of Evidence Law for this remarkable exception to the rule against hearsay is that old documents predate the legal controversies in which they might later be used in evidence, and that we might not have any other admissible evidence relevant to events in the distant past (greater than twenty years). “[A]ge affords assurance that the writing antedates the present controversy” wrote the Federal Rules of Evidence Advisory Committee. Fed. R. Evid. 803(16) advisory committee note.

Pithy and pathetic. The proffered rationale was not valid when Rule 803(16) was initially drafted or promulgated, and it is not valid today. Old does not suggest or equate with reliability, and the present controversy is not the only source of bias and error in past statements that happened to be put in writing.

First, contrary to the conjecture of the common law, many documents were and are created with a view to influence potential controversies decades later. Mergers, acquisitions, and divestitures of companies are often governed by documents that explicitly acknowledge controversies that could stretch into the distant future.

Second, sometimes the exact motivation to falsify, fabricate, or fudge is not exactly the same as later will exist in later litigation, but it is similar. So when a young patient misrepresents his smoking history to a physician, he may simply be trying to avoid a disapproving lecture from his healthcare provider. Years later when he has developed lung cancer, and he is trying to blame anything but smoking in a lawsuit, he will rely upon the distorted report of his tobacco consumption. Of course, in many other situations, the motive to create misleading documents will arise from the expectation of the possibility or probability of future litigation over intellectual or real property rights, insurance contracts, etc.

Professor Daniel J. Capra of Fordham Law School, for many years the Reporter to the Judicial Conference Advisory Committee on Evidence Rules, has not been shy about the slim to none justification for Rule 803(16). In a podcast interview with Professor Ed Cheng, Capra laid out the case against Rule 803(16), and its evanescent rationale. See Excited Utterance website, Daniel Capra, “Electronically Stored Information and the Ancient Documents Exception” (Aug. 22, 2016). Preliminary Draft of Proposed Amendments to the Federal Rules of Bankruptcy Procedure and the Federal Rules of Evidence (Aug. 2015). Capra had previously deconstructed Rule 803(16) in a law review article, with the usual scholarly apparatus of footnotes and review of historical sources. Daniel J. Capra, “Electronically Stored Information and the Ancient Documents Exception to the Hearsay Rule Exception to the Hearsay Rule: Fix It Before People Find Out About It,” 17 Yale J.L. & Tech. 1 (2015) [cited as Capra].

As Capra puts it, the justification for Rule 803(16) was “never very convincing in the first place,” Capra at 1, 5, and “a radical and irrational hearsay exception – an error of the common law,” Capra at 11. The equating of a document’s authenticity with the trustworthiness of assertions contained within the document is “curious.” Capra at 9. Curiously, Capra balks at complete abrogation.

The law professor may have well said that the rule is capricious. Still, on gossamer grounds, Capra argues to retain the rule for hardcopy old evidence, but to abandon it for electronically stored information. Capra seems to buckle under the prospect of abrogating a rule, which leads him to “split the baby.” As trenchant as Capra’s critique is, most of his defense of retaining the Rule for hardcopy documents is as unsupported as the original Rule 803(16). Capra, for instance, suggests that “the likelihood of finding a hardcopy document that is twenty years old and also relevant to an existing litigation is quite small.” Capra at 5. Capra offers no empirical evidence for this startling assertion. Similarly, Capra claims that Rule 803(16) is rarely invoked, but he cites only to the paucity of cases that discuss this exception. Capra at 12. Capra thus claims negative evidence for the infrequent use, but the failure of judges to discuss this rule in published decisions may well be the result mechanical simplicity of the rule, which rarely leads to post-trial motions and appeals.

The original rational for Rule 803(16), and a rallying cry for its retention, is the supposed necessity to have some evidence to make out the events of times past. The objectors, however, generally fail to make out their case that the residual hearsay rule (Rule 807), as well as the business record and other exceptions, do not accomplish the twin goals of providing some evidence of past events while maintaining some semblance of reliability in fact finding.

The Rule Committee’s proposal has been met with an organized campaign from the lawsuit industry, both through the self-aggrandizingly named American Association for Justice (AAJ) and many of its members.. A rough count suggests that about 82 out of 218 comments came from asbestos plaintiffs’ lawyers. See, e.g., Comment from Robert Jacobs, American Association of Justice; Comment from Larry Tawwater, American Association for Justice (AAJ); Testimony of Marc P. Weingarten; Comment from Robert Paul, NA; Comment from Mark Gallagher, NAComment from Joseph Rice, NA.

The lawsuit industry lawyers argue that they will be deprived of the ability to show scienter or knowledge of a risk in latent disease litigation in which disease outcomes are lagged several decades after first exposure. Their argument, however, misses the point that many documents in company files, while not admissible for their truth, will be evidence of “notice” of a potential hazard, and the documents would be admissible for “state of mind,” and not the truth, in any event. Given the changes in epistemic standards for establishing causation, it is unlikely that really ancient documents will move the fact finder any closer to the truth of the actual fact of asserted causation. SeeTime to Retire Ancient Documents As Hearsay Exception” (Aug. 23, 2015); “Ancient Truths” (May 5, 2016).

The asbestos plaintiffs’ lawyers thus argue that they would be deprived of important evidence such as the “Sumner Simpson” documents in asbestos cases. See Threadgill v. Armstrong World Indus., Inc., 928 F.2d 1366 (3d Cir. 1991). The plaintiffs’ argument rests upon an epistemic mistake. If knowledge is true, justified belief, then they do not need Rule 803(16) to show that the Sumner Simpson documents contain true statements; it is enough that they show that the authors believed what they said. In other words, the plaintiffs need only show that the documents reflect the declarants’ state of mind. Whether the statements are justified as true will require a complex mixture of current evidence about what health effects can be shown to be caused by the exposures of old, and what justifications were valid in today’s knowledge that were available and embraced by the declarants in the ancient documents.

Insurance coverage plaintiffs’ counsel have argued that they need Rule 803(16) to meet their burden of proof, and that insureds rely on both internal and external records. A policyholder’s internal records might include financial statements, annual reports, meeting minutes, check registers, contracts referencing insurance, insurance policies referencing other insurance policies, and/or accounting ledgers. Comment from Sherilyn Pastor, NA.

Most of the special pleading of these interest groups is wide of the mark. Old statements may be relevant and admissible for the speaker’s or reader’s statement of mind, and thus not hearsay. Old reliable documents can still be admitted under the residual exception, Rule 807, or under the business records exception, 803(6). Statements made in the making of contracts are operative facts of offer and acceptance, “speech acts,” and not offered for their truth.

The fact that a document is old may perhaps add to its authenticity, but in many technical, scientific, and medical contexts, the “ancient” provenance actually makes the content unlikely to be true. As such, the rule as now in effect is capable of much mischief and undermines accurate fact finding. The pace of change of technical and scientific opinion and understanding is too fast to indulge this exception that permits out-dated, false statements of doubtful validity to confuse the finder of fact. With respect to statements or claims to scientific knowledge, the Federal Rules of Evidence have evolved towards a system of evidence-based opinion, and away from naked opinion based upon the apparent authority or prestige of the speaker. Similarly, the age of the speaker or of the document provides no warrant for the truth of the document’s content. Of course, the statements in authenticated ancient documents remain relevant to the declarant’s state of mind, and nothing in the proposed amendment would affect this use of the document. As for the contested truth of the document’s content, there will usually be better, more recent, and sounder scientific evidence to support the ancient document’s statements if those statements are indeed correct. In the unlikely instance that more recent, more exacting evidence is unavailable, and the trustworthiness of the ancient document’s statements can be otherwise established, then the statements would probably be admissible pursuant to other exceptions to the rule against hearsay, as noted by the Committee. The proposed abrogation of this exception to the rule against hearsay should be welcomed; it is long overdue. And if it Capra is correct that ancient hardcopy rarely exists, and that the ancient document rule is rarely invoked, then abrogation cannot have the effect of defeating expectations and reliance upon this dubious mode of proof.

Of course, witnesses who are the declarants in the ancient documents may have died or moved away, but that is precisely why the law generally has statutes of limitations. When the law has generously created discovery rules, it should not then promulgate unreasonable, unreliable rules of evidence simply because it has extended the life on what otherwise would be stale claims.

Fake Science News

December 17th, 2016

Fakers to the left; fakers to the right. Everyone has his or her knickers in a knot over fake news these days.  But who will speak out against fake science news?

Oberle Communications LLC[1] puts out a “Product Safety Letter,” with almost daily emails that link to published articles of interest to lawyers and others who are concerned with product safety. According to its self-description, Product Safety Daily is a “fair-use news-link service,” and its owner claims not to be responsible for the accuracy, or truthfulness, of linked articles.

Sounds like Facebook; no?

The Guardian is British newspaper, with affiliates in the United States and elsewhere, owned by the Guardian Media Group, which in turn is owned by The Scott Trust Limited. The Scott Trust declares that it exists to produce The Guardian, and “to safeguard the journalistic freedom and liberal values of The Guardian free from commercial or political interference.” Lofty goals, those are. Oberle Communications might feel secure in pointing to an article in The Guardian, on product safety. As far as newspapers are concerned, The Guardian enjoys a good reputation, and has won awards for its investigative journalism, most recently on unlawful government surveillance.

Recently, the Product Safety Letter linked to an article by an Assistant Editor of The Guardian on supposed health effects of plastics. Chukwuma Muanya, “How Plastics Cause Autism, Diabetes, Cancer, Birth Defects,” The Guardian (London, England) (Dec. 13, 2016).

The Mr. Muanya’s headline shouts about causation, but there is nothing in the text of the article to support, even remotely, anyone’s conclusion about causality. The text of the article states, without support, that “[r]ecent studies have associated the rise in autism, diabetes, cancer and birth defects to increase in the use of plastics in making everyday containers, toys and baby teethers or pacifiers.” One would think, hope, pray that The Guardian would know the difference between association and causation, but there is no evidence in this article to support an imputation of knowledge or understanding.

A photograph of baby bottles contains a caption that ramps up the Guardian’s rhetoric and propaganda:

KILLER PLASTICS… The invisible chemical cause neurological and behavioral disorders like autism and attention deficit and hyperactivity disorder (ADHD). They also affect IQ. And they manipulate hormones in a way that can cause cancer, diabetes, male infertility, and endometriosis. PHOTO CREDIT: http://www.viewzone.com/plastic-header.jpg

Wow.  These plastics are bad-ass actors. They manipulate; they cause; they kill.

From causation, to association, the author moves to the most abused journalese term in science reportage: “link”:

“the presence of toxins commonly found in plastic that have been linked to increased risk of cancer, heart disease and obesity.”

Mr. Muanya references, without link, citation, or mention of authors, a study by the American Chemical Society, which apparently reported that infants’ pacifiers contained Bisphenol A (BPA), Bisphenol S (BPS) or Bisephenol F (BPF), and that many also contained parabens, and antimicrobials such as triclosan and triclocarban. But the Society’s paper was about chemical content, not about health consequences. Without any reference or citation to published or unpublished studies, Mr. Muanya labels BPA, BPS, and BPF as “so-called endocrine-disrupting chemicals,” and tells us these chemicals “manipulate hormones in a way that can cause cancer, diabetes, male infertility, and endometriosis,” and that these chemicals “cause neurological and behavioral disorders like autism and ADHD,” and that they “also affect IQ.”

Apropos of nothing having to do with endocrine disrupters, or human disease, Mr. Muanya inserts a discussion of a Japanese study, reportedly published at the PLoS (but without providing link or citation) about how older male mice have offspring that exhibited “hyperlocomotion.” Still, Mr. Muanya, who has been billed as “Head Insight Team, Science & Technology,” does show that journalists can provide, when it suits them, actual references at least to authors by name. Nothing in the mouse study, however, explains the hypocognition exhibited by the Guardian’s science editor, or the shoddy journalistic practices.

So here we have a respected newspaper publishing a news story that at best is internally inconsistent and un-sourced, and which grossly misinterprets or overinterprets the available scientific evidence. Behold fake science news.


[1] 4915 St Elmo Ave, #204, Bethesda, MD 20814; Phone: (301) 215-9236.

Statistical Analysis Requires an Expert Witness with Statistical Expertise

November 13th, 2016

Christina K. Connearne sued her employer, Main Line Hospitals, for age discrimination. Main Line charged Connearne with fabricating medical records, but Connearne replied that the charge was merely a pretext. Connearney v. Main Line Hospitals, Inc., Civ. Action No. 15-02730, 2016 WL 6569292 (E.D. Pa. Nov. 4, 2016) [cited as Connearney]. Connearne’s legal counsel engaged Christopher Wright, an expert witness on “human resources,” for a variety of opinions, most of which were not relevant to the action. Alas, for Ms. Connearne, the few relevant opinions proffered by Wright were unreliable. On a Rule 702 motion, Judge Pappert excluded Wright from testifying at trial.

Although not a statistician, Wright sought to offer his statistical analysis in support of the age discrimination claim. Connearney at *4. According to Judge Pappert’s opinion, Wright had taken just two classes in statistics, but perhaps His Honor meant two courses. (Wright Dep., at 10:3–4.) If the latter, then Wright had more statistical training than most physicians who are often permitted to give bogus statistical opinions in health effects litigation. In 2015, the Medical College Admission Test apparently started to include some very basic questions on statistical concepts. Some medical schools now require an undergraduate course in statistics. See Harvard Medical School Requirements for Admission (2016). Most medical schools, however, still do not require statistical training for their entering students. See Veritas Prep, “How to Select Undergraduate Premed Coursework” (Dec. 5, 2011); “Georgetown College Course Requirements for Medical School” (2016).

Regardless of formal training, or lack thereof, Christopher Wright demonstrated a profound ignorance of, and disregard for, statistical concepts. (Wright Dep., at 10:15–12:10; 28:6–14.) Wright was shown to be the wrong expert witness for the job by his inability to define statistical significance. When asked what he understood to be a “statistically significant sample,” Wright gave a meaningless, incoherent answer:

I think it depends on the environment that you’re analyzing. If you look at things like political polls, you and I wouldn’t necessarily say that serving [sic] 1 percent of a population is a statistically significant sample, yet it is the methodology that’s used in the political polls. In the HR field, you tend to not limit yourself to statistical sampling because you then would miss outliers. So, most HR statistical work tends to be let’s look at the entire population of whatever it is we’re looking at and go from there.”

Connearney at *5 (Wright Dep., at 10:15–11:7). When questioned again, more specifically on the meaning of statistical significance, Wright demonstrated his complete ignorance of the subject:

Q: And do you recall the testimony it’s generally around 85 to 90 employees at any given time, the ER [emergency room]?

A: I don’t recall that specific number, no.

Q: And four employees out of 85 or 90 is about what, 5 or 6 percent?

A: I’m agreeing with your math, yes.

Q: Is that a statistically significant sample?

A: In the HR [human resources] field it sure is, yes.

Q: Based on what?

A: Well, if one employee had been hit, physically struck, by their boss, that’s less than 5 percent. That’s statistically significant.”

Connearney at *5 n.5 (Wright Dep., at 28:6–14)

In support of his opinion about “disparate treatment,” Wright’s report contained nothing than a naked comparison of two raw percentages and a causal conclusion, without any statistical analysis. Even for this simplistic comparison of rates, Wright failed to explain how he obtained the percentages in a way that permitted the parties and the trial court to understand his computation and his comparisons. Without a statistical analysis, the trial court concluded that Wright had failed to show that the disparity in termination rates among younger and older employees was not likely consistent with random chance. See also Moultrie v. Martin, 690 F. 2d 1078 (4th Cir. 1982) (rejecting writ of habeas corpus when petitioner failed to support claim of grand jury race discrimination with anything other than the numbers of white and black grand jurors).

Although Wright gave the wrong definition of statistical significance, the trial court relied upon judges of the Third Circuit who also did not get the definition quite right. The trial court cited a 2010 case in the Circuit, which conflated substantive and statistical significance and then gave a questionable definition of statistical significance:

The Supreme Court has not provided any definitive guidance about when statistical evidence is sufficiently substantial, but a leading treatise notes that ‘[t]he most widely used means of showing that an observed disparity in outcomes is sufficiently substantial to satisfy the plaintiff’s burden of proving adverse impact is to show that the disparity is sufficiently large that it is highly unlikely to have occurred at random.’ This is typically done by the use of tests of statistical significance, which determine the probability of the observed disparity obtaining by chance.”

See Connearney at *6 & n.7, citing and quoting from Stagi v. National RR Passenger Corp., 391 Fed. Appx. 133, 137 (3d Cir. 2010) (emphasis added) (internal citation omitted). Ultimately, however, this was all harmless error on the way to the right result.

Benhaim v. St. Germain – Supreme Court of Canada Wrestles With Probability

November 11th, 2016

On November 10, 2016, the Supreme Court of Canada handed down a divided (four-to-three decision) in a medical malpractice case, which involved statistical evidence, or rather probabilistic inference. Benhaim v. St-Germain, 2016 SCC 48 (Nov. 10, 2016).  The case involved an appeal from a Quebec trial court, and the Quebec Court of Appeal, and some issues peculiar to Canadian lawyers. For one thing, Canadian law does not appear to follow lost-chance doctrine outlined in the American Law Institute’s Restatement. The consequence seems to be that negligent omissions in the professional liability context are assessed for their causal effect by the Canadian “balance of probabilities” standard.

The facts were reasonably clear, although their interpretation were disputed. In November 2005, Mr. Émond was 44 years old, a lifelong non-smoker, and in good health. At his annual physical with general practitioner Dr. Albert Benhaim, Émond had a chest X-ray (CXR). Benhaim at 11, 6. Remarkably, neither the majority nor the dissent commented upon the lack of reasonable medical necessity for a CXR in a healthy, non-smoking 40-something male. Few insurers in the United States would have paid for such a procedure. Maybe Canadian healthcare is more expansive than what we see in the United States.

The radiologist reviewing Mr. Émond’s CXR reported a 1.5 to 2.0 cm solitary lesion, and suggested a review with previous CXRs and a recommendation for a CT scan of the thorax. Dr. Benhaim did not follow the radiologist’s suggestions, but Mr. Émond did have a repeat CXR two months later, on January 17, 2006, which was interpreted as unchanged. A recommendation for a follow-up third CXR in four months was not acted upon. Benhaim at 11, 7. The trial court found that the defendant physicians deviated from the professional standard of care, a finding from which there was no appeal.

Mr. Émond did have a follow-up CXR at the end of 2006, on December 4, 2006, which showed that the solitary lung nodule had grown. Follow up CT and PET scans confirmed that Mr. Émond had Stage IV lung cancer. Id.

The issues in controversy turned on the staging of Mr. Émond’s lung cancer at the time of his first CXR, in November 2005, the medical consequences of the delay in diagnosis. Plaintiffs presented expert witness opinion testimony that Mr. Émond’s lung cancer was only Stage I (or at most IIA), at initial radiographic discovery of a nodule, and that he was at Stage III or IV in December 2006, when CT and PET scans confirmed the actual diagnosis of lung cancer. In the view of plaintiff’s expert witnesses, the delay in diagnosis, and the accompanying growth of the tumor and change from Stage I to IV, dramatically decreased Émond’s chance of survival. Id. At 13, 15-16. Indeed, plaintiff’s expert witnesses opined that had Mr. Émond been timely diagnosed and treated in November 2005, he probably would have been cured.

The defense expert witness, Dr. Ferraro, testified that Mr. Émond’s lung cancer was Stage III or IV in November 2005, when the radiographic nodule was first seen, and his chances of survival at that time were already quite poor. According to Dr. Ferraro, earlier intervention and treatment would probably not have been successful in curing Mr. Émond, and the delay in diagnosis was not a cause of his death.

The trial court rejected plaintiffs’ expert witnesses’ opinions on factual grounds. These witnesses had argued that Mr. Émond’s lung cancer was at Stage I in November 2005 because the lung nodule was less than 3 cm., and because Mr. Émond was asymptomatic and in good health. These three points of contention were clearly unreliable because they were all present in January 2007, when Mr. Émond was diagnosed with Stage IV cancer, according to all the expert witnesses. Every point cited by plaintiffs’ expert witnesses in support of their staging failed to discriminate Stage I from Stage III. In Her Honor’s opinion, the lung cancer was probably Stage III in November 2005, and this staging implied a poor prognosis on all the expert witnesses’ opinions. The failure to diagnose until late 2006 was thus not, on the “balance of probabilities” a cause of death. Id. At 15, ¶21.

The intermediate appellate court reversed on grounds of a presumption of causation, which comes into being when the defendant’s negligence interferes with plaintiff’s ability to show causation, and there is some independent evidence of causation to support the case. I will leave this presumption, which the Supreme Court of Canada held inappropriate on the facts of this case, to Canadian lawyers to debate. What was more interesting was the independent evidence adduced by plaintiffs. This evidence consisted of statistical evidence in the form of generality that 78 percent of fortuitously discovered lung cancers are at Stage I, which in turn is associated with a cure rate of 70 percent. Id. at 18 30.

The plaintiffs’ witnesses hoped to apply this generality to this case, notwithstanding that Émond’s nodule was close to 2 cm. on CXR, that the general statistic was based up more sensitive CT studies, and that Émond had been a non-smoker (which may have influenced tumor growth and staging). Furthermore, there was an additional, ominous finding in Mr. Émond’s first CXR, of hilar prominence, which supported the defense’s differentiation of his case from the generality of fortuitously discovered (presumably small, solitary lung nodules without hilar involvement). Id. at 44 83.

The trial court rejected the inference from the group statistic of 70% survival to the conclusion that Mr. Émond had a 70% probability of survival. Tellingly, there was no discussion of the variance for the 70% figure; nor any mention of relevant subgroups. The Court of Appeals, however, would have turned this statistic into a binding presumption by virtue of accepting the 78 percent as providing strong evidencec that the 70% survival figure pertained to Mr. Émond. The intermediate appellate court would then have taken the group survival rate as providing a more likely than not conclusion about Mr. Émond, while rejecting the defense expert witness’s statistics as mere speculation. Id. at 36 ¶67.

Adopting a skeptical stance with respect to probabilistic evidence, the Supreme Court reversed the Quebec Court of Appeal’s reversal of the trial court’s judgment. The Court cited Richard Wright and Jonathan Cohen’s criticisms of probabilistic evidence (and Cohen’s Gatecrasher’s Paradox), and urged caution in applying class or group statistics to generate probabilities that class members share the group characteristic.

Appellate courts should generally not interfere with a trial judge’s decision not to draw an inference from a general statistic to a particular case. Statistics themselves are silent about whether the particular parties before the court would have conformed to the trend or been an exception from it. Without an evidentiary bridge to the specific circumstances of the plaintiff, statistical evidence is of little assistance. For this reason, such general trends are not determinative in particular cases. What inferences follow from such evidence — whether the generalization that a statistic represents is instantiated in the particular case — is a matter for the trier of fact. This determination must be made with reference to the whole of the evidence.”

Benhaim at 39, 74, 75 (internal citations omitted).

To some extent, the Supreme Court’s comments about statistical evidence were rather wide of there mark. The 78% statistic was based upon a high level of generality, namely all cases, without regard for the size of the radiographically discovered lesion, the manner of discovery (CXR versus CT), presence or absence of hilar pathology, or group or individual’s smoking status. In the context of the facts of the case, however, the trial court clearly had a factual basis for resisting the application of the group statistic (78% fortuitously discovered tumors were Stage I with 70% five-year survival).

The Canadian Supreme Court seems to have navigated these probabilistic waters fairly adeptly, although the majority opinion contains broad brush generalities and inaccuracies, which will, no doubt, show up in future lower court cases. For instance:

This is because the law requires proof of causation only on a balance of probabilities, whereas scientific or medical experts often require a higher degree of certainty before drawing conclusions on causation (p. 330). Simply put, scientific causation and factual causation for legal purposes are two different things.”

Benhaim at 24, 47. The Court cited legal precedent for its observation, and not any scientific treatises. And then, the Supreme Court suggested that all one needs to prevail in a tort case in Canada is a medical expert witness who speculates:

Trial judges are empowered to make legal determinations even where medical experts are not able to express an opinion with certainty.

Benhaim at 37, 72Clearly dictum on the facts of Benhaim, but it seems that judges in Canada are like those in the United States. Black robes empower them to do what mere scientists could not do. If we were to ignore the holding of Benhaim, we might think that all one needs in Canada is a medical expert who speculates.

Talc Litigation – Stop the Madness

November 10th, 2016

Back in September, Judge Johnson, of New Jersey, wrapped up a talc ovarian cancer case in Kemp, and politely excused the case from any further obligations to show up in court. Carl v. Johnson & Johnson, No. ATL-L-6546-14, 2016 WL 4580145 (N.J. Super. Ct. Law Div., Atl. Cty., Sept. 2, 2016) [cited as Carl]. See “New Jersey Kemps Ovarian Cancer – Talc Cases” (Sept. 16, 2016).

In Giannecchini v. Johnson & Johnson, a Missouri jury returned a substantial verdict for plaintiff. The jury, by a 9 to 3 vote, awarded $575,000 for claimed economic loss, and $2 million for non-economic compensatory damages. The jury also found defendant Johnson & Johnson in need of punishment to the tune of $65,000,000, and Imerys Talc America Inc. for $2.5 million. Plaintiffs, having sought $285 million, were no doubt disappointed. The Giannecchini verdict was the third large verdict in the Missouri talc litigation. See Myron Levin, “Johnson & Johnson Hammered Again in Talc-Ovarian Cancer Verdict of $70 Million,” (Oct. 27, 2016); Brandon Lowrey, “J & J, Talc Co. Hit With $70M Baby Powder Cancer Verdict,” Law360 (Oct. 2016).

In his closing argument, Giannecchini’s lawyer, R. Allen Smith, reportedly accused Johnson & Johnson of having “rigged” regulatory agencies to ignore the dangers of talc, and of having “falsified” medical records to hide the problem. Smith implored the jury to “make them stop”; make them “stop this madness.”

Make them stop the madness, indeed. The November 2016 issue of Epidemiology features a publication of the “Sister Study,” which explored whether there was any association between perineal talc use and ovarian cancer. The authors acknowledged, as had Judge Johnson in the Carl case, that some prior case-control studies had found an increased risk of ovarian cancer, but that prospective cohort studies have not confirmed an association. Nicole L. Gonzalez, Katie M. O’Brien, Aimee A. D’Aloisio, Dale P. Sandler, and Clarice R. Weinberg, “Douching, Talc Use, and Risk of Ovarian Cancer,” 27 Epidemiology 797 (2016).

The Sister Study (2003–2009) followed a cohort of 50,884 women whose sisters had been diagnosed with breast cancer. Talc use was ascertained at baseline, before diagnosis of subsequent disease and before any chance for selective recall. The cohort was followed for a median of 6.6 years, in which time there were 154 cases of ovarian cancer during the follow up, available for analysis using Cox’s proportional hazards model. Perineal talc use at baseline was not associated with later ovarian cancer. The authors reported a hazard ratio of 0.73, less than expected, with a 95% confidence interval of 0.44, 1.2.

So, yes, make them stop this madness; close the gate.

Jersey Devil and the Occult (Asbestos)

November 7th, 2016

Amosite Mine in Collingswood, New Jersey

Amosite is a commercial term for the fibrous mineral grunerite (Fe7Si8O22(OH)2). Grunerite and its fibrous form, amosite, does not occur naturally in the sedimentary terrain of southern New Jersey.

And yet, my son and I encountered a treasure trove of amosite in the attic of a house on East Franklin Avenue, in Collingswood, New Jersey, not far from the Cooper River. We were within 24 hours of buying a house to renovate, when an astute building inspector called our attention to unusual attic insulation.1 Most houses built right after World War I have no insulation, or perhaps cork insulation between the attic joists. This house had loose, greyish-brown tufted fibrous material several inches thick spread across the entire attic floor. A local analytical laboratory confirmed that the loose mineral fiber was largely amosite.2

Asbestos in Attic1

The where, when, how, and why there were many cubic yards of amosite in the attic of a house in Collingswood is a mystery, but one clue is that there are several shipyards nearby, including what once was the New York Shipbuilding & Dry Dock, the Philadelphia Naval Shipyard, and Cramps Shipyard. My unverified hunch is that after World War II, these shipyards gave away, or sold, loose amosite asbestos, which was no longer needed, and the use of which was discouraged by Naval regulations in favor of molded, pre-formed asbestos-containing insulation. The presence of such loose, fibrous amphibole asbestos in residential construction raises serious issues for exposure assessment in epidemiologic studies and in litigation cases, both of which proceed on the glib assumption that workplace exposures are the only meaningful asbestos exposures to be measured. Occult exposure to amosite or even worse, crocidolite, may well explain any number of isolated “black swan” cases of mesothelioma among workers with limited exposure to chrysotile. The amosite in the house’s attic was truly occult – hidden and scary – to the sellers of the house and to us as potential buyers.3 The contract for the sale of the house fell apart over the sellers’ and buyers’ inability to agree upon what the appropriate remediation would be.

One of the serious disservices performed by Dr. Irving Selikoff was his conflation of the various asbestos mineral types as equally dangerous.4 His motivation was quite transparent. He and his staff were working closely with plaintiffs’ counsel, and other plaintiffs’ expert witnesses, in asbestos personal injury and property damages lawsuits. Although the amphibole asbestos minerals were known to be much more dangerous than chrysotile (white asbestos), the mining and distributing companies were mostly South African, and judgment proof in United States courts. The lawsuit industry required propagating the myth of equal risk in order to keep the chrysotile mining and milling companies from avoiding liability by drawing scientific comparisons between and among the different fiber types.film Despicable Me 3


1 Inspections Plus, LLC, Clementon, New Jersey 08021.

EMSL Analytical, Inc., 200 Route 130 North, Cinnaminson, NJ 08077.

See alsoAsbestos isn’t just in old fibro sheeting it can be in everything from fences to carpets,” The Courier Mail (No. 28, 2015) (“Bulk loose fill insulation is now rarely found but may be encountered unexpectedly, e.g. DIY lost insulation and fire-stop packing around cables between floors.”).

SeeHide the Substantial Factors in Asbestos Litigation”; “Selikoff and the Mystery of the Disappearing Amphiboles.”

Omalu and Science — A Bad Weld

October 22nd, 2016

Bennet Omalu is a star of the silver screen and in the minds of conspiratorial thinkers everywhere. Actually Will Smith[1] stood in for Omalu in the movie Concussion (2015), but Smith’s skills as an actor bring out the imaginary best in Omalu’s persona.

Chronic Traumatic Encephalopathy (CTE) is the name that Bennet Omalu, a pathologist, gave to the traumatic brain injuries resulting from repeated concussions experienced by football players.[2]  The concept is not particularly new; the condition of dementia pugilistica had been described previously in boxers. What was new with Omalu was his fervid imagination and his conspiratorial view of the world.[3] The movie Concussion  actually gives an intimation of some of the problems in Omalu’s scientific work.  See, e.g., Daniel Engber, “Concussion Lies: The film about the NFL’s apparent CTE epidemic feeds the pervasive national myths about head trauma,” Slate (Dec. 21 2015); Bob Hohler, “BU rescinds award to ‘Concussion’ trailblazer,” Boston Globe (June 16, 2016).

Omalu has more dubious claims to fame. He has not cabined his unique, stylized approach to science to the subject of head trauma. Although Omalu is a pathologist, not a clinician, Omalu recently he weighed in with observations that Hillary Clinton was definitely unwell. Indeed, Bennet Omalu has now made a public nuisance of himself by floating conspiratorial theories that Hilary Clinton has been poisoned. Cindy Boren, “The man who discovered CTE thinks Hillary Clinton may have been poisoned,” Wash. Post (Sept. 12, 2016); Christine Rushton, “‘Concussion’ doctor suggests without evidence that poison a factor in Clinton’s illness,” Los Angeles Times. (Sept. 13, 2016).

In the courtroom, in civil cases, Omalu has a poor track record for scientific rigor. The United States Court of Appeals, for the Third Circuit, which can be tough and skeptical of Rule 702 expert witness exclusions, readily affirmed an exclusion of Omalu’s testimony in Pritchard v. Dow Agro Sciences, 705 F. Supp. 2d 471 (W.D. Pa. 2010), aff’d, 430 F. App’x 102, 104 (3d Cir. 2011). In Pritchard, Omalu was caught misrepresenting the statistical data from published studies in a so-called toxic tort case. Fortunately, robust gatekeeping was able to detoxify the proffered testimony.[4]

More recently, Omalu was at it again in a case in which a welder claimed that exposure to welding and solvent fumes caused him to develop Parkinson’s disease. Brian v. Association of Independent Oil Distributors, No. 2011-3413, Westmoreland Cty. Ct. Common Pleas, Order of July 18, 2016. [cited here as Order].

James G. Brian developed Parkinson disease (PD), after 30 years of claimed exposure to welding and solvent fumes. It is America, so Brian sued Lincoln Electric and various chemical companies on his theory that his PD was caused by his welding and solvent exposures, either alone or together. Now although manganese in very high exposures can cause a distinctive movement disorder, manganism, manganese in welding fume does not cause PD in humans.[5] Omalu was undeterred, however, and proceeded by conjecturing that welding fume interacted with solvent fumes to cause Brian’s PD.

At the outset of the case, Brian intended to present testimony of expert witnesses, Bennet Omalu, Richard A. Parent, a toxicologist, and Jordan Loyal Holtzman, a pharmacologist.  Parent commenced giving a deposition, but became so uncomfortable with his own opinion that he put up a white flag at the deposition, and withdrew from the case.  On sober reflection, Holtzman also withdrew from the case.

Omalu was left alone, to make the case on general and specific causation. Defendant Lincoln Electric and others moved to exclude Omalu, under Pennsylvania’s standard for admissibility of expert witness opinion testimony, which is based upon a patch-work version of Frye v. United States, 293 F. 1013 (D. C. Cir. 1923).

Invoking a quirky differential diagnosis, and an idiosyncratic reading of Sir Austin Bradford Hill’s work, Omalu defended his general and specific causation opinions. After briefing and a viva voce hearing, President Judge Richard E. McCormick ruled that Omalu had misapplied both methodologies in reaching his singular opinion. Order at 8.

Omalu did not make the matter easy for Judge McCormick. There was no question that Brian had PD.  Every clinician who had examined him made the diagnosis. Knowing that PD is generally regarded as idiopathic, with no known cause, Omalu thought up a new diagnosis: chronic toxic encephalopathy.

When confronted with the other clincians’ diagnoses, Omalu did not dispute the diagnosis of PD. Instead, he attempted to evade the logical implications of the diagnosis of idiopathic PD by continually trying to change the terminology to suit his goals. Judge McCormick saw through Omalu’s semantic evasions, which bolstered the case for excluding him at trial.

Madness to His Method

In scrutinizing Omalu’s opinions, Judge McCormick found more madness than method. Omalu claimed that he randomly selected studies to rely upon, and he failed to explain the strengths and weaknesses of the cited studies when he formed his opinion.

Despite his claim to have randomly selected studies, Omalu remarkably managed to ignore epidemiologic studies that were contrary to his causal conclusions. Order at 9.  Indeed, Omalu missed more than half the published studies on welding and PD.  Not surprisingly, Omalu did not record his literature search; nor could explain, in deposition or at the court hearing, his inclusionary or exclusionary criteria for pertinent studies. Id. at 10. When confronted about his “interaction” opinions concerning welding and solvent fumes, Omalu cited several studies, none of which measured or assessed combined exposures.  Some of the papers flatly contradicted Omalu’s naked assertions. Id. at 9.

Judge McCormick rejected Omalu’s distorted invocation of the Bradford Hill factors to support a causal association when no association had yet been found. The court quoted from the explanation provided by Prof. James A. Mortimer, the defense neuroepidemiologist, at the Frye hearing:

“First, the Bradford Hill criteria should not be applied until you have ruled out a chance association, which [Omalu] did not do. In fact, as I will point out, carefully done epidemiologic studies will show there is no increased risk of Parkinson’s disease with exposure to welding fume and/or solvents, therefore the application of these criteria is inappropriate.”

Order at 11, citing to and quoting from Frye Hearing at 318 (Oct. 14, 2015).

When cornered, Omalu asserted that he never claimed that Mr. Brian’s PD was caused by welding or solvents; rather his contention was simply that occupational exposures had created a “substantial increased risk” of PD. Id. at 14. Risk creation, however, is not causation; and Omalu had not even shown unquantified evidence of increased risk before Brian developed PD. The court found that Omalu had not used any appropriate methodology with respect to general causation. Id. at 14.

Specific Causation

Undaunted, Omalu further compromised his credibility by claiming that Bradford Hill’s factors allowed him to establish specific causation, even in the absence of general causation. Id. at 12. Omalu suggested that he had performed a differential diagnosis, even though he is not a clinician, and as a pathologist had not evaluated any brain tissue. Id. at 10. The court deftly saw through these ruses. Id. at 11.

Judge McCormick’s conclusion should be a precautionary lesson to future courts that must gatekeep Omalu’s opinions, or Omalu-like opinions:

“In conclusion, we agree with the Defendants that while Dr. Omalu’s stated methodology in this case is generally accepted in the medical and scientific community, Dr. Omalu failed to properly apply it. He misused and demonstrated a lack of understanding of the Bradford Hill criteria and the Schaumburg criteria when he attempted to employ these methodologies to conduct a differential diagnosis or differential etiology analysis.”

Id. at 16. Gatekeeping is sometimes viewed as more difficult in Frye jurisdictions, but the exclusion of Omalu shows that it can be achieved when expert witnesses deviate materially from scientifically standard methodology.


[1] For other performances by Will Smith in this vein, see Six Degrees of Separation (1993); Focus (2015).

[2] See Bennet I. Omalu, Steven DeKosky, Ryan Minster, M. Ilyas Kamboh, Ronald Hamilton, Cyril H. Wecht, “Chronic Traumatic Encephalopathy in a National Football League Player, Part I,” 57 Neurosurgery 128 (2005); Bennet I. Omalu, Steven DeKosky, Ronald Hamilton, Ryan Minster, M. Ilyas Kamboh, Abdulrezak Shakir, and Cyril H. Wecht, “Chronic Traumatic Encephalopathy in a National Football League Player, Part II,” 59 Neurosurgery 1086 (2006).

[3] See Jeanne Marie Laskas, “The Doctor the NFL Tried to Silence,” Wall St. J. (Nov. 24, 2015).

[4] SeePritchard v. Dow Agro – Gatekeeping Exemplified” (Aug. 25, 2014).

[5] See, e.g., Marianne van der Mark, Roel Vermeulen, Peter C.G. Nijssen, Wim M. Mulleners, Antonetta M.G. Sas, Teus van Laar, Anke Huss, and Hans Kromhout, “Occupational exposure to solvents, metals and welding fumes and risk of Parkinson’s disease,” 21 Parkinsonism Relat Disord. 635 (2015); James Mortimer, Amy Borenstein & Laurene Nelson, Associations of Welding and Manganese Exposure with Parkinson’s Disease: Review and Meta-Analysis, 79 Neurology 1174 (2012); Joseph Jankovic, “Searching for a relationship between manganese and welding and Parkinson’s disease,” 64 Neurology 2012 (2005).

 Another Haack Article on Daubert

October 14th, 2016

In yet another law review article on Daubert, Susan Haack has managed mostly to repeat her past mistakes, while adding a few new ones to her exegesis of the law of expert witnesses. See Susan Haack, “Mind the Analytical Gap! Tracing a Fault Line in Daubert,” 654 Wayne L. Rev. 653 (2016) [cited as Gap].  Like some other commentators on the law of evidence, Haack purports to discuss this area of law without ever citing or quoting the current version of the relevant statute, Federal Rule of Evidence 703. She pours over Daubert and Joiner, as she has done before, with mostly the same errors of interpretation. In discussing Joiner, Haack misses the importance of the Supreme Court’s reversal of the 11th Circuit’s asymmetric standard of Rule 702 trial court decisions. Gap at 677. And Haack’s analysis of this area of law omits any mention of Rule 703, and its role in Rule 702 determinations. Although you can safely skip yet another Haack article, you should expect to see this one, along with her others, cited in briefs, right up there with David Michael’s Manufacturing Doubt.

A Matter of Degree

“It may be said that the difference is only one of degree. Most differences are, when nicely analyzed.”[1]

Quoting Holmes, Haack appears to complain that the courts’ admissibility decisions on expert witnesses’s opinions are dichotomous and categorical, whereas the component parts of the decisions, involving relevance and reliability, are qualitative and gradational. True, true, and immaterial.

How do you boil a live frog so it does not jump out of the water?  You slowly turn up the heat on the frog by degrees.  The frog is lulled into complacency, but at the end of the process, the frog is quite, categorically, and sincerely dead. By a matter of degrees, you can boil a frog alive in water, with a categorically ascertainable outcome.

Humans use categorical assignments in all walks of life.  We rely upon our conceptual abilities to differentiate sinners and saints, criminals and paragons, scholars and skells. And we do this even though IQ, and virtues, come in degrees. In legal contexts, the finder of fact (whether judge or jury) must resolve disputed facts and render a verdict, which will usually be dichotomous, not gradational.

Haack finds “the elision of admissibility into sufficiency disturbing,” Gap at 654, but that is life, reason, and the law. She suggests that the difference in the nature of relevancy and reliability on the one hand, and admissibility on the other, creates a conceptual “mismatch.” Gap at 669. The suggestion is rubbish, a Briticism that Haack is fond of using herself.  Clinical pathologists may diagnose cancer by counting the number of mitotic spindles in cells removed from an organ on biopsy.  The number may be characterized by as a percentage of cells in mitosis, a gradational that can run from zero to 100 percent, but the conclusion that comes out of the pathologist’s review is a categorical diagnosis.  The pathologist must decide whether the biopsy result is benign or malignant. And so it is with many human activities and ways of understanding the world.

The Problems with Daubert (in Haack’s View)

Atomism versus Holism

Haack repeats a litany of complaints about Daubert, but she generally misses the boat.  Daubert was decisional law, in 1993, which interpreted a statute, Federal Rule of Evidence 702.  The current version of Rule 702, which was not available to, or binding on, the Court in Daubert, focuses on both validity and sufficiency concerns:

A witness who is qualified as an expert by knowledge, skill, experience, training, or education may testify in the form of an opinion or otherwise if:

(a) the expert’s scientific, technical, or other specialized knowledge will help the trier of fact to understand the evidence or to determine a fact in issue;

(b) the testimony is based on sufficient facts or data;

(c) the testimony is the product of reliable principles and methods; and

(d) the expert has reliably applied the principles and methods to the facts of the case.

Subsection (b) renders most of Haack’s article a legal ignoratio elenchi.

Relative Risks Greater Than Two

Modern chronic disease epidemiology has fostered an awareness that there is a legitimate category of disease causation that involves identifying causes that are neither necessary nor sufficient to produce their effects. Today it is a commonplace that an established cause of lung cancer is cigarette smoking, and yet, not all smokers develop lung cancer, and not all lung cancer patients were smokers.  Epidemiology can identify lung cancer causes such as smoking because it looks at stochastic processes that are modified from base rates, or population rates. This model of causation is not expected to produce uniform and consistent categorical outcomes in all exposed individuals, such as lung cancer in all smokers.

A necessary implication of categorizing an exposure or lifestyle variable as a “cause,” in this way is that the evidence that helps establish causation cannot answer whether a given individual case of the outcome of interest was caused by the exposure of interest, even when that exposure is a known cause.  We can certainly say that the exposure in the person was a risk for developing the disease later, but we often have no way to make the individual attribution.  In some cases, more the exception than the rule, there may be an identified mechanism that allows the detection of a “fingerprint” of causation. For the most part, however, risk and cause are two completely different things.

The magnitude of risk, expressed as a risk ratio, can be used to calculate a population attributable risk, which can in turn, with some caveats, be interpreted as approximating a probability of causation.  When the attributable risk is 95%, as it would be for people with light smoking habits and lung cancer, treating the existence of the prior risk as evidence of specific causation seems perfectly reasonable.  Treating a 25% attributable risk as evidence to support a conclusion of specific causation, without more, is simply wrong.  A simple probabilistic urn model would tell us that we would most likely be incorrect if we attributed a random case to the risk based upon such a low attributable risk.  Although we can fuss over whether the urn model is correct, the typical case in litigation allows no other model to be asserted, and it would be the plaintiffs’ burden of proof to establish the alternative model in any event.

As she has done many times before, Haack criticizes Judge Kozinski’s opinion in Daubert,[2] on remand, where he entered judgment for the defendant because further proceedings were futile given the small relative risks claimed by plaintiffs’ expert witnesses.  Those relative risks, advanced by Shanna Swan and Alan Done, lacked reliability; they were the product of a for-litigation juking of the stats that were the original target of the defendant and the medical community in the Supreme Court briefing.  Judge Kozinski simplified the case, using a common legal strategem of assuming arguendo that general causation was established.  With this assumption favorable to plaintiffs made, but never proven or accepted, Judge Kozinski could then shine his analytical light on the fatal weakness of the specific causation opinions.  When all the hand waving was put to rest, all that propped up the plaintiff’s specific causation claim was the existence of a claimed relative risk, which was less than two. Haack is unhappy with the analytical clarity achieved by Kozinski, and implicitly urges a conflation of general and specific causation so that “all the evidence” can be counted.  The evidence of general causation, however, does not advance plaintiff’s specific causation case when the nature of causation is the (assumed) existence of a non-necessary and non-sufficient risk. Haack quotes Dean McCormick as having observed that “[a] brick is not a wall,” and accuses Judge Kozinski of an atomistic fallacy of ruling out a wall simply because the party had only bricks.  Gap at 673, quoting from Charles McCormick, Handbook of the Law of Evidence at 317 (1954).

There is a fallacy opposite to the atomistic fallacy, however, namely the holistic “too much of nothing fallacy” so nicely put by Poincaré:

“Science is built up with facts, as a house is with stones. But a collection of facts is no more a science than a heap of stones is a house.”[3]

Poincaré’s metaphor is more powerful than Haack’s call for holistic evidence because it acknowledges that interlocking pieces of evidence may cohere as a building, or they may be no more than a pile of rubble.  Poorly constructed walls may soon revert to the pile of stones from which they came.

Haack proceeds to criticize Judge Kozinski for his “extraordinary argument” that

“(a) equates degrees of proof with statistical probabilities;

(b) assesses each expert’s testimony individually; and

(c) raises the standard of admissibility under the relevance prong to the standard of proof.”

Gap at 672.

Haack misses the point that a low relative risk, with no other valid evidence of specific causation, translates into a low probability of specific causation, even if general causation were apodictically certain. Aggregating the testimony, say between  animal toxicologists and epidemiologists, simply does not advance the epistemic ball on specific causation because all the evidence collectively does not help identify the cause of Jason Daubert’s birth defects on the very model of causation that plaintiffs’ expert witnesses advanced.

All this would be bad enough, but Haack then goes on to commit a serious category mistake in confusing the probabilistic inference (for specific causation) of an urn model with the prosecutor’s fallacy of interpreting a random match probability as the evidence of innocence. (Or the complement of the random match probability as the evidence of guilt.) Judge Kozinski was not working with random match probabilities, and he did not commit the prosecutor’s fallacy.

Take Some Sertraline and Call Me in the Morning

As depressing as Haack’s article is, she manages to make matters even gloomier by attempting a discussion of Judge Rufe’s recent decision in the sertraline birth defects litigation. Haack’s discussion of this decision illustrates and typifies her analyses of other cases, including various decisions on causation opinion testimony on phenylpropanolamine, silicone, bendectin, t-PA, and other occupational, environmental, and therapeutic exposures. Maybe 100 mg sertraline is in order.

Haack criticizes what she perceives to be the conflation of admissibility and sufficiency issues in how the sertraline MDL court addressed the defendants’ motion to exclude the proffered testimony of Dr. Anick Bérard. Gap at 683. The conflation is imaginary, however, and the direct result of Haack’s refusal to look at the specific, multiple methodological flaws in plaintiffs’ expert witness Anick Bérard’s methodologic approach taken to reach a causal conclusion. These flaws are not gradational, and they are detailed in the MDL court’s opinion[4] excluding Anick Bérard. Haack, however, fails to look at the details. Instead Haack focuses on what she suggests is the sertraline MDL court’s conclusion that epidemiology was necessary:

“Judge Rufe argues that reliable testimony about human causation should generally be supported by epidemiological studies, and that ‘when epidemiological studies are equivocal or inconsistent with a causation opinion, experts asserting causation opinions must thoroughly analyze the strengths and weaknesses of the epidemiological research and explain why [it] does not contradict or undermine their opinion’. * * *

Judge Rufe acknowledges the difference between admissibility and sufficiency but, when it comes to the part of their testimony he [sic] deems inadmissible, his [sic] argument seems to be that, in light of the defendant’s epidemiological evidence, the plaintiffs’ expert testimony is insufficient.”

Gap at 682.

This précis is a remarkable distortion of the material facts of the case. There was no plaintiffs’ epidemiology evidence and defendants’ epidemiologic evidence.  Rather there was epidemiologic evidence, and Bérard ignored, misreported, or misrepresented a good deal of the total evidentiary display. Bérard embraced studies when she could use their risk ratios to support her opinions, but criticized or ignored the same studies when their risk ratios pointed in the direction of no association or even of a protective association. To add to this methodological duplicity, Anick Bérard published many statements, in peer-reviewed journals, that sertraline was not shown to cause birth defects, but then changed her opinion solely for litigation. The court’s observation that there was a need for consistent epidemiologic evidence flowed not only from the conception of causation (non-necessary, not sufficient), but from Berard’s and her fellow plaintiffs’ expert witnesses’ concessions that epidemiology was needed.  Haack’s glib approach to criticizing judicial opinions fails to do justice to the difficulties of the task; nor does she advance any meaningful criteria to separate successful from unsuccessful efforts.

In attempting to make her case for the gradational nature of relevance and reliability, Haack acknowledges that the details of the evidence relied upon can render the evidence, and presumably the conclusion based thereon, more or less reliable.  Thus, we are told that epidemiologic studies based upon self-reported diagnoses are highly unreliable because such diagnoses are often wrong. Gap at 667-68. Similarly, we are told that in consider a claim that a plaintiff suffered an adverse effect from a medication, that epidemiologic evidence showing a risk ratio of three would not be reliable if it had inadequate or inappropriate controls,[5] was not double blinded, and lacked randomization. Gap at 668-69. Even if the boundaries between reliable and unreliable are not always as clear as we might like, Haack fails to show that the gatekeeping process lacks a suitable epistemic, scientific foundation.

Curiously, Haack calls out Carl Cranor, plaintiffs’ expert witness in the Milward case, for advancing a confusing, vacuous “weight of the evidence” rationale for the methodology employed by the other plaintiffs’ causation expert witnesses in Milward.[6] Haack argues that Cranor’s invocation of “inference to the best explanation” and “weight of the evidence” fails to answer the important questions at issue in the case, namely how to weight the inference to causation as strong, weak, or absent. Gap at 688 & n. 223, 224. And yet, when Haack discusses court decisions that detailed voluminous records of evidence about how causal inferences should be made and supported, she flies over the details to give us confused, empty conclusions that the trial courts conflated admissibility with sufficiency.


[1] Rideout v. Knox, 19 N.E. 390, 392 (Mass. 1892).

[2] Daubert v. Merrell Dow Pharm., Inc., 43 F.3d 1311, 1320 (9th Cir. 1995).

[3] Jules Henri Poincaré, La Science et l’Hypothèse (1905) (chapter 9, Les Hypothèses en Physique)( “[O]n fait la science avec des faits comme une maison avec des pierres; mais une accumulation de faits n’est pas plus une science qu’un tas de pierres n’est une maison.”).

[4] In re Zoloft Prods. Liab. Litig., 26 F. Supp. 3d 466 (E.D. Pa. 2014).

[5] Actually Haack’s suggestion is that a study with a relative risk of three would not be very reliable if it had no controls, but that suggestion is incoherent.  A risk ratio could not have been calculated at all if there had been no controls.

[6] Milward v. Acuity Specialty Prods., 639 F.3d 11, 17-18 (1st Cir. 2011), cert. denied, 132 S.Ct. 1002 (2012).

Toward Better Definitions & Assessments of Conflicts of Interest in Science

October 13th, 2016

In capitalist and in communist societies, industry has a responsibility to conduct research on health and safety concerns. For corporate research to be credible, it should be methodologically sound, transparent, and available. So should non-corporate research.

In the United States and in Europe, much important research is done only by private corporate sponsors. Of course, private funding of research raises questions about potential conflicts of interest (COIs), but political frenzy over such COIs is a serious diversion often motivated by a desire to live in a faith-based world in which industry and chemicals are demonized far beyond what even precautionary principles would support. Susan Sarandon’s superstitions about the herbicide Round Up come to mind.

Although members of the Lobby, the Litigation Industry, and the environmental groups of the less rational kind frequently find their knickers in a knot over corporate scientific COIs, the fact is that publicly funded and self-styled “public interest” research is often afflicted by non-financial COIs that are more mind numbing than the anticipation of money.[1] Some groups, such as the Society of Toxicology, have implemented more complete definitions of COI to include advocacy and positional conflicts.[2]

Joseph Huggard recently posted an interesting piece at Innovative Science Solutions’ blog, on the need to “Follow the Science Not the Money,” to remind us of the first principle, that research should be evaluated primarily on its merits, and not on its perceived or imagined COIs. Huggard likens the current situation of proliferating ad hominem attacks to less talented footballer who approaches the game thinking “If you can’t play the ball, play the man.” (Or if you are Donald Trump, then play the ref.)

Huggard cites an interesting meta-observational study in which researchers attempted to obtain research protocols from epidemiologic studies on phthalate exposure. Not surprisingly, researchers who published studies that purported to find adverse associations involving phthalates were three times less likely to share their study protocols.[3] A request for study protocols is hardly an intrusive or difficult request to meet. Of course, there are “reasons,” such as researchers’ desire to privilege their methods when so-called positive studies will serve as stepping stones to funding for future studies, but future studies should be conditioned on making past protocols available, and the failure to share protocols generally is pretty dubious scientific behavior.

As grim as the situation has been in the United States, Huggard suggests that an upcoming Luxembourg Chamber of Commerce conference this week, on October 10th, will seek to redress the imbalance in European COI rhetoric by calling attention to the importance of non-financial conflicts and biases.[4] Let’s hope so, but the more likely outcome is that the Chamber of Commerce’s sponsorship will disqualify any conference recommendations among the “political scientists,” those who practice science to achieve political aims.


[1] Simon N. Young, “Bias in the research literature and conflict of interest: an issue for publishers, editors, reviewers and authors, and it is not just about the money,” 34 J. Psychiatry & Neurosci. 412 (2009) (positional conflicts, based upon prior beliefs, can create much more intractable bias than financial rewards). See also “Conflict Over Conflicts of Interest” (July 12, 2015); “Conflicts of Interest in Asbestos Studies – the Plaintiffs’ Double Standard” (Sept. 24, 2013); “Conflicted Public Interest Groups” (Nov 3, 2013).

[2] See, e.g., Society of Toxicology, Conflict of Interest, Bias and Advocacy: Definitions and Statements.

[3] Gerard M.H. Swaen, Miriam J.E. Urlings, and Maurice P. Zeegers, “Outcome reporting bias in observational epidemiology studies on phthalates,” 26 Ann. Epidemiol. 597E4 (2016)

[4] “Managing Bias and Conflict of Interest: Ensuring that Policy-Makers and Regulators Access the Best Quality Scientific Advice,” at the Chambre de Commerce Luxembourg, at 7, rue Alcide de Gasperi, Luxembourg (Kirchberg).

New Jersey Kemps Ovarian Cancer – Talc Cases

September 16th, 2016

Gatekeeping in many courtrooms has been reduced to requiring expert witnesses to swear an oath and testify that they have followed a scientific method. The federal rules of evidence and most state evidence codes require more. The law, in most jurisdictions, requires that judges actively engage with, and inspect, the bases for expert witnesses’ opinions and claims to determine whether expert witnesses who want to heard in a courtroom have actually, faithfully followed a scientific methodology.  In other words, the law requires judges to assess the scientific reasonableness of reliance upon the actual data cited, and to evaluate whether the inferences drawn from the data, to reach a stated conclusion, are valid.

We are getting close to a quarter of a century since the United States Supreme Court outlined the requirements of gatekeeping, in Daubert v. Merrell Dow Pharms., Inc., 509 U.S. 579 (1993). Since the Daubert decision, the Supreme Court’s decisional law, and changes in the evidence rules themselves, have clarified the nature and extent of the inquiry judges must conduct into the reasonable reliance upon facts and data, and into the inferential steps leading to a conclusion.  And yet, many judges resist, and offer up excuses and dodges for shirking their gatekeeping obligations.  See generally David E. Bernstein, “The Misbegotten Judicial Resistance to the Daubert Revolution,” 89 Notre Dame L. Rev. 27 (2013).

There is a courtroom in New Jersey, in which gatekeeping is taken seriously from beginning to end.  There is at least one trial judge who encourages and even demands that the expert witnesses appear and explain their methodologies and actually show their methodological compliance.  Judge Johnson first distinguished himself in In re Accutane, No. 271(MCL), 2015 WL 753674, 2015 BL 59277 (N.J.Super. Law Div. Atlantic Cty. Feb. 20, 2015).[1] And more recently, in two ovarian cancer cases, Judge Johnson dusted two expert witnesses, who thought they could claim their turn in the witness chair by virtue of their credentials and some rather glib hand waving. Judge Johnson conducted the New Jersey analogue of a Federal Rule of Evidence 104(a) Daubert hearing, as required by the New Jersey Supreme Court’s decision in Kemp v. The State of New Jersey, 174 N.J. 412 (2002). The result was disastrous for the two expert witnesses who opined that use of talcum powder by women causes ovarian cancer. Carl v. Johnson & Johnson, No. ATL-L-6546-14, 2016 WL 4580145 (N.J. Super. Ct. Law Div., Atl. Cty., Sept. 2, 2016) [cited as Carl].

Judge Johnson obviously had a good epidemiology teacher in Professor Stephen Goodman, who testified in the Accutane case.  Against this standard, it is easy to see how the plaintiffs’ talc expert witnesses, Drs. Daniel Cramer and Dr. Graham Colditz, fell “significantly” short. After presiding over seven days of court hearings, and reviewing extensive party submissions, including the actual studies relied upon by the expert witnesses and the parties, Judge Johnson made no secret of his disappointment with the lack of rigor in the analyses proffered by Cramer and Colditz:

“Throughout these proceedings the court was disappointed in the scope of Plaintiffs’ presentation; it almost appeared as if counsel wished the court to wear blinders. Plaintiffs’ two principal witnesses on causation, Dr. Daniel Cramer and Dr. Graham Colditz, were generally dismissive of anything but epidemiological studies, and within that discipline of scientific investigation they confined their analyses to evidence derived only from small retrospective case-control studies. Both witnesses looked askance upon the three large cohort studies presented by Defendants. As confirmed by studies listed at Appendices A and B, the participants in the three large cohort studies totaled 191,090 while those case-control studies advanced by Plaintiffs’ witnesses, and which were the ones utilized in the two meta-analyses performed by Langseth and Terry, total 18,384 participants. As these proceedings drew to a close, two words reverberated in the court’s thinking:

“narrow and shallow.” It was almost as if counsel and the expert witnesses were saying, Look at this, and forget everything else science has to teach as.

Carl at *12.

Judge Johnson did what for so many judges is unthinkable; he looked behind the curtain put up by highly credentialed Oz expert witnesses in his courtroom. What he found was unexplained, unjustified selectivity in their reliance upon some but not all the available data, and glib conclusions that gloss over significant limits in the resolving power of the available epidemiologic studies. Judge Johnson was particularly unsparing of Graham Colditz, a capable scientist, who deviated from the standards he set for himself in the work he had published in the scientific community:

“Dr. Graham Colditz is a brilliant scientist and a dazzling witness. His vocal inflection, cadence, and adroit use of histrionics are extremely effective. Dr. Colditz’s reputation for his breadth of knowledge about cancer and the esteem in which he is held by his peers is well deserved. Yet, at times, it seemed that issues raised in these proceedings, and the questions posed to him, were a bit mundane for a scientist of his caliber.”

Carl at *15. Dr. Colditz and the plaintiffs’ cause were not helped by Dr. Colditz’s own previous publications of studies and reviews that failed to support any “substantial association between perineal talc use and ovarian cancer risk overall,” and failed to conclude that talc was even a “risk factor” for ovarian cancer.  Carl at *18.

Relative Risk Size

Many courts have fumbled their handling of the issue whether applicable relative risks must exceed two before fact finders may infer specific causation between claimed exposures and specific diseases. There certainly can be causal associations that involve relative risks between 1.0, up to and including 2.0.  Eliminating validity concerns may be more difficult with such smaller relative risks, but there is nothing theoretically insuperable about having a causal association based upon such small relative risks. Judge Johnson apparently saw the diversity of opinions on this relative risk issue, many of which opinions are stridently maintained, and thoroughly fallacious.

Judge Johnson ultimately did not base his decision, with respect to general or specific causation, on the magnitude of relative risk, or the covering Bradford Hill factor of “strength of association.” Dr. Cramer appropriately acknowledged that his meta-analysis result, of an odds ratio of 1.29 was “weak,” Carl at *19, and Judge Johnson was critical of Dr. Colditz for failing to address the lack of strength of the association, and for engaging in a constant refrain that the association was “significant,” which is a precision not a size estimate for the measurement. Carl at *17.

Aware of the difficulty that New Jersey appellate courts have had with the issues surrounding relative risks greater than two, Judge Johnson was realistic to steer clear of any specific judicial reliance on the small size of the relative risk.  His Honor’s prudence is unfortunate however because ultimately small relative risks, even assuming that general causation is established, do nothing to support specific causation.  Indeed, relative risks of 1.29 (and odds ratios generally overstate the size of the underlying relative risk) would on a stochastic model support the conclusion that specific causation was less than 50% probable.  Critics have pointed out that risk may not be stochastically distributed, which is a great point, except that

(1) plaintiffs often have no idea how the risk, if real, is distributed in the observed sample, and

(2) the upshot of the point is that even for relative risks greater than 2.0, there is no warrant for inferring specific causation in a given case.

Judge Johnson did wade into the relative risk waters by noting that when relative risks were “significantly” less than two, establishing biological plausibility became essential.  Carl at *11.  This pronouncement is muddled on at least two fronts.  First, the relative risk scale is a continuum, and there is no standard reference for what relative risks greater than 1.0 are “significantly” less than 2.0.  Presumably, Judge Johnson thought that 1.29 was in the “significantly less than 2.0” range, but he did not say so; nor did he cite a source that supported this assessment. Perhaps he was suggesting that the upper bound of some meta-analysis was less than two. Second, and more troubling, the claim that biological plausibility becomes “essential” in the face of small relative risks is also unsupported. Judge Johnson does not cite any support for this claim, and I am not aware of any.  Elsewhere in his opinion, Judge Johnson noted that

“When a scientific rationale doesn’t exist to explain logically the biological mechanism by which an agent causes a disease, courts may consider epidemiologic studies as an alternate [sic] means of proving general causation.”

Carl at *8. So it seems that biological plausibility is not essential after all.

This glitch in the Carl opinion is likely of no lasting consequence, however, because epidemiologists are rarely at a loss to posit some biologically plausible mechanism. As the Dictionary of Epidemiology explains the matter:

“The causal consideration that an observed, potentially causal association between an exposure and a health outcome may plausibly be attributed to causation on the basis of existing biomedical and epidemiological knowledge. On a schematic continuum including possible, plausible, compatible, and coherent, the term plausible is not a demanding or stringent requirement, given the many biological mechanisms that often can be hypothesized to underlie clinical and epidemiological observations; hence, in assessing causality, it may be logically more appropriate to require coherence (biological as well as clinical and epidemiological). Plausibility should hence be used cautiously, since it could impede development or acceptance of new knowledge that does not fit existing biological evidence, pathophysiological reasoning, or other evidence.”

Miquel Porta, et al., eds., “Biological plausibility,” in A Dictionary of Epidemiology at 24 (6th ed. 2014). Most capable epidemiologists have thought up half a dozen biologically plausible mechanisms each morning before they have had their first cup of coffee. But the most compelling reason that this judicial hiccup is inconsequential is that the plaintiffs’ expert witnesses’ postulated mechanism, inflammation, was demonstrably absent in the tissue of the specific plaintiffs.  Carl at *13. The glib invocation of “inflammation” would seem bound to fail even as the most liberal test of plausibility when talc has anti-cancer properties that result from its ability to inhibit new blood vessel formation, a necessity of solid tumor growth, and the completely unexplained selectivity for ovarian tissue to the postulated effect, which leaves vaginal, endometrial, or fallopian tissues unaffected. Carl at *13-14. On at least two occasions, the United States Food and Drug Administration rejected “Citizen Petitions” for ovarian cancer warnings on talc products, advanced by the dubious Samuel S. Epstein for the Cancer Prevention Coalition, in large measure because of Epstein’s undue selectivity in citing epidemiologic studies and because a “cogent biological mechanism by which talc might lead to ovarian cancer is lacking… .” Carl at *15, citing Stephen M. Musser, Directory FDA Director, Letter Denying Citizens’ Petition (April 1, 2014).

Large Studies

Judge Johnson quoted the Reference Manual on Scientific Evidence (3d ed.  2011) for his suggestion that establishing causation requires large studies.  The quoted language, however, really does not bear on his suggestion:

“Common sense leads one to believe that a large enough sample of individuals must be studied if the study is to identify a relationship between exposure to an agent and disease that truly exists. Common sense also suggests that by enlarging the sample size (the size of the study group), researchers can form a more accurate conclusion and reduce the chance of random error in their results…With large numbers, the outcome of test is less likely to be influenced by random error, and the researcher would have greater confidence in the inferences drawn from the data.”

Reference Manual at page 576.  What the Reference Manual simply calls for studies with “large enough” samples.  How large is large enough is a variable that depends upon the magnitude of the association to be detected, the length of follow up, and the base rate or incidence of the outcome of interest. As far as “common sense,” goes, the Reference Manual is correct only insofar as larger is better with respect to sampling error.  Increasing sample size does nothing to address internal or external validity of studies, and may lead to erroneous interpretations by allowing results to achieve statistical significance at predetermined levels, when the observed associations result from bias or confounding, and not from any underlying relationship between exposure and disease outcome.

There is a more disturbing implication in Judge Johnson’s criticism of Graham Colditz for relying upon the smaller number of subjects in the case-control studies than are found in the available cohort studies. Ovarian cancer is a relatively rare cancer (compared with breast and colon cancer), and case-control studies are more efficient at assessing increased risk than are cohort studies for a rare outcome.  The number of cases in a case-control study represents an implied population many times larger than the number of actual cases in a case-control study.  If Judge Johnson had looked at the width of the confidence intervals for the “small” case-control studies, and compared those widths to the interval widths of the cohort studies, he would have seen that “smaller” case-control studies (fewer cases, as well as fewer total subjects) can generate more statistical precision than the larger cohort studies (with many more cohort and control subjects).  A more useful comparison would have been to the number of actual ovarian cancer cases in the meta-analyzed case-control studies with the number of actual ovarian cancer cases in the cohort studies. On this comparison, the cohort studies might not fare so well.

The size of the cohort for a rare outcome is thus fairly meaningless in terms of the statistical precision generated.  Smaller case-control studies will likely have much more power, and that should be reflected in the confidence intervals of the respective studies.

The issue, as I understand the talc litigation, is not size of the case-control versus cohort studies, but rather their analytical resolving power.  Case-control studies for this sort of exposure and outcome will be plagued by recall and other biases, as well as difficulty in selecting the right control group.  And the odds ratio will tend to overestimate the relative risk, in both directions.  Cohort studies, with good, pre-morbid exposure assessments, would thus be much more rigorous and accurate in estimating the true rate ratios. In the final analysis, Judge Johnson was correct to be critical of Graham Colditz for dismissing the cohort studies, but his rationale for this criticism was, in a few places, confused and confusing. There was nothing subtle about the analytical gaps, ipse dixits, and cherry picking shown by these plaintiffs’ expert witnesses.


[1] SeeJohnson of Accutane – Keeping the Gate in the Garden State” (Mar. 28, 2015).

The opinions, statements, and asseverations expressed on Tortini are my own, or those of invited guests, and these writings do not necessarily represent the views of clients, friends, or family, even when supported by good and sufficient reason.