Abstract
Context: Since its launch in 2001, Wikipedia has become the most popular general reference site on the Internet and a popular source of health care information. To evaluate the accuracy of this resource, the authors compared Wikipedia articles on the most costly medical conditions with standard, evidence-based, peer-reviewed sources.
Methods: The top 10 most costly conditions in terms of public and private expenditure in the United States were identified, and a Wikipedia article corresponding to each topic was chosen. In a blinded process, 2 randomly assigned investigators independently reviewed each article and identified all assertions (ie, implication or statement of fact) made in it. The reviewer then conducted a literature search to determine whether each assertion was supported by evidence. The assertions found by each reviewer were compared and analyzed to determine whether assertions made by Wikipedia for these conditions were supported by peer-reviewed sources.
Results: For commonly identified assertions, there was statistically significant discordance between 9 of the 10 selected Wikipedia articles (coronary artery disease, lung cancer, major depressive disorder, osteoarthritis, chronic obstructive pulmonary disease, hypertension, diabetes mellitus, back pain, and hyperlipidemia) and their corresponding peer-reviewed sources (P<.05) and for all assertions made by Wikipedia for these medical conditions (P<.05 for all 9).
Conclusion: Most Wikipedia articles representing the 10 most costly medical conditions in the United States contain many errors when checked against standard peer-reviewed sources. Caution should be used when using Wikipedia to answer questions regarding patient care. Since its 2001 launch, Wikipedia (
http://www.wikipedia.org/) has become the most popular general reference site on the Internet, ranking 6th globally based on Internet traffic.1 As of March 2014, it contained more than 31 million articles in 285 languages.2 Wikipedia's prominence has been made possible by its fundamental design as a wiki, or collaborative database, allowing all users the ability to add, delete, and edit information at will. However, it is this very feature that has raised concern in the medical community regarding the reliability of the information it contains. Despite these concerns, Wikipedia has become a popular source of health care information,3 with 47% to 70% of physicians and medical students admitting to using it as a reference.4-6 In actuality, these figures may be higher because some researchers suspect its use is underreported.7 Although the effect of Wikipedia's information on medical decision making is unclear, it almost certainly has an influence.
Wikipedia has several mechanisms in place to deal with unverifiable information and vandalism.8 Because of the frequency of editing and revisions, most instances of vandalism only exist for a few days after being identified, with half of the corrections being posted less than 3 minutes after being identified.9 One study found that some corrections were made almost instantaneously in 42% of cases.10 There is a push on Wikipedia to have statements backed by references and unverifiable statements being called out to readers.11 Haigh12 observed that, in general, medically related articles on Wikipedia are accompanied by a sufficient amount of reputable citations. To evaluate Wikipedia's accuracy, we compared Wikipedia articles on the 10 most costly medical conditions in the United States with recognized peer-reviewed sources.
Methods: The 10 most costly conditions in the United States by public and private expenditure in 2008—the year that the most complete data were available for the present study—were identified from the publicly available database from the Agency for Healthcare Research and Quality.13 We then identified 10 Wikipedia articles that we believed most closely related to each of those conditions. Because Wikipedia articles are dynamic and subject to frequent changes and updates, we printed the selected articles on April 25, 2012, for our research purposes. In a blinded process, we randomly selected 10 reviewers to examine 2 of the selected Wikipedia articles. Each reviewer was an internal medicine resident or rotating intern at the time of the assignment. This arrangement created redundancy, giving the study 2 independent reviewers for each article. Also, by using physicians as reviewers, we ensured a baseline competency in medical literature interpretation and research. We used a Web-based randomizer (
http://www.random.org) to assign the selected Wikipedia articles to each reviewer. Reviewers were asked to identify every assertion (ie, implication or statement of fact) in the Wikipedia article and to fact-check each assertion against a peer-reviewed source that was published or updated within the past 5 years. Reviewers were sent an e-mail containing examples of assertions (eg, “diuretics are the initial drug of choice for essential hypertension without co-morbidities”).
The authors instructed the reviewers to use UpToDate (
http://www.uptodate.com/) as the initial means by which to search for peer-reviewed sources. If UpToDate did not produce adequate results, then each reviewer was instructed to use PubMed (
http://www.ncbi.nlm.nih.gov/pubmed), Google Scholar (
http://scholar.google.com/), or a search engine of their choice. Each reviewer then reported concordance or discordance between Wikipedia and the peer-reviewed sources. Two researchers who did not participate in the original review process then compared both reviews of each article for similar assertions as well as dissimilar assertions and tallied the concordance and discordance for each. The null hypothesis of the study was that there would be concordance between the Wikipedia article and the peer-reviewed sources (P>.05). The alternative hypothesis was that there would be discordance (ie, no concordance) between the Wikipedia article and the peer-reviewed sources (P<.05). A McNemar test for correlated proportions was conducted for the assertions that were similar, dissimilar, or both, as assessed by the blinded reviewers.14(pp171-178)
Results: The Agency for Healthcare Research and Quality13 listed the following 10 conditions as the costliest: heart disease, cancer, mental disorders, trauma-related disorders, osteoarthritis, chronic obstructive pulmonary disease/asthma, hypertension, diabetes, back problems, and hyperlipidemia. The corresponding Wikipedia articles15-24 are listed in Table 1. Examples of the descriptive terms we used to categorize the findings of each reviewer are listed on Table 2. Reviewers found a statically significant discordance between Wikipedia and peer-reviewed sources for assertions that were similar (P<.05) in all but 1 of the conditions: trauma-related disorders (ie, concussions). The same was true for all assertions found by the blinded reviewers of the articles (P<.05 for all conditions except concussions). In 4 articles—major depressive disorder, osteoarthritis, chronic obstructive pulmonary disease, and diabetes mellitus—there was a statistically significant discordance between Wikipedia articles and peer-reviewed sources for dissimilar assertions. The interpretation of the P value is true for similar assertions between the 2 reviewers as well as for dissimilar assertions (Table 3).
Discussion: researchers28-30 have determined that Wikipedia is unsuitable as a reference for drugs. Except for psychiatric conditions,26 scientific research has never, to our knowledge, focused on Wikipedia's content on prevalent medical conditions. A recent study by Azer31 concluded that Wikipedia is not a reliable information source for medical students in gastroenterology and hepatology. The present study demonstrated that most Wikipedia articles on the 10 most costly conditions in the United States contained assertions that are inconsistent with peer-reviewed sources. Because our standard was the peer-reviewed published literature, it can be argued that these assertions on Wikipedia represent factual error. [...]
Conclusion: Most Wikipedia articles for the 10 costliest conditions in the United States contain errors compared with standard peer-reviewed sources. Health care professionals, trainees, and patients should use caution when using Wikipedia to answer questions regarding patient care. Our findings reinforce the idea that physicians and medical students who currently use Wikipedia as a medical reference should be discouraged from doing so because of the potential for errors.