I'm working on a potential blog entry about the difference between the old "crowdsourcing" days on WP compared to whatever you want to call it now, and how this might be one reason for the decline in recruitment and retention. This article might be a good example, but given that a huge section was added in one edit in 2010, I'm wondering if it might be scraped, or perhaps someone posting their paper. Could be either one, given the style of referencing.keep There are in fact references provided, but they are not in-line and do not provide page numbers. I don't think this is an essay so much as it is something written back in the days when the standards for article writing/referencing were a lot more lax. The topic is surely notable. Mangoe (talk) 17:22, 1 March 2013 (UTC)
Checking for plagiarism
- SB_Johnny
- Habitué
- Posts: 4640
- kołdry
- Joined: Thu Mar 15, 2012 1:26 am
- Wikipedia User: SB_Johnny
- Wikipedia Review Member: SB_Johnny
Checking for plagiarism
I'm curious if anyone knows a good tool for checking WP articles for plagiarized content, specifically for Death in Jainism (T-H-L). I ran across it while trawling through AfD yesterday, and found an interesting comment:
This is not a signature.✌
- Peter Damian
- Habitué
- Posts: 4206
- Joined: Thu Mar 15, 2012 8:14 pm
- Wikipedia User: Peter Damian
- Wikipedia Review Member: Peter Damian
- Location: London
- Contact:
Re: Checking for plagiarism
The introduction is poorly written and incoherent. "Death in Jainism consist [sic] of a variety of concepts and reactions for the soul. Jainism considers death to be a mysterious event that can be experienced by ordinary souls and higher types of souls".
However, parts of the rest are written in a different, more intelligible and educated style, suggesting plagiarism (or a different author). The only way to detect plagiarism mechanically is via Google, and that only works when the source in online. It returned nothing.
[edit] Checking on the edit history, I was right that the introductory (incoherent) part had a different author. So where did the big edit from 2010 come from?
However, parts of the rest are written in a different, more intelligible and educated style, suggesting plagiarism (or a different author). The only way to detect plagiarism mechanically is via Google, and that only works when the source in online. It returned nothing.
Sorry, what difference would be the reason for the decline in recruitment and retention?SB_Johnny wrote:I'm working on a potential blog entry about the difference between the old "crowdsourcing" days on WP compared to whatever you want to call it now, and how this might be one reason for the decline in recruitment and retention.
[edit] Checking on the edit history, I was right that the introductory (incoherent) part had a different author. So where did the big edit from 2010 come from?
οὐκ ἀγαθὸν πολυκοιρανίη: εἷς κοίρανος ἔστω
Re: Checking for plagiarism
I do but for the life of me I cant remember its name. It was oddly enough a web-based tool that 4chan users used to pick various types of media from small parts of photos/text/blurbs etc. They mostly used it to source porn. (I was using it to source cover art from sci-fi books.) It worked pretty well with text as well as pictures.
I will see if I still have the link when I have access to my main comp later tonight.
I will see if I still have the link when I have access to my main comp later tonight.
- Peter Damian
- Habitué
- Posts: 4206
- Joined: Thu Mar 15, 2012 8:14 pm
- Wikipedia User: Peter Damian
- Wikipedia Review Member: Peter Damian
- Location: London
- Contact:
Re: Checking for plagiarism
This will not work when the source is not online, as with this case.Anroth wrote:I do but for the life of me I cant remember its name. It was oddly enough a web-based tool that 4chan users used to pick various types of media from small parts of photos/text/blurbs etc. They mostly used it to source porn. (I was using it to source cover art from sci-fi books.) It worked pretty well with text as well as pictures.
I will see if I still have the link when I have access to my main comp later tonight.
You have a clue in the edit itself, http://en.wikipedia.org/w/index.php?tit ... =332243696 , right at the bottom ('bibliography').
οὐκ ἀγαθὸν πολυκοιρανίη: εἷς κοίρανος ἔστω
- Peter Damian
- Habitué
- Posts: 4206
- Joined: Thu Mar 15, 2012 8:14 pm
- Wikipedia User: Peter Damian
- Wikipedia Review Member: Peter Damian
- Location: London
- Contact:
Re: Checking for plagiarism
Ah I've found it. Macmillan Encyclopedia of Death and Dying (2003): A-K - Page 491. Some parts of books are not available to the main search engine. Yes, it's clearly plagiarised.
οὐκ ἀγαθὸν πολυκοιρανίη: εἷς κοίρανος ἔστω
-
- Retired
- Posts: 4130
- Joined: Thu Nov 01, 2012 1:40 pm
- Wikipedia User: Scott
- Location: London
- Contact:
Re: Checking for plagiarism
The Macmillan Encyclopedia of Death and Dying.Peter Damian wrote:So where did the big edit from 2010 come from?
Hence I have nuked it.
Google Books is probably the best tool we have for identifying plagiarism of this sort.
Edit: You just beat me to it!
My question, to this esteemed Wiki community, is this: Do you think that a Wiki could successfully generate a useful encyclopedia? -- JimboWales
Yes, but in the end it wouldn't be an encyclopedia. It would be a wiki. -- WardCunningham (Jan 2001)
Yes, but in the end it wouldn't be an encyclopedia. It would be a wiki. -- WardCunningham (Jan 2001)
- Peter Damian
- Habitué
- Posts: 4206
- Joined: Thu Mar 15, 2012 8:14 pm
- Wikipedia User: Peter Damian
- Wikipedia Review Member: Peter Damian
- Location: London
- Contact:
Re: Checking for plagiarism
Idiot. Now they will claim that crowdsourcing works. I used to publish errors all the time. People would follow my blog and correct them. Oh well.Hex wrote:Hence I have nuked it.
(I still have a nice collection of stuff that I haven't published, which I am keeping for the book).
This does prove my general principle though, that some parts of Wikipedia are original and some are good, but the parts which are good are not original, and the parts which are original are not good.
Last edited by Peter Damian on Sat Mar 09, 2013 4:10 pm, edited 3 times in total.
οὐκ ἀγαθὸν πολυκοιρανίη: εἷς κοίρανος ἔστω
Re: Checking for plagiarism
Hmm, well I wouldnt take it as granted parts of it might not be online somewhere. If it was quoted elsewhere extensively or lifted entirely from secondary source. But I thought the OU Press was pretty good at digitizing its stuff past/present?
If the sources are not online then there is no tool other thank shanks pony and a decent library.
-edit- Doh as I write this its already been found.
If the sources are not online then there is no tool other thank shanks pony and a decent library.
-edit- Doh as I write this its already been found.
-
- Retired
- Posts: 4130
- Joined: Thu Nov 01, 2012 1:40 pm
- Wikipedia User: Scott
- Location: London
- Contact:
Re: Checking for plagiarism
I do hope that you meant that ironically, Peter.Peter Damian wrote:Idiot. Now they will claim that crowdsourcing works.Hex wrote:Hence I have nuked it.
My question, to this esteemed Wiki community, is this: Do you think that a Wiki could successfully generate a useful encyclopedia? -- JimboWales
Yes, but in the end it wouldn't be an encyclopedia. It would be a wiki. -- WardCunningham (Jan 2001)
Yes, but in the end it wouldn't be an encyclopedia. It would be a wiki. -- WardCunningham (Jan 2001)
- Peter Damian
- Habitué
- Posts: 4206
- Joined: Thu Mar 15, 2012 8:14 pm
- Wikipedia User: Peter Damian
- Wikipedia Review Member: Peter Damian
- Location: London
- Contact:
Re: Checking for plagiarism
I rarely speak without irony.Hex wrote:I do hope that you meant that ironically, Peter.Peter Damian wrote:Idiot. Now they will claim that crowdsourcing works.Hex wrote:Hence I have nuked it.
οὐκ ἀγαθὸν πολυκοιρανίη: εἷς κοίρανος ἔστω
-
- Retired
- Posts: 4130
- Joined: Thu Nov 01, 2012 1:40 pm
- Wikipedia User: Scott
- Location: London
- Contact:
Re: Checking for plagiarism
My question, to this esteemed Wiki community, is this: Do you think that a Wiki could successfully generate a useful encyclopedia? -- JimboWales
Yes, but in the end it wouldn't be an encyclopedia. It would be a wiki. -- WardCunningham (Jan 2001)
Yes, but in the end it wouldn't be an encyclopedia. It would be a wiki. -- WardCunningham (Jan 2001)
- SB_Johnny
- Habitué
- Posts: 4640
- Joined: Thu Mar 15, 2012 1:26 am
- Wikipedia User: SB_Johnny
- Wikipedia Review Member: SB_Johnny
Re: Checking for plagiarism
Well, no big surprise there. The weird thing is that the account only made those 2 edits to that one article (no deleted contribs either). Why would someone do that?Hex wrote:The Macmillan Encyclopedia of Death and Dying.Peter Damian wrote:So where did the big edit from 2010 come from?
Hence I have nuked it.
I wonder if the article would have fared as well at the AfD without that section.
Ched from WP recommended this plagiarism checker.Hex wrote:Google Books is probably the best tool we have for identifying plagiarism of this sort.
This is not a signature.✌
- thekohser
- Majordomo
- Posts: 13410
- Joined: Thu Mar 15, 2012 5:07 pm
- Wikipedia User: Thekohser
- Wikipedia Review Member: thekohser
- Actual Name: Gregory Kohs
- Location: United States
- Contact:
Re: Checking for plagiarism
Those sorts of tools are very easy to fool. For example, checking against some of my own scholarly work...SB_Johnny wrote:Ched from WP recommended this plagiarism checker.
The Luftwaffe's medium-sized bombers were not sufficient to inflict the necessary damage that a strategic campaign supposedly required. This lack of heavy bombers was not the only handicap. -- Results: Plagiarism suspected - use links above to check
The Luftwaffe's mid-sized bombers were insufficient to inflict the necessary damage that a strategic campaign supposably required. This lack of large bombers was not the only handicap. -- Results: No plagiarism suspected
"...making nonsensical connections and culminating in feigned surprise, since 2006..."
- SB_Johnny
- Habitué
- Posts: 4640
- Joined: Thu Mar 15, 2012 1:26 am
- Wikipedia User: SB_Johnny
- Wikipedia Review Member: SB_Johnny
Re: Checking for plagiarism
Huh, so all it takes is a "wikignome" fixing grammar mistakes and off to the races.thekohser wrote:Those sorts of tools are very easy to fool. For example, checking against some of my own scholarly work...SB_Johnny wrote:Ched from WP recommended this plagiarism checker.
The Luftwaffe's medium-sized bombers were not sufficient to inflict the necessary damage that a strategic campaign supposedly required. This lack of heavy bombers was not the only handicap. -- Results: Plagiarism suspected - use links above to checkThe Luftwaffe's mid-sized bombers were insufficient to inflict the necessary damage that a strategic campaign supposably required. This lack of large bombers was not the only handicap. -- Results: No plagiarism suspected
This is not a signature.✌
- Randy from Boise
- Been Around Forever
- Posts: 12231
- Joined: Sun Mar 18, 2012 2:32 am
- Wikipedia User: Carrite
- Wikipedia Review Member: Timbo
- Actual Name: Tim Davenport
- Nom de plume: T. Chandler
- Location: Boise, Idaho
Re: Checking for plagiarism
This is a great topic.SB_Johnny wrote:I'm curious if anyone knows a good tool for checking WP articles for plagiarized content, specifically for Death in Jainism (T-H-L). I ran across it while trawling through AfD yesterday, and found an interesting comment:
I'm working on a potential blog entry about the difference between the old "crowdsourcing" days on WP compared to whatever you want to call it now, and how this might be one reason for the decline in recruitment and retention. This article might be a good example, but given that a huge section was added in one edit in 2010, I'm wondering if it might be scraped, or perhaps someone posting their paper. Could be either one, given the style of referencing.keep There are in fact references provided, but they are not in-line and do not provide page numbers. I don't think this is an essay so much as it is something written back in the days when the standards for article writing/referencing were a lot more lax. The topic is surely notable. Mangoe (talk) 17:22, 1 March 2013 (UTC)
The Norton case currently creeping towards a close at ArbCom had an enormous component of the "two Wikipedias" behind it. Some more or less "normal" 2005-style editing, an unattributed paste-in of an official church history from the web, for example, came back to bite him on the ass. ("Here is the official church history: <blockquote>")
You multiply scores of similar small infractions over time and get the volunteers at Contributor Copyright Investigations all pissed off over a massive pile of articles to review that will take 12 to 20 years to complete (based on current staffing and procedures) and first thing you know you've got a scaffold being erected to "cleanse the community" of a "serial copyright violator."
Like Norton or not — he's not my pal, I'm neutral on the topic — it's pretty much dirty pool to shut him out of Wikipedia due MOSTLY to changing site standards of acceptable work.
This is not to say he hasn't committed certain sins over the years or that he doesn't need some sort of restrictions or oversight, mind you. Only that the first reaction at WP is frequently to round up a lynch mob...
There is absolutely no doubt that much of the decline in editing relates to the way that in-line footnoting has become an essential part of acceptable editing practice. And that is fine.
RfB