Wikipedia in the news - rip and read.
-
EricBarbour
-
- Posts: 10891
- kołdry
- Joined: Wed Mar 14, 2012 11:32 pm
- Location: hell
Unread post
by EricBarbour » Sun Apr 28, 2013 9:43 pm
http://www.guardian.co.uk/technology/20 ... et-archive
Note:
Philosophical allies include
www.wikimedia.org, Mozilla, the free software community, the Electronic Frontier Foundation, a digital rights advocacy group, and the internet activist Aaron Swartz, until his death in January.
I'd like to corner Brewster Kahle someday, and ask him if he's aware that his "allies" at Wikipedia and the Wikimedia Foundation have repeatedly censored their own databases, and are using "nofollow" on all Wikimedia sites partly to keep the Internet Archive from saving copies of the censored items.
-
thekohser
- Majordomo
- Posts: 13406
- Joined: Thu Mar 15, 2012 5:07 pm
- Wikipedia User: Thekohser
- Wikipedia Review Member: thekohser
- Actual Name: Gregory Kohs
- Location: United States
-
Contact:
Unread post
by thekohser » Mon Apr 29, 2013 11:04 am
I don't think "nofollow" prevents any sort of scraping or archiving mechanism. Are you maybe confusing with robots.txt "Disallow"?
"...making nonsensical connections and culminating in feigned surprise, since 2006..."
-
Poetlister
- Genius
- Posts: 25599
- Joined: Wed Jan 02, 2013 8:15 pm
- Nom de plume: Poetlister
- Location: London, living in a similar way
-
Contact:
Unread post
by Poetlister » Mon Apr 29, 2013 11:34 am
thekohser wrote:I don't think "nofollow" prevents any sort of scraping or archiving mechanism. Are you maybe confusing with robots.txt "Disallow"?
Can a robots.txt actually prevent scraping? I know that the major search engines observe these rules, but I was under the impression that this was no more than a gentleman's agreement.
"The higher we soar the smaller we appear to those who cannot fly" - Nietzsche
-
lilburne
- Habitué
- Posts: 4446
- Joined: Thu Mar 15, 2012 6:18 pm
- Wikipedia User: Nastytroll
- Wikipedia Review Member: Lilburne
Unread post
by lilburne » Mon Apr 29, 2013 12:32 pm
Currently commented out:
# Don't allow the wayback-maschine to index user-pages
#User-agent: ia_archiver
#Disallow: /wiki/User
#Disallow: /wiki/Benutzer
They have been inserting little memes in everybody's mind
So Google's shills can shriek there whenever they're inclined