Brewster's trillions: Internet Archive strives to keep web h

Wikipedia in the news - rip and read.
EricBarbour
 
Posts: 10891
kołdry
Joined: Wed Mar 14, 2012 11:32 pm
Location: hell

Brewster's trillions: Internet Archive strives to keep web h

Unread post by EricBarbour » Sun Apr 28, 2013 9:43 pm

http://www.guardian.co.uk/technology/20 ... et-archive

Note:
Philosophical allies include www.wikimedia.org, Mozilla, the free software community, the Electronic Frontier Foundation, a digital rights advocacy group, and the internet activist Aaron Swartz, until his death in January.
I'd like to corner Brewster Kahle someday, and ask him if he's aware that his "allies" at Wikipedia and the Wikimedia Foundation have repeatedly censored their own databases, and are using "nofollow" on all Wikimedia sites partly to keep the Internet Archive from saving copies of the censored items.

User avatar
thekohser
Majordomo
Posts: 13406
Joined: Thu Mar 15, 2012 5:07 pm
Wikipedia User: Thekohser
Wikipedia Review Member: thekohser
Actual Name: Gregory Kohs
Location: United States
Contact:

Re: Brewster's trillions: Internet Archive strives to keep w

Unread post by thekohser » Mon Apr 29, 2013 11:04 am

I don't think "nofollow" prevents any sort of scraping or archiving mechanism. Are you maybe confusing with robots.txt "Disallow"?
"...making nonsensical connections and culminating in feigned surprise, since 2006..."

User avatar
Poetlister
Genius
Posts: 25599
Joined: Wed Jan 02, 2013 8:15 pm
Nom de plume: Poetlister
Location: London, living in a similar way
Contact:

Re: Brewster's trillions: Internet Archive strives to keep w

Unread post by Poetlister » Mon Apr 29, 2013 11:34 am

thekohser wrote:I don't think "nofollow" prevents any sort of scraping or archiving mechanism. Are you maybe confusing with robots.txt "Disallow"?
Can a robots.txt actually prevent scraping? I know that the major search engines observe these rules, but I was under the impression that this was no more than a gentleman's agreement.
"The higher we soar the smaller we appear to those who cannot fly" - Nietzsche

User avatar
lilburne
Habitué
Posts: 4446
Joined: Thu Mar 15, 2012 6:18 pm
Wikipedia User: Nastytroll
Wikipedia Review Member: Lilburne

Re: Brewster's trillions: Internet Archive strives to keep w

Unread post by lilburne » Mon Apr 29, 2013 12:32 pm

Currently commented out:

# Don't allow the wayback-maschine to index user-pages
#User-agent: ia_archiver
#Disallow: /wiki/User
#Disallow: /wiki/Benutzer
They have been inserting little memes in everybody's mind
So Google's shills can shriek there whenever they're inclined

Post Reply