Grab your TwapperKeeper Archive before Shutdown!
Yesterday, TwapperKeeper announced:
Twapper Keeper’s archiving is now available in HootSuite! As a result, we will be shutting down Twapper Keeper. Existing archives will be kept running until Jan 6, 2012, after which you will not be able to access your archives anymore.
For those who wants to save archives before the shut down, this very simple R-code may give you an idea . It allows you to query for archived tweets referencing a hashtag. Furthermore, it downloads up to 50.000 archived tweets containing this hashtag as csv-file from TwapperKeeper.
require(XML) hashtag <- "ala11" #your hashtag tweet.df <- data.frame() url <- paste("http://twapperkeeper.com/rss.php?type=hashtag&name=",hashtag,"&l=50000", sep="") doc <- xmlTreeParse(url,useInternal=T) tweet <- xpathSApply(doc, "//item//title", xmlValue) pubDate <- xpathSApply(doc, "//item//pubDate", xmlValue) tweet.df <- cbind(tweet,pubDate) write.csv(tweet.df,"myTweets.csv")
More
Gary Green: No More Access To Your Twapper Keeper Archives .
Cornelius Puschmann: Academic replacements for TwapperKeeper.com
(NJ)
[…] and your to control!) here’s a way to export your existing TwapperKeeper archives.Inspired by LIBREAS.Library Grab your TwapperKeeper Archive before Shutdown! I’ve developed a Google Spreadsheet to export TwapperKeeper Archives. Here’s how:Open this […]
[…] more selfish approach might be to grab one or more Twapperkeeper archives onto your own computer. Grab your TwapperKeeper Archive before Shutdown! describes how to use R to do this, and is claimed to work for rescues of up to 50,000 tweets from […]
[…] archives using Google Spreadsheet. Martin’s post also links to a post entitled LIBREAS.Library Grab your TwapperKeeper Archive before Shutdown! which describes a technique which can be used by those familiar with R code. Tont Hirst on the […]
Hi!
When I’m trying to run the following command:
doc <- xmlTreeParse(url,useInternal=T)
I'm receiving this error:
No such file or directory2: failed to load external entity "http://twapperkeeper.com/rss.php?type=hashtag&name=myhashtag&l=50000"
What is wrong?
Hi,
you have to assign the hashtag of the archive you want to fetch in the vector, eg
hashtag <- "ala10"
There is no twapperkeeper archive named "myhashtag"
Have a look as well on the other blogs referencing our approach, they offer far more user-friendly and/or sophisticated possibilities to fetch twappperkeeper archives
[…] Hash-Tag-Rankings bei Twitter selbst – dazu, akute Popularitäten in der Zeit zu messen und – sofern die Archivierung des Materials gelingt – auf lange Sicht für bestimmte ebenfalls auf Themen, Akteure oder auch einzelne Formulierungen […]
[…] haben hier bereits früh einen Weg dargelegt, wie sich Archive leicht anhand eines Hash-Tags aus TwapperKeeper […]