About this Dataroom
This dataset lists the ~ 58k tweets that mentioned a scientific article (broadly speaking anything with a DOI, PMID or arxiv ID) between the 1st and 31st of July 2011.
Recall isn't 100%: my best estimate is that it's missing another ~ 6k tweets where the article couldn't be identified, the link was malformed or the journal involved is new or gets very low traffic.
Twitter's TOS prohibit re-distribution of the tweets themselves but the dataset contains the extracted links, the tweet ID and some information about the tweeter (screen name, country & lat/lng derived from their location using Yahoo! Placemaker).
The links, pmids, dois and arxiv_ids columns can contain more than one value and are pipe (|) delimited.
The RTs column contains a pipe delimited list of screen names credited with a RT / MT / via in the tweet body.
If you use this dataset please credit http://www.altmetric.com somewhere - doesn't need to be a prominent link or a graphic or anything, some text tucked away on an about page will do!
Recommended Similar Datasets
Geographical distribution of tweeters mentioning scientific articles - July 2011
Geographical distribution of tweeters mentioning scientific articles - July 2011
Btw, I just looked up Altmetric. We just had a really interesting talk with Michael Nielsen about how to apply meaningful metrics/scores to science data yesterday! Neat! Look forward to when Altmetric is public ...
I've split up lat/lng into separate columns to make life easier and added an "RTs" column containing a pipe delimited list of usernames that were credited with an RT / MT / via in the tweet body.