[Air-L] Two large post-bin Laden tweet collections can be downloaded in XML format

Stuart Shulman stuart.shulman at gmail.com
Wed May 4 05:21:19 PDT 2011


Two large post-bin Laden tweet collections can be downloaded in XML format.

http://discovertext.com/osamabinladen.aspx

The datafiles are samples taken from live feed Twitter imports starting
shortly after the announcement that Osama bin Laden’s death.

- Twitter searches for "bin laden" (647,585 documents, 505 MB)
- Twitter searches for "osama" (586,665 documents, 451 MB)

Smaller random samples (<50,000 - <10,000) are posted as sample datasets
inside DiscoverText.

Holders of valid email addresses from the following Universities have
Enterprise license access to DiscoverText text analytic Web services:

- Carnegie Mellon University
- SUNY College at Oneonta
- Tennessee State University
- University at Buffalo
- University of Manchester
- University of Massachusetts Amherst
- University of Pittburgh
- University of Washington

If you have a valid email address from any of these schools, there is a easy
method to request your license key.

https://discovertext.com/RegisterEduSelect.aspx

If you do not see your school listed, please contact me about getting an
educational site license. Alternately, the 30-day free trial is available to
any new user.

With DiscoverText you can:

+ Archive social media content from various sources including YouTube,
Twitter & Facebook
+ Search for key concepts & code text with powerful tools
+ Remove duplicates & cluster similar comments automatically
+ Auto-highlight unique & offensive language
+ Generate & drill into tag clouds
+ Form peer & project networks to work on projects and tasks
+ Establish credentials & permissions to give you complete control over your
data
+ Assign multiple coders to tasks giving you greater qualitative analysis
+ Annotate coding with shared memos to further leverage human insight &
knowledge
+ Easily measure inter-coder reliability for accurate analysis
+ Adjudicate valid & invalid coder decisions
+ Generate reports in RTF, CSV, PDF or XML format
+ Archive or share completed projects online

~Stu

-- 

Stuart Shulman
President & CEO
Texifter, LLC <http://www.texifter.com/>


Have you tried DiscoverText?
http://discovertext.com
*Featuring the Facebook Graph & Twitter APIs*



More information about the Air-L mailing list