[Air-L] Two large post-bin Laden tweet collections can be downloaded in XML format
Stuart Shulman
stuart.shulman at gmail.com
Wed May 4 05:21:19 PDT 2011
Two large post-bin Laden tweet collections can be downloaded in XML format.
http://discovertext.com/osamabinladen.aspx
The datafiles are samples taken from live feed Twitter imports starting
shortly after the announcement that Osama bin Laden’s death.
- Twitter searches for "bin laden" (647,585 documents, 505 MB)
- Twitter searches for "osama" (586,665 documents, 451 MB)
Smaller random samples (<50,000 - <10,000) are posted as sample datasets
inside DiscoverText.
Holders of valid email addresses from the following Universities have
Enterprise license access to DiscoverText text analytic Web services:
- Carnegie Mellon University
- SUNY College at Oneonta
- Tennessee State University
- University at Buffalo
- University of Manchester
- University of Massachusetts Amherst
- University of Pittburgh
- University of Washington
If you have a valid email address from any of these schools, there is a easy
method to request your license key.
https://discovertext.com/RegisterEduSelect.aspx
If you do not see your school listed, please contact me about getting an
educational site license. Alternately, the 30-day free trial is available to
any new user.
With DiscoverText you can:
+ Archive social media content from various sources including YouTube,
Twitter & Facebook
+ Search for key concepts & code text with powerful tools
+ Remove duplicates & cluster similar comments automatically
+ Auto-highlight unique & offensive language
+ Generate & drill into tag clouds
+ Form peer & project networks to work on projects and tasks
+ Establish credentials & permissions to give you complete control over your
data
+ Assign multiple coders to tasks giving you greater qualitative analysis
+ Annotate coding with shared memos to further leverage human insight &
knowledge
+ Easily measure inter-coder reliability for accurate analysis
+ Adjudicate valid & invalid coder decisions
+ Generate reports in RTF, CSV, PDF or XML format
+ Archive or share completed projects online
~Stu
--
Stuart Shulman
President & CEO
Texifter, LLC <http://www.texifter.com/>
Have you tried DiscoverText?
http://discovertext.com
*Featuring the Facebook Graph & Twitter APIs*
More information about the Air-L
mailing list