[Air-L] Downloading online forum postings and analyzing occurence of keywords in the data
Patrick Williams
subcultures at gmail.com
Wed Jun 29 04:52:24 PDT 2011
Hi Koen,
I don't have any experience with scraping software, but I recently did a
keyword analysis as part of a study of ethno-nationalist interaction on a
Transylvanian forum. I used Wordsmith
5<http://www.lexically.net/wordsmith/version5/index.html>.
It had a bit of a steep learning curve, but it was recommended by a corpus
linguist and was quite efficient once I understood how it worked.
I have a short paper partly based on using Wordsmith from a corpus
linguistics (methodological) perspective and its relevance for
microsociology (i.e., symbolic interactionism) in case you're interested.
Best,
patrick.
On Wed, Jun 29, 2011 at 3:12 AM, Leurs, K.H.A. (Koen) <K.H.A.Leurs at uu.nl>wrote:
>
> Dear list-members,
>
> I'm a Utrecht University based (Netherlands) researcher working on the
> European http://www.mignetproject.eu/ project working on transnational
> digital networks, migration and gender.
> More specifically I'm studying knowledge and education processes across
> formal (university) and informal (forums, wikipedia) knowledge domains.
>
> I was wondering if anyone on the list has experiences with scraping online
> forum data and searching it for the occurence of keywords. Especially I'm
> wondering which software best suits my preferences, in selecting for
> instance a certain timeline to scrape as I would like to compare the
> curriculum of a year of university courses
> and their attention for diversity issues and a year of forum postings and
> their attention for diversity issues.
>
> Scraping software that I have used so far are:
> Wget
>
> Software that I have been looking at for searching keywords in digital data
> are:
> Provalis Research Wordstat
> NVivo
> Digitalmethods.net > issuecrawler
>
> I'm looking forward to any suggestions, and thank you for your time.
>
> Kind regards,
>
> Koen.
>
>
> Koen Leurs | Aio / PhD student Gender Studies | Research Institute for
> History and Culture |
> | Utrecht University, the Netherlands | Muntstraat 2a | 3512 EV
> Utrecht | Room 1.12 |
> www.koenleurs.net | www.uu.nl/wiredup | www.mignetproject.eu/ |
>
>
> _______________________________________________
> The Air-L at listserv.aoir.org mailing list
> is provided by the Association of Internet Researchers http://aoir.org
> Subscribe, change options or unsubscribe at:
> http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
>
> Join the Association of Internet Researchers:
> http://www.aoir.org/
>
--
Patrick Williams, Ph.D.
http://www.jpatrickwilliams.net
More information about the Air-L
mailing list