[Air-L] Chinese language social media data mining tools

C.H. chainsawtiney at gmail.com
Tue May 9 09:23:00 PDT 2017


I am not aware of any ready-to-use data mining tools. You will probably
need to develop the tool yourself.

For a project such as Weiboscope [1], we first need to gather data from
the API and then do the data analysis. The tricky part about the Chinese
language is that it is not space-delimited, so one cannot tokenize a
sentence into words by splitting on whitespace, as one can in English
(or other space-delimited languages such as French or German). This can
be partially solved using text segmenters such as the Stanford NLP
toolkits or Jieba.
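
To illustrate the problem, here is a minimal sketch in Python. It is a
toy dictionary-based "forward maximum matching" segmenter, which is a
much simpler technique than what Jieba or the Stanford tools use; the
word list, the function name and the example sentence are all made up
for illustration. For real data you would call an existing segmenter
rather than write your own.

```python
# Toy lexicon, assumed purely for illustration; real segmenters ship
# far larger dictionaries and add statistical models on top.
TOY_DICT = {"我", "喜欢", "社交", "媒体", "社交媒体", "数据", "分析"}


def forward_max_match(text, dictionary, max_len=4):
    """Greedy left-to-right segmentation.

    At each position, take the longest substring (up to max_len
    characters) found in the dictionary; fall back to one character.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


sentence = "我喜欢社交媒体数据分析"  # "I like social media data analysis"

# Whitespace splitting returns the whole sentence as a single token.
print(sentence.split())

# Dictionary matching recovers word boundaries (for this toy lexicon).
print(forward_max_match(sentence, TOY_DICT))
```

Greedy matching like this breaks down on ambiguous strings and on
words missing from the dictionary, which is one reason segmentation
only partially solves the problem.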

[1] Fu, Chan, Chau. Assessing Censorship on Microblogs in China.
https://hub.hku.hk/bitstream/10722/183851/1/content.pdf?accept=1

On Tue, May 9, 2017 at 11:58 PM, Helen Kennedy
<h.kennedy at sheffield.ac.uk> wrote:
> Hello clever AOIR folks
>
> Asking for postgrad students: any recommendations of social media data
> mining tools that work on Chinese social media platforms / with Chinese
> languages?
>
> Thanks!
>
> Helen
>
>
> --
> Professor Helen Kennedy, Chair in Digital Society
> Department of Sociological Studies / Faculty of Social Sciences
> Elmfield, Northumberland Road
> Sheffield S10 2TU
> T: 0114 2226488
> E: h.kennedy at sheffield.ac.uk
>
> LATEST ARTICLE: 'The Feeling of Numbers: emotions in everyday engagements
> with data and their visualisation
> <http://journals.sagepub.com/doi/abs/10.1177/0038038516674675?journalCode=soca>',
> *Sociology*, 2017.
> _______________________________________________
> The Air-L at listserv.aoir.org mailing list
> is provided by the Association of Internet Researchers http://aoir.org
> Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
>
> Join the Association of Internet Researchers:
> http://www.aoir.org/
