[Air-L] Help with Facebook Research
elw at stderr.org
Wed Oct 3 07:20:08 PDT 2007
> I am planning a survey of Facebook members at NJIT, where I am a PhD
> student. I would like to write a web crawl or similar program to
> identify through Facebook who is part of the NJIT network. I have seen
> other papers discuss this technique, but I need more specific details as
> to how to accomplish it. Any ideas?
The basic sketch of the technique is this:
1) identify a starting point [initial URL]
2a) programmatically collect all linked pages (in effect, use a regex that
matches "a href=")...
2b) ...that match criteria you specify
3) recurse
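
A minimal sketch of steps 1-3 in Python, for illustration only. The names
crawl, fetch, matches and max_pages are made up for this example; fetch is
whatever function you supply to return a page's HTML (or None), and matches
is the criteria test from 2b. It uses a queue instead of literal recursion
so a deep link chain cannot exhaust the call stack, and the regex is a crude
href matcher, not a real HTML parser.

```python
import re
from collections import deque
from urllib.parse import urljoin

# Crude matcher for <a ... href="..."> (step 2a); assumes quoted href values.
HREF_RE = re.compile(r'<a\s[^>]*?href\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE)


def crawl(start_url, fetch, matches, max_pages=100):
    """Return the URLs visited, in breadth-first order.

    fetch(url)   -> HTML string, or None if the page is unavailable
    matches(url) -> True if the link meets your criteria (step 2b)
    """
    seen = {start_url}           # URLs already queued, to avoid loops
    queue = deque([start_url])   # step 1: the starting point
    visited = []
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        visited.append(url)
        for href in HREF_RE.findall(html):
            link = urljoin(url, href)    # resolve relative links
            if link not in seen and matches(link):
                seen.add(link)
                queue.append(link)       # step 3: "recurse" via the queue
    return visited
```

For example, with an in-memory dict standing in for the web,
crawl("http://example.org/", pages.get, lambda u: u.startswith("http://example.org/"))
visits only the pages under example.org that are reachable from the start.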
However...
Web crawls of Facebook are against the Terms of Service of the site. You
might be able to work something out using the Facebook API, rather than by
crawling. It will take some work.
Your campus IRB is highly unlikely to approve a project that
explicitly violates the site's TOS; the TOS exists to give both Facebook
and the other users of the site some notion of how much privacy
they are likely to be giving up.
--elijah