[Air-L] archiving entire blogs

Jarkko Moilanen Jarkko.Moilanen at uta.fi
Fri Jan 20 00:40:24 PST 2012


hi,

Quoting Stuart Shulman <stuart.shulman at gmail.com>:

> WORDPRESS has a feature for this:
>
> http://en.blog.wordpress.com/2006/06/12/xml-import-export/
>
> If it is a WORDPRESS blog, you can ask the owner to create a bulk export in
> XML.
>

If you are archiving blog that you don't have access to export  
functions, I would use 'wget'. It contains features to get everything,  
no matter how deep the structure is.

http://en.wikipedia.org/wiki/Wget

/Jarkko


> Better still is the new offering from GNIP:
>
> http://blog.gnip.com/gnip-and-automattic-make-whole-new-universe-of-data-available/
>
> The future is bright for getting big collections.
>
> ~Stu
>
> On Thu, Jan 19, 2012 at 9:31 PM, C Sosnowy <c_sosnowy at yahoo.com> wrote:
>
>> I would like to be able to archive an entire blog (and ideally be able to
>> download it) for analysis. I've looked at WebCite and Zotero but neither
>> seem to have this capability. Does anyone know of another way?
>>
>>
>> Collette Sosnowy
>> M.A., Ph.D. Candidate
>> Environmental Psychology Program
>> The Graduate Center of the City University of New York
>> _______________________________________________
>> The Air-L at listserv.aoir.org mailing list
>> is provided by the Association of Internet Researchers http://aoir.org
>> Subscribe, change options or unsubscribe at:
>> http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
>>
>> Join the Association of Internet Researchers:
>> http://www.aoir.org/
>>
>
>
>
> --
>
> Dr. Stuart W. Shulman
> people.umass.edu/stu
>
> Editor Emeritus, JITP
> jitp.net <http://www.jitp.net>
>
> Director, QDAP-UMass
> umass.edu/qdap <http://www.umass.edu/qdap>
>
> Founder and CEO, Texifter
> texifter.com <http://www.texifter.com>
>
> LinkedIn:  
> linkedin.com/pub/stuart-shulman/10/351/899<http://www.linkedin.com/pub/stuart-shulman/10/351/899>
> Twitter:  
> twitter.com/#!/StuartWShulman<http://twitter.com/#%21/StuartWShulman>
> _______________________________________________
> The Air-L at listserv.aoir.org mailing list
> is provided by the Association of Internet Researchers http://aoir.org
> Subscribe, change options or unsubscribe at:  
> http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
>
> Join the Association of Internet Researchers:
> http://www.aoir.org/
>



****************************
  Jarkko Moilanen (+358 45 8877 150)
  M.Soc.Sc. (Political Science)
  PhD Student, Information studies, University of Tampere
  Blog: http://blog.ossoil.com/
  -------------------------
  Founder of Hackerspace 5w, Finland, Tampere - http://5w.fi
  Founder of MeeGo Network Finland, http://meegonetwork.fi
  Founder of Open Coral - http://open-coral.org
  Founder of Finnish Biohacker community, http://biohakkeri.fi
****************************



More information about the Air-L mailing list