[wp-hackers] Convert WP site into static content
Roy Schestowitz
r at schestowitz.com
Wed Oct 4 04:20:41 GMT 2006
___/ On Tue 03 Oct 2006 22:23:33 BST, [ Alan J Castonguay ] wrote : \___
> Trevor Turk wrote:
>> I'm trying to convert an existing WP site I've got with many Pages (with
>> subpages, etc) into some kind of non-web format. I've seen some PDF
>> outputting plugins, but they don't seem to have a bulk one aside from this:
>> http://wp2pdf.sourceforge.net/. I'm wondering if anyone has used that?
I have used it happily for over a year, but I suspect that
it /might/ not be fully compatible with 2.0.x. It doesn't
appear to be maintained anymore (I tried to report some bugs
that I had fixed, to no avail). There are a few minor issues
like restoration from MySQL dumps, which may drop
wp2pdf-specific settings. Lastly, the nice thing about this
plugin/project is that you can assemble PDF's in large
batches using typical filters/pseudo-'wildcards' in
WordPress, e.g. produce a document that contains all posts
from September. This may suit you well.
>> The Staticize Reloaded plugin seems to be broken, and I haven't been able to
>> find anything else to fit this need.
>>
>> So, if anyone has had to dump a WP site into static non-web content, please
>> speak up if you have any tips!
>
> If you can produce PDFs of posts / pages via a plugin on demand, then
> wget or curl can easily handle the bulk processing.
You can get a list of links (full URL's) to all your posts
quite easily. I would personally use narchives.php to
achieve this although one SQL query might be enough. Then,
put all the URL's, newline-separated in a file.
use wget in the following way:
wget -q -erobots=off -i links_file.txt
Another approach (just a recursive crawl):
wget -q -r -nd -N -np -A.html -erobots=off http://example.org
Upload file to server at end. They should prevserve the
directory structure if you choose/set the right option (man
wget, plenty of stuff there). To avoid breaking old
(perma)links, careful attention and testing will be needed.
Hope it helps,
Roy
--
Roy S. Schestowitz, Ph.D. Candidate in Medical Biophysics
http://Schestowitz.com | GNU/Linux | PGP-Key: 0x74572E8E
http://othellomaster.com - GPL'd 3-D Othello
http://iuron.com - proposing a non-profit search engine
More information about the wp-hackers
mailing list