Attractively formatting Project Gutenberg texts
|
![]() |
License
Downloading GutenMark
Installing GutenMark
Compiling GutenMark
Other Stuff You Might Want
Software
|
|
Current development source package (20021124) | GutenMark_source_dev.tar.gz |
20020722 Win32 base package | Zipfile |
20020714 Linux 'x86 base package | Tarball |
20020714 Linux PPC base package | Tarball |
20020714 FreeBSD base package | Tarball |
20020714 Mac OS X base package | Tarball |
20020714 source package | Tarball |
Obsolete versions | FTP |
The "wordlists" and "namelists" are optional files that you can download or not, as you choose. The wordlists are categorized as highly recommended, recommended , or available, based on my own admittedly subjective experience. Click here for an extended explanation of what wordlists do. If you want to download several (or all) wordlists, you might prefer to use an FTP client rather than your browser.
Wordlists and Namelists
|
|
|
My own special English wordlist | Nov. 24, 2002 | 1K |
U.S. namelist | Nov. 10, 2001 | 348K |
U.S. place names (NEW) | Dec. 18, 2001 | 144K |
French namelist | Nov. 11, 2001 | 7K |
English wordlist | Nov. 10, 2001 | 449K |
French wordlist | Nov. 17, 2001 | 373K |
German wordlist | Nov. 24, 2001 | 582K |
Older, smaller German wordlist | Nov. 11, 2001 | 209K |
Latin wordlist | Nov. 16, 2001 | 195K |
Italian wordlist | Nov. 11, 2001 | 383K |
Spanish wordlist | Nov. 11, 2001 | 322K |
Non-U.S. place names (NEW) | Dec. 22, 2001 | 5992K (Really really big!!) |
Norwegian wordlist | Nov. 16, 2001 | 2078K (Really big!) |
Gaelic wordlist | Nov. 11, 2001 | 298K |
Danish wordlist | Nov. 11, 2001 | 558K |
Swedish wordlist | Nov. 11, 2001 | 254K |
Finnish wordlist | Nov. 11, 2001 | 285K |
My own special non-English wordlist | Nov. 24, 2001 | 1K |
browse ... | all | FTP |
NOTE: In versions later than 20011113, support for Borland's free C++ compiler (see Borland's web site ) has been dropped, because it was just too much effort for me without knowing if anyone was interested. If for some reason you don't want to use mingw32, and if you figure out how to get other C compilers such as Borland's or Microsoft's to work, tell me ; I'll post the instructions here.
make GutenMark.exe(This assumes that the name of the GNU make program that you got with mingw32 is actually accessible by typing "make". If it instead calls up some other make program, such as Microsoft's or Borland's, then the software-build will not work properly.) In addition to compiling GutenMark.exe, this will attempt to test the compilation by running GutenMark.exe to produce sample HTML file (bldhb10.html) which it compares to an equivalent HTML file (bldhb10.txt.html) provided with the distribution.
In addition to compiling GutenMark, this will attempt to test the compilation by running GutenMark to produce sample HTML file (bldhb10.html) which it compares to an equivalent HTML file (bldhb10.txt.html) provided with the distribution.
If the HTML is all you want -- if you want to read the etext online, or to set up a web site that displays PG texts in HTML, or if you're fine with printing etexts from your browser, or if you want to use the HTML as a starting point for further markup -- then you're all set!
If, on the other hand, you don't want to use LaTeX and you are looking for an end-to-end solution that can produce attractive printable texts like this sample , then you need some better way of printing HTML than your browser can provide. You could, of course, load the HTML into Microsoft Word or some other word processing program, and manipulate the document format manually.
The solution I would choose instead is to use a utility program that can convert HTML to Postscript printer language, or to PDF format. Several such free utilities are available.
|
page |
|
file |
8.5"×5.5" 9pt New Century Schoolbook font | page9schoolbook.pdf | Nov. 18, 2001 | half9schoolbook.rc |
8.5"×5.5" 10pt Times Roman font | page10times.pdf | Nov. 17, 2001 | half10times.rc |
8.5"×5.5" 10pt Bookman font | page10bookman.pdf | Nov. 17, 2001 | half10bookman.rc |
8.5"×5.5" 12pt New Century Schoolbook font | page12schoolbook.pdf | Nov. 25, 2001 | half12schoolbook.rc |
htmldoc is available for either for Win32 or in source-code form (for Linux systems), and has some very nice properties. I personally find it a little buggy, but it's apparently under active development and can presumably only get better. The main problem is that it is very bad at right justification (or at least, I haven't figured it out), and so you need to use ragged-right text.