Formatting Guidelines

This is version 2.0 of the Formatting Guidelines issued May 2012.

Formatting Guidelines refers to a document that contains all the default instructions and standards for formatting (such as markup for italics, illustrations, footnotes, and poetry) in rounds F1 and F2. These standards apply to all projects, unless specifically overridden by instructions from the Project Manager in the Project Comments or Project Discussion.

For complete clarity, the PM may make any exceptions to these Guidelines in the Project Comments, and the PPer can make any changes deemed appropriate in the PP stage. The only principle for the PPer to follow is consistency--either make the same consistent change throughout the book, or consistently stick to the original treatment of the author.

IMPORTANT: this is a reference document--beginning formatters do not need to memorize the entire document, but everyone should consult it when problems or questions arise. If the matter is still not clear, raise an inquiry in the Documentation Project forum.

Please note: In the examples where there are multiple blank lines between text, gray bars have been inserted--one bar per one blank line. This is a visual aid only for clarification of the number of blank lines between various parts of the text.[September 2012]

The Primary Rule

"Don't change what the author wrote!"

The final electronic book seen by a reader, possibly many years in the future, should accurately convey the intent of the author. If the author spelled words oddly, we leave them spelled that way. If the author wrote outrageous racist or biased statements, we leave them that way. If the author puts italics, bold text or a footnote every third word, we mark them italicized, bolded or footnoted. We are proofreaders, not editors. (See Printer Errors/Misspellings for proper handling of obvious misprints.)

We do change minor typographical conventions that don't affect the sense of what the author wrote. For example, we rejoin words that were broken at the end of a line. Changes such as these help us produce a consistently proofed version of the book. The proofreading rules we follow are designed to achieve this result. Please carefully read the rest of the Formatting Guidelines with this concept in mind. There is a separate set of Proofreading Guidelines. These guidelines are intended for formatting only. The proofreaders matched the image's content, and now as a formatter you match the image's look.

To assist the next formatter and the Post-Processor, we also preserve Line Breaks. This allows them to compare the lines in the text easily to the lines in the image.

Each Page is a Separate Unit

Since each project is distributed among many formatters, each of whom is working on different pages, there is no guarantee that you will see the next page of the project. With this in mind, be sure to open and close all markup tags on each page. This will make it easier for the post-processor to combine all these pages into one e-book eventually.

Project Comments

When you select a project for formatting, the Project Page is loaded. On this page, there is a section called "Project Comments" containing information specific to that project (book). Read these before you start formatting pages! If the Project Manager wants you to do something in this book differently from the way specified in these Guidelines, that will be noted here. Instructions in the Project Comments override the rules in these Guidelines, so follow them. There may also be instructions in the project comments that apply to the formatting phase, which do not apply during proofing. Finally, this is also where the Project Manager may give you interesting tidbits of information about the author or the project.

Please also read the Project Thread (Forum). The Project Manager may clarify project-specific guidelines here, and it is often used by formatters to alert other volunteers to recurring issues within the project and how they can best be addressed. (See the next section).

On the Project Page, the link 'Images, Pages Proofread, & Differences' allows you to see how other volunteers have made changes.

[2011] If you need to leave a note to attract the attention of the PPer, enclose it in square brackets and use **. So the note would appear as [**note to PPer].

Forum/Discuss This Project

On the Project Page where you start formatting pages, on the line "Forum", there is a link titled "Discuss this Project" (if the discussion has already started), or "Start a discussion on this Project" (if it hasn't). Clicking on that link will take you to a thread in the projects forum dedicated to this specific project. That is the place to ask questions about this book, inform the Project Manager about problems, etc. Using this project forum thread is the recommended way to communicate with the Project Manager and other volunteers who are working on this book.

Fixing Errors on Previous Pages

The Project Page contains links to pages from this project that you have recently worked on. (If you haven't formatted any pages yet, no links will be shown.)

Pages listed under either "DONE" or "IN PROGRESS" are available to make corrections or to finish formatting. Just click on the link to the page. Thus, if you discover that you made a mistake on a page or marked something incorrectly, you can click on that page here and reopen it to fix the error.

You may also use the "Images, Pages Proofread, & Differences" or "Just My Pages" links on the Project Page. These pages will display an "Edit" link next to the pages you have worked on in the current round that can still be corrected.

Formatting at the Character Level

Placement of Inline Formatting Markup

Inline formatting refers to markup such as <i> </i>, <b> </b>, <sc> </sc>, <f> </f>, or <g> </g>. Place punctuation outside the tags unless the markup is around an entire sentence or paragraph, or the punctuation is itself part of the phrase, title, or abbreviation that you are marking. If the formatting goes on for multiple paragraphs, put the markup around each paragraph.

If the markup extends over a page break, mark it as in this example of italic tags that "straddle" a page-end: On the first page The <i>pontifex maxi</i>-* and on the following page *<i>mus</i> had recorded.

The periods that mark an abbreviated word in the title of a journal such as Phil. Trans. are part of the title, so they are included within the tags, thus: <i>Phil. Trans.</i>.

Similarly with periods that follow speakers' names in Drama--they should be included inside the markup.

Many typefaces found in older books used the same design for numbers in both regular text and italics or bold. For dates and similar phrases, format the entire phrase with one set of markup, rather than marking the words as italics (or bold) and not the numbers.

If there is a series/list of words or phrases (such as names, titles, etc.), mark each item of the list individually.

In poetry, mark each line of the poem separately if the formatting goes on for multiple lines. See the Tables section for handling markup in tables.

Examples:

 Original Image:                                               Correctly Proofread Text:
 Enacted 4 July, 1776                      <i>Enacted 4 July, 1776</i>
 It cost 9l. 4s. 1d.                       It cost 9<i>l.</i> 4<i>s.</i> 1<i>d.</i>
 God knows what she saw in me! I spoke  <b>God knows what she saw in me!</b> I spoke 
 in such an affected manner.                  in such an affected manner.
 As in many other of these Studies, and    As in many other of these <i>Studies</i>, and
 (Psychological Review, 1898, p. 160)     (<i>Psychological Review</i>, 1898, p. 160)
 L. Robinson, art. "Ticklishness,"         L. Robinson, art. "<sc>Ticklishness</sc>,"
                  December 3, morning.     /*
                  1323 Picadilly Circus    <i>December 3, morning.</i>
                                           1323 Picadilly Circus
                                           */
 Volunteers may be tickled pink to read    Volunteers may be tickled pink to read
 Ticklishness, Tickling and Laughter,      <i>Ticklishness</i>, <i>Tickling and Laughter</i>,
 Remarks on Tickling and Laughter          <i>Remarks on Tickling and Laughter</i>
 and Ticklishness, Laughter and Humour.    and <i>Ticklishness, Laughter and Humour</i>.
That's the idea!” exclaimed Tacks.       "<i>That's the idea!</i>" exclaimed Tacks.
 The professor set the reading assignment   The professor set the reading assignment
 for  E r l e b n i s  G e s c h i c h t e  for <g>Erlebnis Geschichte
 D e u t s c h l a n d                      Deutschland
 s e i t  1 8 4 5.                          seit 1845</g>.

Italics

Format italicized text with <i> inserted at the start and </i> inserted at the end of the italics. (Note the "/" in the closing tag.)

See also Placement of Inline Formatting Markup

Bold Text

Format bold text (text printed in a heavier typeface) with <b> inserted before the bold text and </b> after it. (note the "/" in the closing tag.)

See also Placement of Inline Formatting Markup and Chapter Headings.

Underlined Text

Format underlined text as Italics with <i> and </i>. (Note the "/" in the closing tag.) Underlining was often used to indicate emphasis when the typesetter was unable to actually italicize the text, for example in a typewritten document.

See also Placement of Inline Formatting Markup

Some Project Managers may specify in the Project Comments that underlined text be marked up with the <u> and </u> tags.

Spaced Out Text (gesperrt)

Format s p a c e d o u t text with <g> inserted before the text and </g> after it. (Note the "/" in the closing tag.) Remove the extra spaces between letters in each word. This was a typesetting technique used for emphasis in some older books, especially in German.

See also Placement of Inline Formatting Markup and Chapter Headings.

Font Changes

Some Project Managers may request that you mark a change of font within a paragraph or line of normal text by inserting <f> before the change in font and </f> after it. (Note the "/" in the closing tag.) This markup may be used to identify a special font or other formatting that does not already have its own markup (such as italics and bold).

Possible uses of this markup include:

The particular use or uses of this markup in a project will usually be spelled out in the Project Comments. Formatters should post in the Project Discussion if the markup appears to be needed and has not yet been requested.

See also Placement of Inline Formatting Markup.

Words in Small Capitals

The formatting is different for Mixed Case Small Caps and all small caps:

Format words that are printed in Mixed Small Caps as Mixed Upper and Lowercase. Format words that are printed in all small caps as ALL-CAPS. For both mixed case and all small caps, surround the text with <sc> and </sc> markup.

Headings ( Chapter Headings, Section Headings, Captions, etc.) may appear to be in all small caps, but this is usually the result of a change in font size and should not be marked as small caps. The first word or two of a new chapter may be printed in small caps or all caps but those are treated as ordinary text unless the project comments say otherwise. .

See also Placement of Inline Formatting Markup.

 Original Image:                                               Correctly Proofread Text:
 This is Small Caps                                          <sc>This is Small Caps</sc>
 You cannot be serious about aardvarks!                      You cannot be serious about <sc>AARDVARKS</sc>!

Words in All Capitals

Format words that are printed in all capital letters as all capital letters.

The exception to this is the first word of a chapter: many old books typeset the first word of these in all caps; this should be changed to upper and lower case, so "ONCE upon a time," becomes "Once upon a time,"


Font Size Changes

Normally we do not do anything to mark changes in font size. The exceptions to this are when it indicates a Block quotations or when the font size changes within a single paragraph or line of text (see Font Changes).

Extra Spaces or Tabs Between Words

Extra spaces between words are common in OCR output. You generally don't need to bother removing these—that can be done automatically during post-processing. However, extra spaces around punctuation, em-dashes, quote marks, etc. do need to be removed when they separate the symbol from the word. In addition, within the /* */ markup that preserves spacing, be sure to remove any extra spaces since they will not be automatically removed later on.

Finally, if you find any tab characters in the text you should remove them.


Superscripts

Older books often abbreviated words as contractions, and printed them as superscripts. Format these by inserting an up-arrow (^) followed by the superscripted text. Surround the superscripted text with curly braces { and } as well. For example:

 Original Image:
 Genrl Washington defeated Ld Cornwall's army.
 Correctly Formatted Text:
 Gen^{rl} Washington defeated L^{d} Cornwall's army.

In scientific & technical works, format superscripted characters with curly braces { and } surrounding them even if there is only one character superscripted. For example:

 Original Image:
 ... up to xn elements in the array.
 Correctly Formatted Text:
 "... up to x^{n} elements in the array.

If the superscript represents a footnote marker, then see the Footnotes/Endnotes section instead.

The Project Manager may specify in the Project Comments that superscripted text be marked differently.

Subscripts

Subscripted text is often found in scientific works, but is not common in other material. Format subscripted text by inserting an underline character _ and surrounding the text with curly braces { and }. For example:

 Original Image:
 H2O.
 Correctly Formatted Text:
 H_{2}O.

Page References ("See Page 123")

Format page number references within the text such as (see p. 123) as they appear in the image.

Check the Project Comments to see if the Project Manager has special requirements for page references.


Formatting at the Paragraph Level

Chapter Headings

Format chapter headings as they appear in the image. A chapter heading may start a bit farther down the page than the page header and won't have a page number on the same line. Chapter Headings are often printed all caps; if so, keep them as all caps. Mark any italics or mixed case small caps that appear in the image.

Put 4 blank lines before the "CHAPTER XXX". Include these blank lines even if the chapter starts on a new page; there are no 'pages' in an e-book, so the blank lines are needed. Then separate with a blank line each additional part of the chapter heading, such as a chapter description, opening quote, etc., and finally leave two blank lines before the start of the text of the chapter.

If the title of a Story or Chapter is presented first on a page by itself, then again at the head of the Story or Chapter, both instances should be preceded by 4 blank lines.

Old books often printed the first word or two of every chapter in all caps or small caps; change these to upper and lower case (first letter only capitalized).

While chapter headings may appear to be bold or spaced out, these are usually the result of font or font size changes and should not be marked. The extra blank lines separate the heading, so do not mark the font change as well. See the first example below.


Original Image:

Chapter Heading Example


Correctly Formatted Text:
this is a blank line
this is a blank line
this is a blank line
this is a blank line
GREEN FANCY
this is a blank line
this is a blank line
this is a blank line
this is a blank line
CHAPTER I

THE FIRST WAYFARER AND THE SECOND WAYFARER
MEET AND PART ON THE HIGHWAY

this is a blank line
this is a blank line
A solitary figure trudged along the narrow
road that wound its serpentinous way
through the dismal, forbidding depths of
the forest: a man who, though weary and footsore,
lagged not in his swift, resolute advance. Night
was coming on, and with it the no uncertain prospects
of storm. Through the foliage that overhung
the wretched road, his ever-lifting and apprehensive
eye caught sight of the thunder-black, low-lying
clouds that swept over the mountain and bore
down upon the green, whistling tops of the trees.


At a cross-road below he had encountered a small
girl driving homeward the cows. She was afraid
of the big, strange man with the bundle on his back
and the stout walking stick in his hand: to her a
remarkable creature who wore "knee pants" and
stockings like a boy on Sunday, and hob-nail shoes,
and a funny coat with "pleats" and a belt, and a
green hat with a feather sticking up from the band.


Original Image:

Chapter Heading Example 2

Chapter Heading Example 3

Correctly Formatted Text:

/#
In the United States?[A] In a railroad? In a mining company?
In a bank? In a church? In a college?


Write a list of all the corporations that you know or have
ever heard of, grouping them under the heads <i>public</i> and <i>private</i>.


How could a pastor collect his salary if the church should
refuse to pay it?


Could a bank buy a piece of ground "on speculation?" To
build its banking-house on? Could a county lend money if it
had a surplus? State the general powers of a corporation.
Some of the special powers of a bank. Of a city.


A portion of a man's farm is taken for a highway, and he is
paid damages; to whom does said land belong? The road intersects
the farm, and crossing the road is a brook containing
trout, which have been put there and cared for by the farmer;
may a boy sit on the public bridge and catch trout from that
brook? If the road should be abandoned or lifted, to whom
would the use of the land go?
#/

this is a blank line
this is a blank line
this is a blank line
this is a blank line
CHAPTER XXXV.

<sc>Commercial Paper.</sc>
this is a blank line
this is a blank line
<b>Kinds and Uses.</b>--If a man wishes to buy some commodity
from another but has not the money to pay for
it, he may secure what he wants by giving his written
promise to pay at some future time. This written
promise, or <i>note</i>, the seller prefers to an oral promise
for several reasons, only two of which need be mentioned
here: first, because it is <i>prima facie</i> evidence of
the debt; and, second, because it may be more easily
transferred or handed over to some one else.


If J. M. Johnson, of Saint Paul, owes C. M. Jones,
of Chicago, a hundred dollars, and Nelson Blake, of
Chicago, owes J. M. Johnson a hundred dollars, it is
plain that the risk, expense, time and trouble of sending
the money to and from Chicago may be avoided,


[Footnote A: The United States: "Its charter, the constitution. * * * Its flag the
symbol of its power; its seal, of its authority."--Dole.]

Section Headings

Some books have sections within chapters. Format these headings as they appear in the image. Leave 2 blanks lines before the heading and one after, unless the Project Manager has requested otherwise. If you are not sure if a heading indicates a chapter or a section, post a question in the Project Discussion, noting the page number.

Mark any italics or mixed case small caps that appear in the image. While section headings may appear to be bold or spaced out, these are usually the result of font or font size changes and should not be marked. The extra blank lines separate the heading, so do not mark the font change as well.

Original Image:

Section Heading Example 1

Correctly Formatted Text:

and numerous, found in collections of well-authenticated
specimens. The suggested caution implied
is not unnecessary, for the periods overlap, and there
is but little to show when such things as lamps and
lanterns were actually made.

this is a blank line
this is a blank line
RUSHLIGHTS AND HOLDERS.

In tracing the development of lighting from quite
homely beginnings, rushlights, prepared by the
cottager and the farm hand for the winter supply,
seem to come first on the list. Rushlights, however,

Other Major Divisions in Texts

Major Divisions in the text such as Preface, Foreword, Table of Contents, Introduction, Prologue, Epilogue, Appendix, References, Conclusion, Glossary, Summary, Acknowledgements, Bibliography, etc., should be formatted in the same way as Chapter Headings, i.e. 4 blank lines before the heading and 2 blank lines before the start of the text.


Paragraph Spacing/Indenting

Put a blank line before the start of a paragraph, even if it starts at the top of a page. You should not indent the start of the paragraph, but if it is already indented don't bother removing those spaces—that can be done automatically during post-processing.

See the Chapter Headings image/text for an example.


Extra Spacing/Stars/Line Between Paragraphs

In the image, most paragraphs start on the line immediately after the end of the previous one. Sometimes two paragraphs are separated to indicate a "thought break." A thought break may take the form of a line of stars, hyphens, or some other character, a plain or floridly decorated horizontal line, a simple decoration, or even just an extra blank line or two.

A thought break may represent a change of scene or subject, a lapse in time, or a bit of suspense. This is intended by the author, so we preserve it by putting a blank line, <tb>, and then another blank line.

Sometimes printers used decorative lines to mark the ends of chapters or sections. These are not thought breaks so they should not be marked with <tb>.

Please check the Project Comments as the Project Manager may request that additional information be retained in the thought break markup, such as <tb stars> for a row of stars.


Original Image:

Thought Breaks Example


Correctly Formatted Text:

like the gentleman with the spiritual hydrophobia
in the latter end of Uncle Tom's Cabin.
Unconsciously Mr. Dixon has done his best to
prove that Legree was not a fictitious character.


<tb>

Joel Chandler Harris, Harry Stillwell Edwards,
George W. Cable, Thomas Nelson Page,
James Lane Allen, and Mark Twain are Southern
men in Mr. Griffith's class. I recommend


Illustrations

Text for an illustration should be surrounded by an illustration tag [Illustration: and ], with the caption text placed in between. Format the caption text as it is printed, preserving the line breaks, italics, etc. Treat lines such as "See Page 66" as part of the caption.

If an illustration has no caption, add a tag [Illustration]. (Be sure to remove the colon and space before the ] in this case.)

If the illustration is in the middle of or at the side of a paragraph, move the illustration tag to before or after the paragraph and leave a blank line to separate them. Rejoin the paragraph by removing any blank lines left by doing so.

If there is no paragraph break on the page, mark the illustration tag with an * like so *[Illustration: (text of caption)], move it to the top of the page, and leave a blank line after it.

If there is just an illustration on the page, add a tag [Illustration] or [Illustration: text] and put a blank line before the tag. Don't put an asterisk before the tag.


Original Image:

Illustrations Example 1


Correctly Formatted Text:

[Illustration: Martha told him that he had always been her ideal and
that she worshipped him.

/*
<i>Frontispiece</i>
<i>Her Weight in Gold</i>
*/
]


Original Image:

Illustrations Example 2


Correctly Formatted Text:

such study are due to Italians. Several of these instruments
have already been described in this journal, and on the present
occasion we shall make known a few others that will
serve to give an idea of the methods employed.

[Illustration: <sc>Fig. 1.</sc>--APPARATUS FOR THE STUDY OF HORIZONTAL
SEISMIC MOVEMENTS.]


For the observation of the vertical and horizontal motions
of the ground, different apparatus are required. The

Footnotes/Endnotes

Format footnotes by leaving the text of the footnote at the bottom of the page and placing a tag where it is referenced in the text. This means:

1. In the main text, the character that marks a footnote location should be surrounded with square brackets ([ and ]) and placed right next to the word being footnoted[1] or its punctuation mark,[2] as shown in the image and the two examples in this sentence. Footnote markers may be numbers, letters, or symbols. When footnotes are marked with a symbol or a series of symbols (*, †, ‡, §, etc.) we replace these with Capital letters in order (A, B, C, etc.).

2. At the bottom of the page, a footnote should be surrounded by a footnote tag [Footnote #: and ], with the footnote text placed in between and the footnote number or letter placed where the # is shown in the tag. Format the footnote text as it is printed, preserving the line breaks, italics, etc. Be sure to use the same tag in the footnote as you used in the text where the footnote was referenced. Place each footnote on a separate line in order of appearance. Separate each footnote with a blank line if there is more than one.

If a footnote is incomplete at the end of the page, leave it at the bottom of the page and just put an asterisk * where the footnote ends, like this: [Footnote 1: (text of footnote)]*. The * will bring it to the attention of the post-processor, who will eventually join the parts of the footnote together.

If a footnote started on a previous page, leave it at the bottom of the page and surround it with *[Footnote: (text of footnote)](without any footnote number or marker). The * will bring it to the attention of the post-processor, who will eventually join the parts of the footnote together.

If a continued footnote ends or starts on a hyphenated word, mark both the footnote and the word with *, thus: [Footnote 1: This footnote is continued and the last word in it is also con-*]* for the leading fragment, and *[Footnote: *tinued onto the next page.].

In some books, footnotes are separated from the main text by a horizontal line. We don't keep this so please just leave a blank line between the main text and the footnotes.

Endnotes are just footnotes that have been located together at the end of a chapter or at the end of the book, instead of on the bottom of each page. These are formatted in the same manner as footnotes. Where you find an endnote reference in the text, just surround it with [ and ]. If you are formatting one of the pages with endnotes, surround the text of each note with [Footnote #: (text of endnote)], with the endnote text placed in between, and the endnote number or letter placed where the # is. Put a blank line after each endnote so that they remain separate paragraphs when the text is rewrapped during post-processing.

Footnotes in Tables should remain where they are in the original image.


Original Image:

Footnotes Example 1


Correctly Formatted Text:

The principal persons involved in this argument were Caesar[A], former military
leader and Imperator, and the orator Cicero[B]. Both were of the aristocratic
(Patrician) class, and were quite wealthy.


[Footnote A: Gaius Julius Caesar.]

[Footnote B: Marcus Tullius Cicero.]


Original Footnoted Poetry:

Illustrations Example 2


Correctly Formatted Text:

/*
Mary had a little lamb[1]
  Whose fleece was white as snow
And everywhere that Mary went
  The lamb was sure to go!
*/


[Footnote 1: This lamb was obviously of the Hampshire breed,
well known for the pure whiteness of their wool.]


Paragraph Side-Descriptions (Sidenotes)

Some books will have short descriptions of the paragraph along the side of the text. These are called sidenotes. Move sidenotes to just above the paragraph that they belong to. A sidenote should be surrounded by a sidenote tag [Sidenote: and ], with the text of the sidenote placed in between. Format the sidenote text as it is printed, preserving the line breaks, italics, etc. (while handling end-of-line hyphenation and dashes normally). Leave a blank line after the sidenote so that it does not get merged into the paragraph when the text is rewrapped during post-processing.

If there are multiple sidenotes for a single paragraph, put them one after another at the start of the paragraph. Leave a blank line separating each of them.

If the paragraph began on a previous page, put the sidenote at the top of the page and mark it with * so that the post-processor can see that it belongs on the previous page, like this: *[Sidenote: (text of sidenote)]. The post-processor will move it to the appropriate place.

Sometimes a Project Manager will request that you put sidenotes next to the sentence they apply to, rather than at the top or bottom of the paragraph. In this case, don't separate them out with blank lines.


Original Image:

Sidenotes Example 1


Correctly Formatted Text:

*[Sidenote: Burning
discs
thrown into
the air.]

that such as looked at the fire holding a bit of larkspur
before their face would be troubled by no malady of the
eyes throughout the year.[1] Further, it was customary at
Würzburg, in the sixteenth century, for the bishop's followers
to throw burning discs of wood into the air from a mountain
which overhangs the town. The discs were discharged by
means of flexible rods, and in their flight through the darkness
presented the appearance of fiery dragons.[2]

[Sidenote: The Midsummer
fires in
Swabia.]

[Sidenote: Omens
drawn from
the leaps
over the
fires.]

[Sidenote: Burning
wheels
rolled
down hill.]

In the valley of the Lech, which divides Upper Bavaria
from Swabia, the midsummer customs and beliefs are, or
used to be, very similar. Bonfires are kindled on the
mountains on Midsummer Day; and besides the bonfire
a tall beam, thickly wrapt in straw and surmounted by a
cross-piece, is burned in many places. Round this cross as
it burns the lads dance with loud shouts; and when the
flames have subsided, the young people leap over the fire in
pairs, a young man and a young woman together. If they
escape unsmirched, the man will not suffer from fever, and
the girl will not become a mother within the year. Further,
it is believed that the flax will grow that year as high as
they leap over the fire; and that if a charred billet be taken
from the fire and stuck in a flax-field it will promote the
growth of the flax.[3] Similarly in Swabia, lads and lasses,
hand in hand, leap over the midsummer bonfire, praying
that the hemp may grow three ells high, and they set fire
to wheels of straw and send them rolling down the hill.
Among the places where burning wheels were thus bowled
down hill at Midsummer were the Hohenstaufen mountains
in Wurtemberg and the Frauenberg near Gerhausen.[4]
At Deffingen, in Swabia, as the people sprang over the mid-*

[Footnote 1: <i>Op. cit.</i> iv. 1. p. 242. We have
seen (p. 163) that in the sixteenth
century these customs and beliefs were
common in Germany. It is also a
German superstition that a house which
contains a brand from the midsummer
bonfire will not be struck by lightning
(J. W. Wolf, <i>Beiträge zur deutschen
Mythologie</i>, i. p. 217, § 185).]

[Footnote 2: J. Boemus, <i>Mores, leges et ritus
omnium gentium</i> (Lyons, 1541), p.
226.]

[Footnote 3: Karl Freiherr von Leoprechting,
<i>Aus dem Lechrain</i> (Munich, 1855),
pp. 181 <i>sqq.</i>; W. Mannhardt, <i>Der
Baumkultus<i>, p. 510.]

[Footnote 4: A. Birlinger, <i>Volksthümliches aus</i>
<i>Schwaben</i> (Freiburg im Breisgau, 1861-1862),
ii. pp. 96 <i>sqq.</i>, § 128, pp. 103
<i>sq.</i>, § 129; <i>id.</i>, <i>Aus Schwaben</i> (Wiesbaden,
1874), ii. 116-120; E. Meier,
<i>Deutsche Sagen, Sitten und Gebräuche
aus Schwaben</i> (Stuttgart, 1852), pp.
423 <i>sqq.</i>; W. Mannhardt, <i>Der Baumkultus</i>,
p. 510.]

Placement of Out-of-Line Formatting Markup

Out-of-line formatting refers to the /# #/ and /* */ markup tags. The /# #/ "rewrap" markup indicates text that is printed differently, but can still be rewrapped during post-processing. The /* */ "no-wrap" markup indicates text that should not be rewrapped later on during post-processing—where the line breaks, indentation, and spacing need to be preserved.

On any page where you use an opening marker, be sure to include the closing markup tag as well. After the text is rewrapped during post-processing, each marker will be removed along with the entire line that it is on. Because of this, leave a blank line between the preceding regular text above the opening marker, and similarly leave a blank line between the closing marker and the regular text that follows.

When /* */ is nested within /# #/ tags, the blank line after the end */ tag is not needed.

[2011] /# #/ tags should not be nested inside /* */ markers, because they will be over-ridden.

See also Letters/Correspondence.

Block Quotations

Block quotations are blocks of text (typically several lines and sometimes several pages) that are distinguished from the surrounding text by wider margins, a smaller font size, different indentation, or other means. Surround block quotations with /# and #/ markers. See Placement of Out-of-Line Formatting Markup for details on this markup.

Apart from adding the markers, block quotations should be formatted as any other text.


Original Image:

Sidenotes Example 1


Correctly Formatted Text:

later day was welcomed in their home on the Hudson.
Dr. Bakewell's contribution was as follows:[24]


/#
The uncertainty as to the place of Audubon's birth has been
put to rest by the testimony of an eye witness in the person
of old Mandeville Marigny now dead some years. His repeated
statement to me was, that on his plantation at Mandeville,
Louisiana, on Lake Ponchartrain, Audubon's mother was
his guest; and while there gave birth to John James Audubon.
Marigny was present at the time, and from his own lips, I have,
as already said, repeatedly heard him assert the above fact.
He was ever proud to bear this testimony of his protection
given to Audubon's mother, and his ability to bear witness as
to the place of Audubon's birth, thus establishing the fact that
he was a Louisianian by birth.
#/

We do not doubt the candor and sincerity of the
excellent Dr. Bakewell, but are bound to say that the
incidents as related above betray a striking lapse of

Lists of Items

Surround lists with /* and */ markers. See Placement of Out-of-Line Formatting Markup for details on this markup.


Original Image:

Sidenotes Example 1


Correctly Formatted Text:

/*
Andersen, Hans Christian
Bach, Johann Sebastian
Balboa, Vasco Nunez de
Bierce, Ambrose
Carroll, Lewis
Churchill, Winston
Columbus, Christopher
Curie, Marie
Daguerre, Louis J. M.
Darwin, Charles
Descartes, René
Earhart, Amelia
Einstein, Albert
Freud, Sigmund
Lewis, Sinclair
Magellan, Ferdinand
Melville, Herman
Newton, Isaac
Pasteur, Louis
Poe, Edgar Allan
Ponce de Leon, Juan
Pulitzer, Joseph
Shakespeare, William
Tesla, Nikola
*/

Tables

Surround tables with /* and */ markers. See Placement of Out-of-Line Formatting Markup for details on this markup. Format the table with spaces (not tabs) to look approximately like the original table. Don't make the table wider than 75 characters. Project Gutenberg's guidelines go on to say "...except where it can't be helped. Never, ever longer than 80...".

Do not use tabs for formatting—use space characters only. Tab characters will line up differently between computers, and your careful formatting will not always display the same way.

If inline formatting (italics, bold, etc.) is needed in the table, mark up each table cell separately. When aligning the text, keep in mind that inline markup will appear differently in the final text version. For example, <i>italics markup</i> normally becomes _underscores_, and most other inline markup will be treated similarly. On the other hand, <sc>Small Caps Markup</sc> is removed completely.

It's often hard to format tables in plain text; just do your best. Be sure to use a mono-spaced font, such as DPCustomMono or Courier. Remember that the goal is to preserve the Author's meaning, while producing a readable table in an e-book. Sometimes this requires sacrificing the original format of the table on the printed page. Check the Project Comments and discussion thread because other volunteers may have settled on a specific format. If there is nothing there, you might find something useful in the Gallery of Table Layouts forum threads or wiki page at DP-US.[August 2012]

Footnotes in tables should remain where they are in the image. See footnotes for details.


Original Image:

Tables Example 1


Correctly Formatted Text:

Tables Example 1


Original Image:

Tables Example 3


Correctly Formatted Text:

Tables Example 4

Poetry/Epigrams

Mark poetry or epigrams with /* and */ so that the line breaks and spacing will be preserved. See Placement of Out-of-Line Formatting Markup for details on this markup. If In-Line Formatting occurs in poetry, such as <i> </i>, <b> </b>, <sc> </sc>, <f> </f>, or <g> </g>, mark each line of the poem separately if the formatting goes on for multiple lines.

Preserve the relative indentation of the individual lines of the poem or epigram by adding 2, 4, 6 (or more) spaces in front of the indented lines to make them resemble the image. If the entire poem is centered on the printed page, don't try to center the lines of poetry during formatting. Move the lines to the left margin, and preserve the relative indentation of the lines.

When a line of verse is too long for the printed page, many books wrap the continuation onto the next printed line and place a wide indentation in front of it. These continuation lines should be rejoined with the line above. Continuation lines usually start with a lower case letter. They will appear randomly unlike normal indentation, which occurs at regular intervals in the meter of the poem.

If a row of dots appears in a poem, this most likely represents a missing line(s); please preserve this markup.[July 09]

Footnotes in poetry should be treated the same as regular footnotes during formatting. Line Numbers in poetry should be kept.

Check the Project Comments for the specific project you are formatting. Books of poetry often have special instructions from the Project Manager. Many times, you won't have to follow all these formatting guidelines for a book that is mostly or entirely poetry.


Original Image:

Poetry Example 1


Correctly Formatted Text:

to the scenery of his own country:

/*
          Oh, to be in England
          Now that April's there,
      And whoever wakes in England
      Sees, some morning, unaware,
That the lowest boughs and the brushwood sheaf
Round the elm-tree bole are in tiny leaf,
While the chaffinch sings on the orchard bough
              In England--now!

And after April, when May follows,
And the whitethroat builds, and all the swallows!
Hark! where my blossomed pear-tree in the hedge
Leans to the field and scatters on the clover
Blossoms and dewdrops--at the bent spray's edge--
That's the wise thrush; he sings each song twice over,
Lest you should think he never could recapture
The first fine careless rapture!
And though the fields look rough with hoary dew,
All will be gay, when noontide wakes anew
The buttercups, the little children's dower;
--Far brighter than this gaudy melon-flower!
*/

So it runs; but it is only a momentary memory;
and he knew, when he had done it, and to his

Line Numbers

Line numbers are common in books of poetry, and usually appear near the margin every fifth or tenth line. Keep line numbers, placing them at least six spaces past the right hand end of the line, even if they are on the left side of the poetry/text in the original image. Since poetry will not be reformatted in the e-book version, the line numbers will be useful to readers.


Letters/Correspondence

Format letters and correspondence as you would paragraphs. Put a blank line before the start of the letter; do not duplicate any indenting.

Surround consecutive heading or footer lines (such as addresses, date blocks, salutations, or signatures) with /* and */ markers. See Placement of Out-of-Line Formatting Markup for details on this markup.

Don't indent the heading or footer lines, even if they are indented or right justified in the image—just put them at the left margin. The post-processor will format them as needed.

If the correspondence is printed differently than the main text, see Block Quotations.


Original Image:

Letters Example 1


Correctly Formatted Text:

<i>John James Audubon to Claude François Rozier</i>

[Letter No. 1, addressed]

/*
<sc>M. Fr. Rozier</sc>,
Merchant-Nantes.
<sc>New York</sc>, <i>10 January, 1807</i>.

<sc>Dear Sir</sc>:
*/

We have had the pleasure of receiving by the <i>Penelope</i> your
consignment of 20 pieces of linen cloth, for which we send our
thanks. As soon as we have sold them, we shall take great
pleasure in making our return.


Original Image:

Letters Example 2


Correctly Formatted Text:

/#
lack of memory which <i>baffles belief</i>, I have a certain
"uptaking" knack. My preachment will bore you, but you
will (if you read it) detect an <i>ensemble</i>; but, for goodness'
sake, <i>zitti</i>! They'll think, when they hear the P.R.A., that,
Lor' bless him! he'd known it all his life. Nevertheless,
enough for the day, &c. Best love to Gussey.--Affect. bro.,

/*
<sc>Fred.</sc>
*/
#/

I remember--when my husband and I were
sitting with him one afternoon after his return
home that autumn--his saying, "I feel distinctly I

Right-aligned Text

Surround lines of right-justified text with /* and */ markers. See Placement of Out-of-Line Formatting Markup for details on this markup.

Formatting at the Page Level:

Blank Page

Format as [Blank Page] if both the text and the image are blank. Don't put a blank line before or after.

If there is text in the formatting text area and a blank image, or if there is text in the image but none in the text box, follow the directions for a Bad Image or Bad Text.

Front/Back Title Page

Format all the text just as it was printed on the page, whether all capitals, upper and lower case, etc., including the years of publication or copyright.

Older books often show the first letter as a large ornate graphic—format this as just the letter.

And, of course, the text of the entire title page should be enclosed between /* and */, to prevent rewrap.


Original Image:

Title Example 1


Correctly Formatted Text:

/*
GREEN FANCY

BY
GEORGE BARR McCUTCHEON

AUTHOR OF "GRAUSTARK," "THE HOLLOW OF HER HAND,"
"THE PRINCE OF GRAUSTARK," ETC.

<i>WITH FRONTISPIECE BY
C. ALLAN GILBERT</i>

[Illustration]

NEW YORK
DODD, MEAD AND COMPANY
1917
*/

Table of Contents

Tables of contents usually require extensive formatting by PP. Therefore, please surround them with /* and */ markup (even though they are not strictly no-rewrap) and do only any required in-line markup needed.

See Placement of Out-of-Line Formatting Markup for details on this markup.

Page number references should be placed at least six spaces past the end of the text. Remove any periods or asterisks (leaders) used to align the page numbers.


Original Image:

Contents Example 1

Correctly Formatted Text:
this is a blank line
this is a blank line
this is a blank line
this is a blank line
CONTENTS
this is a blank line
this is a blank line
/*
CHAPTER      PAGE

I. <sc>The First Wayfarer and the Second Wayfarer
Meet and Part on the Highway</sc>      1

II. <sc>The First Wayfarer Lays His Pack Aside and
Falls in with Friends</sc>      15

III. <sc>Mr. Rushcroft Dissolves, Mr. Jones Intervenes,
and Two Men Ride Away</sc>      33

IV. <sc>An Extraordinary Chambermaid, a Midnight
Tragedy, and a Man Who Said "Thank You"</sc>      50

V. <sc>The Farm-boy Tells a Ghastly Story, and an
Irishman Enters</sc>      67

VI. <sc>Charity Begins Far from Home, and a Stroll in
the Wildwood Follows</sc>      85

VII. <sc>Spun-gold Hair, Blue Eyes, and Various Encounters</sc>      103

VIII. <sc>A Note, Some Fancies, and an Expedition in
Quest of Facts</sc>      120

IX. <sc>The First Wayfarer, the Second Wayfarer, and
the Spirit of Chivalry Ascendant</sc>      134

X. <sc>The Prisoner of Green Fancy, and the Lament of
Peter the Chauffeur</sc>      148

XI. <sc>Mr. Sprouse Abandons Literature at an Early
Hour in the Morning</sc>      167

XII. <sc>The First Wayfarer Accepts an Invitation, and
Mr. Dillingford Belabors a Proxy</sc>      183

XIII. <sc>The Second Wayfarer Receives Two Visitors at
Midnight</sc>      199

XIV. <sc>A Flight, a Stone-cutter's Shed, and a Voice
Outside</sc>      221
*/

Indexes

Surround the index with /* and */ tags. See Placement of Out-of-Line Formatting Markup for details on this markup. You don't need to align the numbers as they appear in the image; just put a comma followed by the page numbers.

Indexes are often printed in 2 columns; this narrower space can cause entries to split onto the next line. Rejoin these back onto a single line. This may create long lines, but they will be rewrapped to the proper width and indentation during post-processing.

Place one blank line before each entry in the index. For sub-topic listings in an index, start each one on a new line, indented 2 spaces.

Treat each new section in an index (A, B, C...) the same as a section heading by placing 2 blank lines before it.

Old books sometimes printed the first word of each section in the index in all caps or small caps; change this to match the style used for the rest of the index entries.

Please check the Project Comments as the Project Manager may request different formatting, such as treating the index like a Table of Contents instead.


Original Image:

Indexes Example 1


Correctly Formatted Text:


/*
Elizabeth I, her royal Majesty the Queen, 123, 144-155.
  birth of, 145.
  christening, 146-147.
  death and burial, 152.

Ethelred II, the Unready, 33.
*/


Original Image:

Indexes Example 2


Correctly Formatted Text:


/*
Hooker, Jos., maj. gen. U. S. V., 345;
  assigned to command Porter's corps, 350;
  afterwards, McDowell's, 367;
  in pursuit of Lee, 380;
  at South Mt., 382;
  unacceptable to Halleck, retires from active service, 390.

Hopkins, Henry H., 209;
  notorious secessionist in Kanawha valley, 217;
  controversy with Gen. Cox over escaped slave, 233.
this is a blank line
this is a blank line
J

James, Lewis M., 187;
  capt. on Gen. Wilson's staff, 194.
*/


Original Image:

Indexes Example 3


Correctly Formatted Text:


/*
Sales committee, 52

Sales manager, 30

Sales records, 120
  daily, 121
  monthly, 123
  salesmen's, 123

Shipping clerk, 184
  class rates, 186
  commodity rate file, 193
  commodity rates, 186
  freight tariffs, 188
  routing shipments, 194

Shipping department, 183-229
  back orders, 199
  checking shipments, 200
*/

Advertisements

Pages that include advertisements, or promotional material (usually about other books printed by the publisher) must be formatted, using the same standards as other pages. Though some PPers may prefer to represent these pages by illustration, to reproduce the exact appearance and fonts of the original, correctly formatted text of the contents of these pages will help to make this decision, and will be needed if the pages are done without an illustration.

Separate major sections within an extensive "ad" portion of the book by 2 blank lines. As usual, if in doubt, leave a [**note] for the PPer.

Plays: Actor Names/Stage Directions

For all plays:

    * Format cast listings (Dramatis Personæ) as lists.
    * Treat each new Act the same as a chapter heading by placing 4 blank lines before it and 2 after.
    * Treat each new Scene the same as a section heading by placing 2 blank lines before it.
    * In dialog, treat a change in speaker as a new paragraph, with one blank line between.
    * Format actor names as they are in the original image, whether they are italics, bold, or all capital letters.
    * Stage directions are formatted as they are in the original image, so if the stage direction is on a line by itself,
      format it that way; if it is at the end of a line of dialog, leave it there; if it is right-justified at the end of a line of
      dialog, leave at least six spaces between the dialog and the stage directions.
      Stage directions often begin with an opening bracket and omit the closing bracket. This convention is retained;
      do not close the brackets. Italics markup is generally placed inside the brackets.

For metrical plays (plays written as poetry):

    * Many plays are metrical, and like poetry should not be rewrapped. Surround metered text with /* and */ as
      for poetry. If stage directions are on their own line, do not surround these with /* and */. (Since stage
      directions are not metrical, and can be safely rewrapped in the PP stage, they should not be contained within
      the /* */ tags that protect the metrical dialog.)
    * Preserve relative indenting of dialog when a single metrical line is shared by more than one speaker.
    * Rejoin metrical lines that were split due to width restrictions of the paper, just as in poetry. If the continuation is
      only a word or so, it is often shown on the line above or below following a (, rather than having a line of its
      own. See the example.

Please check the Project Comments, as the Project Manager may specify different formatting.


Original Image:

Actor/Stage Directions Example 1


Correctly Formatted Text:

/*
Has not his name for nought, he will be trode upon:
What says my Printer now?

<i>Clow.</i> Here's your last Proof, Sir.
You shall have perfect Books now in a twinkling.

<i>Lap.</i> These marks are ugly.

<i>Clow.</i> He says, Sir, they're proper:
Blows should have marks, or else they are nothing worth.

<i>La.</i> But why a Peel-crow here?

<i>Clow.</i> I told 'em so Sir:
A scare-crow had been better.

<i>Lap.</i> How slave? look you, Sir,
Did not I say, this <i>Whirrit</i>, and this <i>Bob</i>,
Should be both <i>Pica Roman</i>.

<i>Clow.</i> So said I, Sir, both <i>Picked Romans</i>,
And he has made 'em <i>Welch</i> Bills,
Indeed I know not what to make on 'em.

<i>Lap.</i> Hay-day; a <i>Souse</i>, <i>Italica</i>?

<i>Clow.</i> Yes, that may hold, Sir,
<i>Souse</i> is a <i>bona roba</i>, so is <i>Flops</i> too.
*/


Original Image:

Actor/Stage Directions Example 2


Correctly Formatted Text:

/*
<sc>Clin.</sc> And do I hold thee, my Antiphila,
Thou only wish and comfort of my soul!

<sc>Syrus.</sc> In, in, for you have made our good man wait. (<i>Exeunt.</i>
*/
this is a blank line
this is a blank line
this is a blank line
this is a blank line
ACT THE THIRD.
this is a blank line
this is a blank line
<sc>Scene I.</sc>

/*
<sc>Chrem.</sc> 'Tis now just daybreak.--Why delay I then
To call my neighbor forth, and be the first
To tell him of his son's return?--The youth,
I understand, would fain not have it so.
But shall I, when I see this poor old man
Afflict himself so grievously, by silence
Rob him of such an unexpected joy,
When the discov'ry can not hurt the son?
No, I'll not do't; but far as in my pow'r
Assist the father. As my son, I see,
Ministers to th' occasions of his friend,
Associated in counsels, rank, and age,
So we old men should serve each other too.
*/
this is a blank line
this is a blank line
<sc>Scene II.</sc>

Enter <sc>Menedemus</sc>.

/*
<sc>Mene.</sc> (<i>to himself</i>). Sure I'm by nature form'd for misery
Beyond the rest of humankind, or else
'Tis a false saying, though a common one,
"That time assuages grief." For ev'ry day
My sorrow for the absence of my son
Grows on my mind: the longer he's away,
The more impatiently I wish to see him,
The more pine after him.

<sc>Chrem.</sc> But he's come forth. (<i>Seeing</i> <sc>Menedemus</sc>.)
Yonder he stands. I'll go and speak with him.
Good-morrow, neighbor! I have news for you;
Such news as you'll be overjoy'd to hear.
*/


Original Image:

Actor/Stage Directions Example 3


Correctly Formatted Text:

[<i>Hernda has come from the grove and moves up to his side</i>]

/*
<i>Her.</i> [<i>Adoringly</i>] And you the master!

<i>Hud.</i> Daughter, you owe my lord Megario
Some pretty thanks.                  [<i>Kisses her cheek</i>]

<i>Her.</i>         I give them, sir.
*/


Original Image:

Actor/Stage Directions Example 4


Correctly Formatted Text:

/*
<i>Am.</i> Sure you are fasting;
Or not slept well to night; some dream (<i>Ismena?</i>)

<i>Ism.</i> My dreams are like my thoughts, honest and innocent,
Yours are unhappy; who are these that coast us?
You told me the walk was private.
*/

Anything else that needs special handling or that you're unsure of

While formatting, if you encounter something that isn't covered in these guidelines that you think needs special handling or that you are not sure how to handle, post your question, noting the png (page) number, in the Project Discussion.

You should also put a note in the formatted text to explain to the next volunteer or post-processor what the problem or question is. Start your note with a square bracket and two asterisks [** and end it with another square bracket ]. This clearly separates it from the author's text and signals the post-processor to stop and carefully examine this part of the text and the matching image to address any issues. You may also want to identify which round you are working in just before the ] so that later volunteers know who left the note. Any comments put in by a previous volunteer must be left in place. See the next section for details.


Previous Volunteers' Notes/Comments

Any notes or comments put in by a previous volunteer must be left in place. You may add agreement or disagreement to the existing note but even if you know the answer, you absolutely must not remove the comment. If you have found a source which clarifies the problem, please cite it so the post-processor can also refer to it.

If you come across a note from a previous volunteer that you know the answer to, please take a moment and provide feedback to them by clicking on their name in the formatting interface and posting a private message to them explaining how to handle the situation in the future. Please, as already stated, do not remove the note.


Common Problems:

Bad Image

If an image is bad (not loading, mostly illegible, etc.), please post about this bad image in the project discussion.

Note that some page images are quite large, and it is common for your browser to have difficulty displaying them, especially if you have several windows open or are using an older computer. Try closing some of your windows and programs to see if that helps, or post in the project discussion to see if anyone else has the same problem.


Wrong Image for Text

If there is a wrong image for the text given, please post about this bad page in the project discussion.


Previous Proofreading or Formatting Mistakes

If a previous volunteer made a lot of mistakes or missed a lot of things, please take a moment and provide feedback to them by clicking on their name in the proofreading interface and posting a private message to them explaining how to handle the situation so that they will know how in the future.

Please be nice! Everyone here is a volunteer and presumably trying their best. The point of your feedback message should be to inform them of the correct way to format, rather than to criticize them. Give a specific example from their work showing what they did, and what they should have done.

If the previous volunteer did an outstanding job, you can also send them a message about that—especially if they were working on a particularly difficult page.


Printer Errors/Misspellings

Correct all of the words that the OCR has misread (scannos), but do not correct what may appear to you to be misspellings or printer errors that occur on the page image. Many of the older texts have words spelled differently from modern usage and we retain these older spellings, including any accented characters.

Place a note in the txet [**typo for text?] next to a printer's error. If you are unsure whether it is actually an error, please also ask in the project discussion. If you do make a change, include a note describing what you changed: [**typo "txet" fixed]. Include the two asterisks ** so the post-processor will notice it.


Factual Errors in Texts

Do not correct factual errors in the author's book. Many of the books we are preparing have statements of fact in them that we no longer accept as accurate. Leave them as the author wrote them. See Printer Errors/Misspellings for how to leave a note if you think the printed text is not what the author intended.