Standard Ebooks

Typography

Section titles and ordinals

  1. Section ordinals in the body text are set in Roman numerals.

  2. Section titles are titlecased according to the output of se titlecase. Section titles are not all-caps or small-caps.

  3. Section titles do not have trailing periods.

  4. Chapter titles omit the word Chapter, unless the word used is a stylistic choice for prose style purposes. Chapters with unique identifiers (i.e. not Chapter, but something unique to the style of the book, like Book or Stave) do include that unique identifier in the title, wrapped in <span epub:type="label">.

    <h2>Chapter <span epub:type="ordinal z3998:roman">II</span></h2>
    <h2 epub:type="ordinal z3998:roman">II</h2>
    <h2> <span epub:type="label">Stave</span> <span epub:type="ordinal z3998:roman">III</span> </h2>

    In special cases it may be desirable to retain Chapter for clarity. For example, Frankenstein has “Chapter” in titles to differentiate them from the “Letter” sections.

Italics

  1. Using both italics and quotes (outside of the context of quoted dialog) is usually not necessary. Either one or the other is used, with rare exceptions.

  2. Words and phrases that require emphasis are italicized with the <em> element.

    <p>“Perhaps <em>he</em> was there,” Raoul said, at last.</p>
  3. Strong emphasis, like shouting, may be set in small caps with the <strong> element.

    <p><strong>Can’t</strong> I?” screamed the unhappy creature to himself.</p>
  4. When a short phrase within a longer clause is italicized, trailing punctuation that may belong to the containing clause is not italicized.

    <p>“Look at <em>that!</em>” she shouted.</p>
    <p>“Look at <em>that</em>!” she shouted.</p>
  5. When an entire clause is italicized, trailing punctuation is italicized, unless that trailing punctuation is a comma at the end of dialog.

    <p><em>Charge!</em>” she shouted.</p>
    <p><em>But I want to</em>,” she said.</p>
  6. Words written to be read as sounds are italicized with <i>.

    <p>He could hear the dog barking: <i>Ruff, ruff, ruff!</i></p>
  7. A person's internal thoughts are italicized with unsemanticated <i>.

    <p>The thought flashed to me: <i>it’s a city you’re firing at, not a plane</i>, and I flinched.</p>

Italicizing individual letters

  1. Individual letters that used in context as a grapheme are italicized with an <i epub:type="z3998:grapheme"> element. They are typically lowercased and not followed by periods.

    <p>He often rolled his <i epub:type="z3998:grapheme">r</i>’s.</p>
    1. Plural graphemes are formed with ’s, to aid in clarity.

      <p>Her <i epub:type="z3998:grapheme">a</i>s were nasally.</p>
      <p>Her <i epub:type="z3998:grapheme">a</i>’s were nasally.</p>
  2. Individual letters that are not graphemes (for example letters that might be referring to names, the shapes of the letters themselves, musical notes or keys, or concepts) are not italicized.

    <p>...due to the loss of what is known in New England as the “L”: that long deep roofed adjunct usually built at right angles to the main house...</p>
    <p>She was learning her A.B.C.s.</p>
    <p>His trident had the shape of an E.</p>
    <p>The piece was in the key of C major.</p>
  3. The ordinal nth is set with an italicized n, without a hyphen.

    <p>The <i>n</i>th degree.</p>

Italicizing non-English words and phrases

  1. Non-English words and phrases that are not in Merriam-Webster are italicized, unless they are in a non-Roman script like Chinese or Japanese.

    <p>The <i xml:lang="fr">corps de ballet</i> was flung into consternation.</p>
  2. Non-English words that are proper names, or are in proper names, are not italicized, unless the name itself would be italicized according to the rules for italicizing or quoting names and titles. If words in the name might be mispronounced in English pronunciation, they are wrapped in a <span xml:lang="LANGUAGE"> element to assist screen readers with pronunciation. Most proper names of people or places do not require this, but occasionally there may be some that do.

    <p><i xml:lang="fr">Où est le métro?</i>” he asked, and she pointed to <i xml:lang="fr">Place de Clichy</i>, next to the <i xml:lang="fr">Le Bon Petit Déjeuner</i> restaurant.</p>
    <p><i xml:lang="fr">Où est le métro?</i>” he asked, and she pointed to <span xml:lang="fr">Place de Clichy</span>, next to the <span xml:lang="fr">Le Bon Petit Déjeuner</span> restaurant.
  3. If certain non-English words are used so frequently in the text that italicizing them at each instance would be distracting to the reader, then only the first instance is italicized. Subsequent instances are wrapped in a <span xml:lang="LANGUAGE"> element.

  4. Words and phrases that are originally non-English in origin, but that can now be found in Merriam-Webster, are not italicized.

    <p>Sir Percy’s bon mot had gone the round of the brilliant reception-rooms.</p>
  5. Inline-level italics are set using the <i> element with an xml:lang attribute corresponding to the correct IETF language tag.

  6. Block-level italics are set using an xml:lang attribute on the closest encompassing block element, with the style of font-style: italic.

    In this example, note the additional namespace declaration, and that we target descendants of the <body> element; otherwise, the entire <body> element would receive italics!

    @namespace xml "http://www.w3.org/XML/1998/namespace"; body [xml|lang]{ font-style: italic; }
    <blockquote epub:type="z3998:verse" xml:lang="la"> <p> <span>—gelidas leto scrutata medullas,</span> <br/> <span>Pulmonis rigidi stantes sine vulnere fibras</span> <br/> <span>Invenit, et vocem defuncto in corpore quaerit.</span> </p> </blockquote>
  7. Words that are in a non-English “alien” language (i.e. one that is made up, like in a science fiction or fantasy work) are italicized and given an IETF languate tag in a custom namespace. Custom namespaces begin consist of x-TAG, where TAG is a custom descriptor of 8 characters or less.

    <p><i xml:lang="x-arcturan">Dolm</i>,” said Haunte.</p>

Italicizing or quoting newly-used English words

  1. When introducing new terms, non-English or technical terms are italicized, but terms composed of common English are set in quotation marks.

    <p>English whalers have given this the name “ice blink.”</p> <p>The soil consisted of that igneous gravel called <i>tuff</i>.</p>
  2. English neologisms in works where a special vocabulary is a regular part of the narrative are not italicized. For example science fiction works may necessarily contain made-up English technology words, and those are not italicized.

Italics in names and titles

  1. Place names, like pubs, bars, or buildings, are not quoted.

  2. The names of publications, music, and art that can stand alone are italicized; additionally, the names of transport vessels are italicized. These include, but are not limited to:

    • Periodicals like magazines, newspapers, and journals.

    • Publications like books, novels, plays, and pamphlets, except “holy texts,” like the Bible or books within the Bible.

    • Long poems and ballads, like the Iliad, that are book-length.

    • Long musical compositions or audio, like operas, music albums, or radio shows.

    • Long visual art, like films or a TV show series.

    • Visual art, like paintings or sculptures.

    • Transport vessels, like ships.

  3. The names of short publications, music, or art, that cannot stand alone and are typically part of a larger collection or work, are quoted. These include, but are not limited to:

    • Short musical compositions or audio, like pop songs, arias, or an episode in a radio series.

    • Short prose like novellas, shot stories, or short (i.e. not epic) poems.

    • Chapter titles in a prose work.

    • Essays or individual articles in a newspaper or journal.

    • Short visual art, like short films or episodes in a TV series.

Examples

<p>He read “Candide” while having a pint at the “King’s Head.”</p>
<p>He read <i epub:type="se:name.publication.book">Candide</i> while having a pint at the King’s Head.</p>

Taxonomy

  1. Binomial names (generic, specific, and subspecific) are italicized with a <i> element having the z3998:taxonomy semantic inflection.

    <p>A bonobo monkey is <i epub:type="z3998:taxonomy">Pan paniscus</i>.</p>
  2. Family, order, class, phylum or division, and kingdom names are capitalized but not italicized.

    <p>A bonobo monkey is in the phylum Chordata, class Mammalia, order Primates.</p>
  3. If a taxonomic name is the same as the common name, it is not italicized.

  4. The second part of the binomial name follows the capitalization style of the source text. Modern usage requires lowercase, but older texts may set it in uppercase.

Exceptions

  1. Epigraphs, bridgeheads, and some other types of heading matter are set in italics by default. Text that in a Roman-set context would be italicized (like non-English words or phrases, or titles of books) are thus set in Roman in that heading matter, to contrast against the default italics. However, if due to this rule the entire block would be set in Roman instead of italics, thus lending the block an unexpected appearance, then the contrasting Roman is discarded and the default italics are preserved.

    [epub|type~="epigraph"]{ font-style: italic; /* ... */ } [epub|type~="epigraph"] i{ font-style: normal; } [epub|type~="epigraph"] cite{ margin-top: 1em; font-style: normal; font-variant: small-caps; } [epub|type~="epigraph"] cite i{ font-style: italic; }
    <blockquote epub:type="epigraph"> <p><i xml:lang="fr">En administration, toutes les sottises sont mères.</i></p> <cite><i epub:type="se:name.publication.book">Maximes</i>, <i xml:lang="la">fr</i> <abbr class="name">M. G.</abbr> De Levis.</cite> </blockquote>
    [epub|type~="epigraph"]{ font-style: italic; /* ... */ } [epub|type~="epigraph"] i{ font-style: normal; } [epub|type~="epigraph"] cite{ margin-top: 1em; font-style: normal; font-variant: small-caps; } [epub|type~="epigraph"] cite i{ font-style: italic; }
    <blockquote epub:type="epigraph"> <p xml:lang="fr">“En administration, toutes les sottises sont mères.”</p> <cite><i epub:type="se:name.publication.book">Maximes</i>, <i xml:lang="la">fr</i> <abbr class="name">M. G.</abbr> De Levis.</cite> </blockquote>

Capitalization

  1. In general, capitalization follows modern English style. Some very old works frequently capitalize nouns that today are no longer capitalized. These archaic capitalizations are removed, unless doing so would change the meaning of the work.

  2. Titlecasing, or the capitalization of titles, follows the formula used in the se titlecase tool.

  3. Text in all caps is almost never correct typography. Instead, such text is changed to the correct case and surround with a semantically-meaningful element like <em> (for emphasis), <strong> (for strong emphasis, like shouting) or <b> (for unsemantic formatting required by the text). <strong> and <b> are styled in small-caps by default in Standard Ebooks.

    <p>The sign read BOB’S RESTAURANT.</p>
    <p>“CHARGE!” he cried.</p>
    <p>The sign read <b>Bob’s Restaurant</b>.</p>
    <p><strong>Charge!</strong>” he cried.</p>
  4. When something is addressed as an apostrophe, O is capitalized.

    <p>I carried the bodies into the sea, O walker in the sea!</p>
  5. Names followed by a generational suffix, like Junior or Senior, have the suffix uppercased if the suffix is part of the person's name.

    Occasionally, junior or senior may be used to refer to a younger or elder person having the same last name, but not necessarily the same first name. In these cases, the suffix is lowercased as it is not part of their name, but rather describing their generational relation.

    <p>He talked to Bob Smith Junior.</p> <p>He talked to John Doe <abbr class="eoc">Jr.</abbr></p> <p>Madame Bovary junior was afraid of accidents for her husband.</p>

Indentation

  1. Paragraphs that directly follow another paragraph are indented by 1em.

  2. The first line of body text in a section, or any text following a visible break in text flow (like a header, a scene break, a figurem etc.), is not indented, with the exception of block quotations.

    1. Body text following a block quotation is indented only if the text begins a new semantic paragraph. Otherwise, if the body text following a block quotation is semantically part of the paragraph preceding the block quotation, it is not indented. Such non-indented paragraphs have class="continued", which removes the default indentation.

      <p>He sat down before a writing-table and, taking pen and ink, wrote on a slip of paper as follows:⁠—</p> <blockquote epub:type="z3998:letter"> <p>The Bishop of Barchester is dead.</p> </blockquote> <p>“There,” said he. “Just take that to the telegraph office at the railway station and give it in as it is.”</p>
      <p>He opened the cover in which the message was enclosed and, having read it, he took his pen and wrote on the back of it⁠—</p> <blockquote epub:type="z3998:letter"> <p epub:type="z3998:salutation">For the Earl of ⸻,</p> <footer> <p epub:type="z3998:valediction">With the Earl of ⸻’s compliments</p> </footer> </blockquote> <p class="continued">and sent it off again on its journey.</p>

Headers

  1. Titles or subtitles that are entirely non-English-language are not italicized. However, they do have an xml:lang attribute to assist screen readers in pronunciation. Titles or subtitles that are in English but contain non-English components have those components italicized according to the general rules for italics.

    <h2 epub:type="title" xml:lang="la">Ex Oblivione</h2> <h3 epub:type="title"> <span epub:type="z3998:roman">XI</span> <span epub:type="subtitle">The <i epub:type="se:name.vessel.ship">Nautilus</i></span> </h3> <h2 epub:type="title"> <span epub:type="z3998:roman">XXXV</span> <span epub:type="subtitle">Miss Thorne’s <i xml:lang="fr">Fête Champêtre</i></span> </h2> <h4 epub:type="title"> <span epub:type="z3998:roman">XI</span> <span epub:type="subtitle" xml:lang="la">Christus Nos Liberavit</span> </h4>

Chapter headers

  1. Epigraphs in chapters have the quote source set in small caps, without a leading em-dash and without a trailing period.

    <header> <h2 epub:type="title z3998:roman">II</h2> <blockquote epub:type="epigraph"> <p>“Desire no more than to thy lot may fall. …”</p> <cite>—Chaucer.</cite> </blockquote> </header>
    header [epub|type~="epigraph"] cite{ font-variant: small-caps; }
    <header> <h2 epub:type="title z3998:roman">II</h2> <blockquote epub:type="epigraph"> <p>“Desire no more than to thy lot may fall. …”</p> <cite>Chaucer</cite> </blockquote> </header>

Ligatures

Ligatures are two or more letters that are combined into a single letter, usually for stylistic purposes. In general they are not used in modern English spelling, and are replaced with their expanded characters.

Words in non-English languages like French may use ligatures to differentiate words or pronunciations. In these cases, ligatures are retained.

<p>Œdipus Rex</p> <p>Archæology</p>
<p>Oedipus Rex</p> <p>Archaeology</p>

Punctuation and spacing

  1. Sentences are single-spaced.

  2. Periods and commas are placed within quotation marks; i.e. American-style punctuation is used, not logical (AKA “British” or “new”) style.

    <p>Bosinney ventured: “It’s the first spring day”.</p>
    <p>Bosinney ventured: “It’s the first spring day.”</p>
  3. Ampersands are preceded by a no-break space (U+00A0).

    <p>The firm of Hawkinsnbsp&amp; Harker.</p>
  4. Some older works include spaces in common contractions; these spaces are removed.

    <p>Would n’t it be nice to go out? It ’s such a nice day.</p>
    <p>Wouldn’t it be nice to go out? It’s such a nice day.</p>

Quotation marks

  1. “Curly” or typographer’s quotes, both single and double, are always used instead of straight quotes. This is known as “American-style” quotation, which is different from British-style quotation which is also commonly found in both older and modern books.

    <p>“Don’t do it!” she shouted.</p>
  2. Quotation marks that are directly side-by-side are separated by a hair space ( or U+200A) character.

    <p>hairsp‘Green?’ Is that what you said?” asked Dave.</p>
  3. Words with missing letters represent the missing letters with a right single quotation mark ( or U+2019) character to indicate elision.

    <p>He had pork ’n’ beans for dinner</p>
    1. Elision is not to be confused with a glottal stop, which may sometimes occur in non-English languages like Hawaiian. Glottal stops that are not elided letters are represented with a turned comma (ʻ or U+02BB), not the similar-looking left single quotation mark ( or U+2018).

      <p><i xml:lang="haw">ʻŌlelo Hawaiʻi</i></p>
    2. Some last names in English were once contractions, and now are no longer contractions but actual last names in their own right. In these cases a left single quotation mark ( or U+2018) is used to clarify pronunciation, but to make clear that the word is not a contraction.

      <p>His friends were James M‘Donald and Sam M‘Daniel.</p>

Ellipses

  1. The ellipses glyph ( or U+2026) is used for ellipses, instead of consecutive or spaced periods.

  2. When ellipses are used as suspension points (for example, to indicate dialog that pauses or trails off), the ellipses are not preceded by a comma.

    Ellipses used to indicate missing words in a quotation require keeping surrounding punctuation, including commas, as that punctuation is in the original quotation.

  3. A word joiner (U+2060), followed by a hair space ( or U+200A) glyph, followed by another word joiner (U+2060), are located before all ellipses that do not begin a paragraph, and that are not directly preceded by .

  4. A regular space is located after all ellipses do not end a paragraph and that are not followed by punctuation.

  5. A hair space ( or U+200A) glyph is located between an ellipses and any punctuation that follows directly after the ellipses, unless that punctuation is a quotation mark, in which case there is no space at all between the ellipses and the quotation mark.

    <p>“I’m so hungrywjhairspwjhairsp! What were you saying about eatingwjhairspwj…?”

Dashes

There are many kinds of dashes, and the run-of-the-mill hyphen is often not the correct dash to use. In particular, hyphens are not used for things like date ranges, phone numbers, or negative numbers.

  1. Dashes of all types do not have white space around them.

  2. Figure dashes ( or U+2012) are used to indicate a dash in numbers that aren’t a range, like phone numbers.

    <p>His number is 555‒1234.</p>
  3. Hyphens (- or U+002D) are used to join words, including double-barrel names, or to separate syllables in a word.

    <p>Pre- and post-natal.</p>
    <p>The Smoot-Hawley act.</p>
  4. Minus sign glyphs ( or U+2212) are used to indicate negative numbers, and are used in mathematical equations instead of hyphens to represent the “subtraction” operator.

    <p>It was −5° out yesterday!</p>
    <p>5 − 2 = 3</p>
  5. En-dashes ( or U+2013) are used to indicate a numerical or date range; to indicate a relationships where two concepts are connected by the word “to,” for example a distance between locations or a range between numbers; or to indicate a connection in location between two places.

    <p>We talked 2–3 days ago.</p>
    <p>We took the Berlin–Munich train yesterday.</p>
    <p>I saw the torpedo-boat in the Ems⁠–⁠Jade Canal.</p>
  6. Non-break hyphens ( or U+2011) are used when a single word is stretched out by a speaker for prosaic effect.

    <p>When you wa‑ake, you shall ha‑ave, all the pretty little hawsiz—</p>

Em-dashes

Em-dashes ( or U+2014) are typically used to offset parenthetical phrases.

  1. Em-dashes are preceded by the invisible word joiner glyph (U+2060).

  2. Interruption in dialog is set by a single em-dash, not two em-dashes or a two-em-dash.

    <p>“I wouldn’t go as far as that, not myself, butwj——”</p>
    <p>“I wouldn’t go as far as that, not myself, butwj—”</p>

Partially-obscured words

  1. Em-dashes are used for partially-obscured years.

    <p>It was the year 19wj— in the town of Metropolis.</p>
  2. A regular hyphen is used in partially obscured years where only the last number is obscured.

    <p>It was the year 192- in the town of Metropolis.</p>
  3. A non-breaking hyphen ( or U+2011) is used when a single letter is obscured in a word.

    <p>He performed Mozart’s famous canon, “Leck mich im A‑sche.”</p>
  4. A two-em-dash ( or U+2E3A) preceded by a word joiner glyph (U+2060) is used in partially obscured word.

    <p>Sally Jwj⸺ walked through town.</p>
  5. A three-em-dash ( or U+2E3B) is used for completely obscured words.

    <p>It was night in the town of ⸻.</p>

Numbers, measurements, and math

  1. Coordinates are set with the prime ( or U+2032) or double prime ( or U+2033) glyphs, not single or double quotes.

    <p><abbr>Lat.</abbr> 27° 0' <abbr class="compass">N.</abbr>, <abbr>long.</abbr> 20° 1' <abbr class="compass eoc">W.</abbr></p> <p><abbr>Lat.</abbr> 27° 0’ <abbr class="compass">N.</abbr>, <abbr>long.</abbr> 20° 1’ <abbr class="compass eoc">W.</abbr></p>
    <p><abbr>Lat.</abbr> 27° 0′ <abbr class="compass">N.</abbr>, <abbr>long.</abbr> 20° 1′ <abbr class="compass eoc">W.</abbr></p>
  2. Ordinals for Arabic numbers are as follows: st, nd, rd, th.

    <p>The 1st, 2d, 3d, 4th.</p>
    <p>The 1st, 2nd, 3rd, 4th.</p>
  3. Numbers in a non-mathematical context are spelled out if they are less than or equal to 100. Numbers over 100 are set with digits.

    <p>“They had a gun on the West Front⁠—a seventy-five,” said O’Keefe.</p> <p>Allowing her 12,000 miles of straight-line travel through Uranus’ frigid soupy atmosphere.</p>
    1. If a series of numbers is close together in a sentence, and one would be spelled out but another wouldn’t, spell out all numbers within that context to maintain visual consistency.

      <p>There the Gulf Stream is 75 miles wide and two hundred ten meters deep.</p>
      <p>There the Gulf Stream is seventy-five miles wide and two hundred ten meters deep.</p>
    2. The plural form of spelled-out numbers is formed without an apostrophe. However the possessive or contracted form does include an apostrophe.

      <p>There were, the other answered, half a dozen two four two’s.</p>
      <p>There were, the other answered, half a dozen two four twos.</p> <p>Twice two’s four, and a stone’s a stone.</p> <p>He was allowed a day or two’s shooting in September.</p>

Roman numerals

  1. Roman numerals are not followed by trailing periods, except for grammatical reasons.

  2. Roman numerals are set using uppercase ASCII, not the Unicode Roman numeral glyphs.

  3. Roman numerals are not followed by ordinal indicators.

    <p>Henry <span epub:type="z3998:roman">VIII</span>th had six wives.</p>
    <p>Henry <span epub:type="z3998:roman">VIII</span> had six wives.</p>

Fractions

  1. Fractions are set in their appropriate Unicode glyph, if a glyph available; for example, ½, ¼, ¾ and U+00BC–U+00BE and U+2150–U+2189.

    <p>I need 1/4 cup of sugar.</p>
    <p>I need ¼ cup of sugar.</p>
  2. If a fraction doesn’t have a corresponding Unicode glyph, it is composed using the fraction slash Unicode glyph ( or U+2044) and superscript/subscript Unicode numbers. See this Wikipedia entry for more details.

    <p>Roughly 6/10 of a mile.</p>
    <p>Roughly ⁶⁄₁₀ of a mile.</p>
  3. There is no space between a whole number and its fraction.

    <p>There are 365¼ days in a year.</p>

Measurements

  1. Dimension measurements are set using the Unicode multiplication glyph (× or U+00D7), not the ASCII letter x or X.

    <p>The board was 4 x 3 x 7 feet.</p>
    <p>The board was 4 × 3 × 7 feet.</p>
  2. Feet and inches in shorthand are set using the prime ( or U+2032) or double prime ( or U+2033) glyphs (not single or double quotes), with a no-break space (U+00A0) separating consecutive feet and inch measurements.

    <p>He was 6'nbsp1" in height.</p> <p>He was 6’nbsp1” in height.</p>
    <p>He was 6′nbsp1″ in height.</p>
  3. When forming a compound of a number and unit of measurement in which the measurement is abbreviated, the number and unit of measurement are separated with a no-break space (U+00A0), not a dash. For exceptions in money, see 8.8.8.

    <p>A 12-<abbr>mm</abbr> pistol.</p>
    <p>A 12nbsp<abbr>mm</abbr> pistol.</p>

Punctuation in abbreviated measurements

See here for general abbreviation rules that also apply to measurements.

  1. Abbreviated SI units are set in lowercase without periods. They are not initialisms.

    <p>A 12nbsp<abbr>mm</abbr> pistol.</p>
  2. Abbreviated English, Imperial, or US customary units that are one word are set in lowercase with a trailing period. They are not initialisms.

    <p>We had two 9nbsp<abbr>ft.</abbr> sledges, of 41nbsp<abbr>lbs.</abbr> each.</p>

    The one exception is G (i.e. G-force), which is an initialism that is set without a period.

    <p>There’s a force of over a hundred thousand <abbr class="initialism">G</abbr>’s.</p>
  3. Abbreviated English, Imperial, or US customary units that are more than one word (like hp for horse power or mph for miles per hour) are set in lowercase without periods. They are not initialisms.

    <p>He drove his 40nbsp<abbr>hp</abbr> car at 20nbsp<abbr>mph</abbr>.</p>

Math

  1. In works that are not math-oriented or that don’t have a significant amount of mathematical equations, equations are set using regular HTML and Unicode.

    1. Operators and operands in mathematical equations are separated by a space.

      <p>6−2+2=6</p>
      <p>6 − 2 + 2 = 6</p>
    2. Operators like subtraction ( or U+2212), multiplication (× or U+00D7), and equivalence ( or U+2261) are set using their corresponding Unicode glyphs, not a hyphen or x. Almost all mathematical operators have a corresponding special Unicode glyph.

      <p>6 - 2 x 2 == 2</p>
      <p>6 − 2 × 2 ≡ 2</p>
  2. In works that are math-oriented or that have a significant amount of math, all variables, equations, and other mathematical objects are set using MathML.

    1. When MathML is used in a file, the m namespace is declared at the top of the file and used for all subsequent MathML code, as follows:

      xmlns:m="http://www.w3.org/1998/Math/MathML"

      This namespace is declared and used even if there is just a single MathML equation in a file.

      <html xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops" ub:prefix="z3998: http://www.daisy.org/z3998/2012/vocab/structure/, se: https://standardebooks.org/vocab/1.0" xml:lang="en-GB"> ... <p> <math xmlns="http://www.w3.org/1998/Math/MathML" alttext="x"> <ci>x</ci> </math> </p>
      <html xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops" xmlns:m="http://www.w3.org/1998/Math/MathML" epub:prefix="z3998: http://www.daisy.org/z3998/2012/vocab/structure/, se: https://standardebooks.org/vocab/1.0" xml:lang="en-GB"> ... <p> <m:math alttext="x"> <m:ci>x</m:ci> </m:math> </p>
    2. When possible, Content MathML is used over Presentational MathML. (This may not always be possible depending on the complexity of the work.)

      <p> <m:math alttext="x + 1 = y"> <m:apply> <m:equals/> <m:apply> <m:plus/> <m:ci>x</m:ci> <m:cn>1</m:cn> </m:apply> <m:ci>y</m:ci> </m:apply> </m:math> </p>
    3. Each <m:math> element has an alttext attribute.

      1. The alttext attribute describes the contents in the element in plain-text Unicode according to the rules in this specification.

      2. Operators in the alttext attribute are surrounded by a single space.

        <p> <m:math alttext="x+1=y"> <m:apply> <m:equals/> <m:apply> <m:plus/> <m:ci>x</m:ci> <m:cn>1</m:cn> </m:apply> <m:ci>y</m:ci> </m:apply> </m:math> </p>
        <p> <m:math alttext="x + 1 = y"> <m:apply> <m:equals/> <m:apply> <m:plus/> <m:ci>x</m:ci> <m:cn>1</m:cn> </m:apply> <m:ci>y</m:ci> </m:apply> </m:math> </p>
    4. When using Presentational MathML, <m:mrow> is used to group subexpressions, but only when necessary. Many elements in MathML, like <m:math> and <m:mtd>, imply <m:mrow>, and redundant elements are not desirable. See this section of the MathML spec for more details.

      <p> <m:math alttext="x"> <m:mrow> <m:mi>x</m:mi> </m:mrow> </m:math> </p>
      <p> <m:math alttext="x"> <m:mi>x</m:mi> </m:math> </p>
    5. If a Presentational MathML expression contains a function, the invisible Unicode function application glyph (U+2061) is used as an operator between the function name and its operand. This element looks exactly like the following, including the comment for readability: <m:mo><!--hidden U+2061 function application--></m:mo>. (Note that the preceding element contains an invisible Unicode character! It can be revealed with the se unicode-names tool.)

      <p> <m:math alttext="f(x)"> <m:mi>f</m:mi> <m:row> <m:mo fence="true">(</m:mo> <m:mi>x</m:mi> <m:mo fence="true">)</m:mo> </m:row> </m:math> </p>
      <p> <m:math alttext="f(x)"> <m:mi>f</m:mi> <m:mo>U+2061<!--hidden U+2061 function application--></m:mo> <m:row> <m:mo fence="true">(</m:mo> <m:mi>x</m:mi> <m:mo fence="true">)</m:mo> </m:row> </m:math> </p>
    6. Expressions grouped by parenthesis or brackets are wrapped in an <m:row> element, and fence characters are set using the <m:mo fence="true"> element. Separators are set using the <m:mo separator="true"> element. <m:mfenced>, which used to imply both fences and separators, is deprecated in the MathML spec and thus is not used.

      <p> <m:math alttext="f(x,y)"> <m:mi>f</m:mi> <m:mo>U+2061<!--hidden U+2061 function application--></m:mo> <m:fenced> <m:mi>x</m:mi> <m:mi>y</m:mi> </m:fenced> </m:math> </p>
      <p> <m:math alttext="f(x,y)"> <m:mi>f</m:mi> <m:mo>U+2061<!--hidden U+2061 function application--></m:mo> <m:row> <m:mo fence="true">(</m:mo> <m:mi>x</m:mi> <m:mo separator="true">,</m:mo> <m:mi>x</m:mi> <m:mo fence="true">)</m:mo> </m:row> </m:math> </p>
    7. If a MathML variable includes an overline, it is set by combining the variable’s normal Unicode glyph and the Unicode overline glyph ( or U+203E) in a <m:mover> element. However in the alttext attribute, the Unicode overline combining mark (U+0305) is used to represent the overline in Unicode.

      <p> <m:math alttext="x̅"> <m:mover> <m:mi>x</m:mi> <m:mo></m:mo> </m:mover> </m:math> </p>

Money

  1. Typographically-correct symbols are used for currency symbols.

    <p>The exchange rate was L2 for $1.</p>
    <p>The exchange rate was £2 for $1.</p>
  2. Currency symbols are not abbreviations.

£sd shorthand

£sd shorthand is a way of denoting pre-decimal currencies (pounds, shillings, and pence) common in England and other parts of the world until the 1970s.

  1. There is no white space between a number and an £sd currency symbol.

    <p>£ 14 8 s. 2 d. is known as a “tuppence.”</p>
    <p>£14 8<abbr>s.</abbr> 2<abbr>d.</abbr> is known as a “tuppence.”</p>
  2. Letters used in £sd shorthand are wrapped in <abbr> elements.

    <p>£14 8s. 2d. is known as a “tuppence.”</p>
    <p>£14 8<abbr>s.</abbr> 2<abbr>d.</abbr> is known as a “tuppence.”</p>

Latinisms

  1. Latinisms (except sic) that can be found in a modern dictionary are not italicized. Examples include e.g., i.e., ad hoc, viz., ibid., etc.. The exception is sic, which is always italicized.

  2. Whole passages of Latin language and Latinisms that aren’t found in a modern dictionary are italicized.

  3. &c. is not used, and is replaced with etc..

  4. For Ibid., see Endnotes.

  5. Latinisms that are abbreviations are set in lowercase with periods between words and no spaces between them, except BC, AD, BCE, and CE, which are set without periods, in small caps, and wrapped with <abbr class="era">:

    abbr.era{ font-variant: all-small-caps; }
    <p>Julius Caesar was born around 100 <abbr class="era">BC</abbr>.</p>

Initials and abbreviations

  1. Acronyms (terms made up of initials and pronounced as one word, like NASA, SCUBA, or NATO) are set in small caps, without periods, and are wrapped in an <abbr class="acronym"> element with corresponding CSS.

    abbr.acronym{ font-variant: all-small-caps; }
    <p>He was hired by <abbr class="acronym">NASA</abbr> last week.</p>
  2. Initialisms (terms made up of initials in which each initial is pronounced separately, like M.P., P.S., or U.S.S.R.) are set with periods and without spaces (with some exceptions that follow) and are wrapped in an <abbr class="initialism"> element.

    <p>He was hired by the <abbr class="initialism">U.S.</abbr> <abbr class="initialism">F.B.I.</abbr> last week.</p>
  3. When an abbreviation that is not an acronym contains a terminal period, its <abbr> element has the additional eoc class (End of Clause) if the terminal period is also the last period in clause. Such sentences do not have two consecutive periods.

    <p>She loved Italian food like pizza, pasta, <abbr class="eoc">etc.</abbr></p>
    <p>He lists his name alphabetically as Johnson, <abbr class="name eoc">R. A.</abbr></p>
    <p>His favorite hobby was <abbr class="acronym">SCUBA</abbr>.</p>
  4. Initials of people’s names are each separated by periods and spaces. The group of initials is wrapped in an <abbr class="name"> element.

    <p><abbr class="name">H. P.</abbr> Lovecraft described himself as an aged antiquarian.</p>
  5. Academic degrees are wrapped in an <abbr class="degree"> element. Degrees that consist of initials are set with a period between each initial. Degrees that consist of initials followed by abbreviated words are set with a hair space before the word.

    <p>Judith Douglas, <abbr class="degree eoc">D.D.S</abbr></p> <p>Abraham Van Helsing, <abbr class="degree">M.D.</abbr>, <abbr class="degree">D.hairspPh.</abbr>, <abbr class="degree">D.hairspLit.</abbr>, <abbr>etc.</abbr>, <abbr class="eoc">etc.</abbr></p>
    1. Some degrees are exceptions:

      • LL.D. does not have a period in LL, because it indicates the plural Legum.

  6. Postal codes and abbreviated US states are set in all caps, without periods or spaces, and are wrapped in an <abbr class="postal"> element.

    <p>Washington <abbr class="postal">DC</abbr>.</p>
  7. Abbreviations that are abbreviations of a single word, and that are not acronyms or initialisms (like Mr., Mrs., or lbs.) are set with <abbr>.

    1. Abbreviations ending in a lowercase letter are set without spaces between the letters, and have a trailing period.

    2. Abbreviations without lowercase letters are set without spaces and without a trailing period.

    3. Abbreviations that describes the next word, like Mr., Mrs., Mt., and St., are set with a no-break space (U+00A0) between the abbreviation and its target.

      <p>He called on <abbr>Mrs.</abbr>nbspJones yesterday.</p>
  8. Compass points are separated by periods and spaces. The group of points are wrapped in an <abbr class="compass"> element.

    <p>He traveled <abbr class="compass">S.</abbr>, <abbr class="compass">N. W.</abbr>, then <abbr class="compass eoc">E. S. E.</abbr></p>

Exceptions that are not abbreviations

  1. The following are not abbreviations, and are set without periods or spaces.

    • OK

    • SOS

  2. The following are initialisms, but are set without periods or spaces:

    • TV, i.e. television.

    • AC and DC, when referring to electrical current.

    • G, when used in the sense of G-force. Also see 8.8.5.4.2.

    • Stock ticker symbols.

      <p>She bought 125 shares of <abbr class="initialism">XYZ</abbr> corporation.</p>
  3. The following are abbreviations, but are not initialisms. Unlike almost all other abbreviations, they are in all caps and only have a period at the end.

    • MS. (manuscript)

    • MSS. (manuscripts)

    • M. (Monsieur)

    • MM. (Messieurs)

    <p><abbr>MM.</abbr>nbspGuy and Luc were putting the finishing touches on the <abbr>MS.</abbr> of their new novel.</p>
  4. A.B.C., when used in the sense of the alphabet, is not an abbreviation, and is set with periods between the letters. But other uses, like A.B.C. shops, are abbreviations. (The abbreviation in A.B.C. shop stands for “Australian Broadcasting Corporation.”)

    <p>She was learning her A.B.C.s</p> <p>He stopped by the <abbr class="initialism">A.B.C.</abbr> shop.</p>

Other exceptions

  1. The abbreviations 1D, 2D, 3D, and 4D, meaning first, second, third, and fourth dimensions, are abbreviations but do not have a trailing period.

  2. The words recto and verso are sometimes abbreviated with an initial and a superscript o. They are regular abbreviations, set without periods, and the o is superscripted with <sup>.

    <p><abbr>Ch.</abbr> 1, <abbr>fol.</abbr> 2 <abbr>r<sup>o</sup></abbr>.</p>

Times

  1. Times in a.m. and p.m. format are set in lowercase, with periods, and without spaces.

  2. a.m. and p.m. are wrapped in an <abbr class="time"> element.

Times as digits

  1. Digits in times are separated by a colon, not a period or comma.

  2. Times written in digits followed by a.m. or p.m. are set with a no-break space (U+00A0) between the digit and a.m. or p.m..

    <p>He called at 6:40nbsp<abbr class="time eoc">a.m.</abbr></p>

Times as words

  1. Words in a spelled-out time are separated by spaces, unless they appear before a noun, where they are separated by a hyphen.

    <p>He arrived at five thirty.</p>
    <p>They took the twelve-thirty train.</p>
  2. Times written in words followed by a.m. or p.m. are set with a regular space between the time and a.m. or p.m..

    <p>She wasn’t up till seven <abbr class="time eoc">a.m.</abbr></p>
  3. Military times that are spelled out (for example, in dialog) are set with dashes. Leading zeros are spelled out as oh.

    <p>He arrived at oh-nine-hundred.</p>

Chemicals and compounds

  1. Molecular compounds are set in Roman, without spaces, and wrapped in an <abbr class="compound"> element.

    <p>He put extra <abbr class="compound">NaCl</abbr> on his dinner.</p>
  2. Elements in a molecular compound are capitalized according to their listing in the periodic table.

  3. Amounts of an element in a molecular compound are set in subscript with a <sub> element.

    <p>She drank eight glasses of <abbr class="compound">H<sub>2</sub>O</abbr> a day.</p>

Temperatures

  1. The minus sign glyph ( or U+2212), not the hyphen glyph, is used to indicate negative numbers.

  2. Either the degree glyph (° or U+00B0) or the word degrees is acceptable. Works that use both are normalized to use the dominant method.

Abbreviated units of temperature

  1. Units of temperature measurement, like Farenheit or Celcius, may be abbreviated to F or C.

  2. Units of temperature measurement do not have trailing periods.

  3. If an abbreviated unit of temperature measurement is preceded by a number, the unit of measurement is first preceded by a hair space ( or U+200A).

  4. Abbreviated units of measurement are set in small caps.

  5. Abbreviated units of measurement are wrapped in an <abbr class="temperature"> element.

    abbr.temperature{ font-variant: all-small-caps; }
    <p>It was −23.33° Celsius (or −10°hairsp<abbr class="temperature">F</abbr>) last night.</p>

Scansion

Scansion is the representation of the metrical stresses in lines of verse.

  1. × (U+00d7) indicates an unstressed sylllable and / (U+002f) indicates a stressed syllable. They are separated from each other with no-break spaces (U+00A0).

    <p>Several of his types, however, constantly occur; <abbr>e.g.</abbr> A and a variant (/ × | / ×) (/ × × | / ×); B and a variant (× / | × /) (× × / | × /); a variant of D (/ × | / × ×); E (/ × × | /). </p>
  2. Lines of poetry listed on a single line (like in a quotation) are separated by a space, then a forward slash, then a space. Capitalization is preserved for each line.

    <p>The famous lines “Wake! For the Sun, who scatter’d into flight / The Stars before him from the Field of Night” are from <i epub:type="se:name.publication.book">The Rubáiyát of Omar Khayyám</i>.</p>

Legal cases and terms

  1. Legal cases are set in italics.

  2. Either versus or v. are acceptable in the name of a legal case; if using v., a period follows the v., and it is wrapped in an <abbr> element.

    <p>He prosecuted <i epub:type="se:name.legal-case">Johnson <abbr>v.</abbr> Smith</i>.</p>

Morse code

Any Morse code that appears in a book is changed to fit Standard Ebooks’ format.

American Morse Code

  1. Middle dot glyphs (· or U+00B7) are used for the short mark or dot.

  2. En dash ( or U+2013) are used for the longer mark or short dash.

  3. Em dashes ( or U+2014) are used for the long dash (the letter L).

  4. If two en dashes are placed next to each other, a hair space ( or U+200A) is placed between them to keep the glyphs from merging into a longer dash.

  5. Only in American Morse Code, there are internal gaps used between glyphs in the letters C, O, R, or Z. No-break spaces (U+00A0) are used for these gaps.

  6. En spaces (U+2002) are used between letters.

  7. Em spaces (U+2003) are used between words.

    <p>-- .. .. __ .. - - __ . . .. __ -.. .. . .- -</p> <p>My little old cat.</p>
    <p>– – ·· ·· — ·· – – — · · · — –·· ·· · ·– –</p> <p>My little old cat.</p>

Citations

  1. Citations are wrapped in a <cite> element.

  2. Citations that are the source of a quote are preceded by a space and an em dash, within the <cite> element.

    <p>“The Moving Finger writes; and, having writ, moves on.” <cite><i epub:type="se:name.publication.book">The Rubaiyat of Omar Khayyam</i></cite>.</p>
  3. Citations within a <blockquote> element have the <cite> element as the last direct child of the <blockquote> parent.

    <blockquote> <p>“The Moving Finger writes; and, having writ, moves on.”</p> <p> <cite><i epub:type="se:name.publication.book">The Rubaiyat of Omar Khayyam</i></cite> </p> </blockquote>
    <blockquote> <p>“The Moving Finger writes; and, having writ, moves on.”</p> <cite><i epub:type="se:name.publication.book">The Rubaiyat of Omar Khayyam</i></cite> </blockquote>

Verses and Chapters of the Bible

  1. Citations of passages from the Bible include the name of the book, followed by the chapter number and the verse number. The chapter and the verse numbers are separated by a colon.

    1. All chapter and verse numbers are written in Arabic numerals. Similarly, if a book being cited is a “numbered” book, the number is also written in Arabic numerals.

    <blockquote> <p>“Though I speak with the tongues of men and of angels, and have not charity, I am become as sounding brass, or a tinkling cymbal.”</p> <cite><span epub:type="z3998:roman">I</span> Corinthians <span epub:type="z3998:roman">XIII</span> 1</cite> </blockquote>
    <blockquote> <p>“Though I speak with the tongues of men and of angels, and have not charity, I am become as sounding brass, or a tinkling cymbal.”</p> <cite>—1 Corinthians 13:1</cite> </blockquote>
  2. If an entire chapter, instead of a particular verse, is being cited, then the citation includes the name of the book followed by the chapter number.

    <p>“In the beginning God created the heaven and the earth” is the first verse of Genesis <span epub:type="z3998:roman">I</span>.</p>
    <p>“In the beginning God created the heaven and the earth” is the first verse of Genesis 1.</p>
  3. If a continuous range of verses is being cited, an en-dash ( or U+2013) is placed between the verse numbers indicating the beginning and the end of the range.

    <p>Matthew 5:3–11.</p>

    Ranges may also span multiple chapters within the same book:

    <p>Matthew 5:1–7:29.</p>
  4. If a discontinuous group of verses in the same chapter is being cited, each distinct verse number is separated by a comma followed by a space.

    <p>Matthew 6:2, 16.</p>
  5. If there are multiple citations of the same book, each citation is separated by a semicolon followed by a space, and the name of the book is omitted after the first citation.

    <p>Matthew 5:3–11; 5:1–7:29; 6:2, 16</p>

Non-Latin Scripts and Transliterations

  1. Greek script is set in italics. All other scripts are not set in italics unless specially required by the text.

Greek

  1. Rough breathing marks are set using their precomposed character, if available; for example, , , and . If a precomposed character is not available, ̔ (U+0314) is used when the mark must be combined with a character, and ʽ (U+02BD) is used in all other cases.

  2. Smooth breathing marks are set with ◌̓ (U+0313) when the mark must be combined with a character. ᾿ (U+1FBF) is used in all other cases.

Chinese

  1. Wade-Giles is the preferred method of transliterating Chinese script. (See here for discussion.) Transliteration to Wade-Giles from Legge is permitted, but not required.

  2. In Wade-Giles transliteration, rough breathing marks are set using ʽ (U+02BD).