Typography

Section titles and ordinals

8.1.1
Section ordinals in the body text are set in Roman numerals.
8.1.2
Section titles are titlecased according to the output of se titlecase. Section titles are not all-caps or small-caps.
8.1.3
Section titles do not have trailing periods.
8.1.4
Chapter titles omit the word Chapter, unless the word used is a stylistic choice for prose style purposes. Chapters with unique identifiers (i.e. not Chapter, but something unique to the style of the book, like Book or Stave) do include that unique identifier in the title, wrapped in <span epub:type="label">.
<h2>Chapter II</h2>

<h2 epub:type="ordinal z3998:roman">II</h2>

<h2><span epub:type="label">Stave</span> <span epub:type="ordinal z3998:roman">III</span></h2>

In special cases it may be desirable to retain Chapter for clarity. For example, Frankenstein has “Chapter” in titles to differentiate them from the “Letter” sections.

Italics

8.2.1
Using both italics and quotes (outside of the context of quoted dialog) is usually not necessary. Either one or the other is used, with rare exceptions.
8.2.2
Words and phrases that require emphasis are italicized with the <em> element.
“Perhaps <em>he</em> was there,” Raoul said, at last.
8.2.3
Strong emphasis, like shouting, may be set in small caps with the <strong> element.
“<strong>Can’t</strong> I?” screamed the unhappy creature to himself.
8.2.4
When a short phrase within a longer clause is italicized, trailing punctuation that may belong to the containing clause is not italicized.
“Look at <em>that!</em>” she shouted.

“Look at <em>that</em>!” she shouted.
8.2.5
When an entire clause is italicized, trailing punctuation is italicized, unless that trailing punctuation is a comma at the end of dialog.
“<em>Charge!</em>” she shouted.

“<em>But I want to</em>,” she said.
8.2.6
Words written to be read as sounds are italicized with <i>.
He could hear the dog barking: <i>Ruff, ruff, ruff!</i>
8.2.7
A person’s internal thoughts, if they are italicized in the source, are formatted with <q>, styled with italics. If the thoughts are quoted, they are left as quoted.
q{ font-style: italic; }

The thought flashed to me: <q>it’s a city you’re firing at, not a plane</q>, and I flinched.

Italicizing individual letters

8.2.8.1
Individual letters that are used in context as a phoneme are italicized with an <i epub:type="z3998:phoneme"> element. They are sentence-cased and not followed by periods.
“Mamochka, let’s play priatki,” (hide and seek), cried Lelechka, pronouncing the r like the l, so that the word sounded “pliatki.”
1. 8.2.8.1.1
 Plural phonemes are formed with ’s, to aid in clarity.
 Her as were nasally.
 
 Her a’s were nasally.
8.2.8.2
Graphemes are italicized with an <i epub:type="z3998:grapheme"> element.
“It’s such a pity,” she would say pensively, “that July hasn’t got an r in it.
1. 8.2.8.2.1
 When a word is being spelled out, the individual letters of the word are set as graphemes.
 I rattled off, “t-h-i-r-d, third.”
8.2.8.3
Individual letters that are not graphemes or phonemes (for example letters that might be referring to names, the shapes of the letters themselves, musical notes or keys, or concepts) are not italicized.
...due to the loss of what is known in New England as the “L”: that long deep roofed adjunct usually built at right angles to the main house...

She was learning her A.B.C.s.

His trident had the shape of an E.

The piece was in the key of C major.
8.2.8.4
The ordinal nth is set with an italicized n, without a hyphen.
The <i>n</i>th degree.

Italicizing non-English words and phrases

8.2.9.1
Non-English words and phrases that are not in Merriam-Webster are italicized, unless they are in a non-Roman script like Chinese or Japanese.
I have so much to tell you, mon petit chou darling.
8.2.9.2
Non-English words that are proper names, or are in proper names, are not italicized, unless the name itself would be italicized according to the rules for italicizing or quoting names and titles. If words in the name might be mispronounced in English pronunciation, they are wrapped in a <span xml:lang="LANG"> element to assist screen readers with pronunciation. Most proper names of people or places do not require this, but occasionally there may be some that do.
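“<i xml:lang="fr">Où est le métro?</i>” he asked, and she pointed to <i xml:lang="fr">Place de Clichy</i>, next to the <i xml:lang="fr">Le Bon Petit Déjeuner</i> restaurant.

“<i xml:lang="fr">Où est le métro?</i>” he asked, and she pointed to Place de Clichy, next to the <span xml:lang="fr">Le Bon Petit Déjeuner</span> restaurant.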
8.2.9.3
If certain non-English words are used so frequently in the text that italicizing them at each instance would be distracting to the reader, then only the first instance is italicized. Subsequent instances are wrapped in a <span xml:lang="LANG"> element.
8.2.9.4
Words and phrases that are originally non-English in origin, but that can now be found in Merriam-Webster’s basic online search results, are not italicized. “Basic online search results” means that results from other dictionaries that may appear alongside basic search results, including results from the unabridged or legal dictionaries, do not fall under this rule and may still be obscure enough to be italicized.
Sir Percy’s bon mot had gone the round of the brilliant reception-rooms.
8.2.9.5
Inline-level italics are set using the <i> element with an xml:lang attribute corresponding to the correct IETF language tag.
8.2.9.6
Block-level italics are set using an xml:lang attribute on the closest encompassing block element, with the style of font-style: italic.
In this example, note the additional namespace declaration, and that we target only <blockquote> elements that have the language tag. This is because there can be other elements, e.g. <span>, that have a language tag but should not be italicized.

@namespace xml "http://www.w3.org/XML/1998/namespace";

blockquote[xml|lang]{
	font-style: italic;
}

<blockquote epub:type="z3998:verse" xml:lang="la"> —gelidas leto scrutata medullas, Pulmonis rigidi stantes sine vulnere fibras Invenit, et vocem defuncto in corpore quaerit. </blockquote>
8.2.9.7
Words that are in a non-English “alien” language (i.e. one that is made up, like in a science fiction or fantasy work) are italicized and given an IETF language tag in a custom namespace. Custom namespaces consist of x-TAG, where TAG is a custom descriptor of 8 characters or fewer.
“<i xml:lang="x-arcturan">Dolm</i>,” said Haunte.
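The x-TAG shape in 8.2.9.7 can be checked mechanically. The following is a minimal sketch, not part of any Standard Ebooks tool: the function name is hypothetical, and it assumes that TAG is limited to ASCII letters and digits, which the rule itself does not state.

```python
import re

# Assumption: TAG is 1 to 8 ASCII letters or digits. The rule only says
# "a custom descriptor of 8 characters or fewer".
CUSTOM_LANGUAGE_TAG = re.compile(r"x-[A-Za-z0-9]{1,8}")


def is_custom_language_tag(tag: str) -> bool:
    """Return True if the tag has the shape x-TAG with a short descriptor."""
    return CUSTOM_LANGUAGE_TAG.fullmatch(tag) is not None
```

A tag such as x-arcturan passes this check, while a real language tag such as fr or a descriptor longer than 8 characters does not.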
8.2.9.8
Words that are in an unknown language have their xml:lang attribute set to und.
8.2.9.9
Some phrases that are not English in origin but that are common in English prose, and which begin in a word that could be confused with an English word, are always italicized to prevent confusion. An incomplete list follows:
- sic
- a posteriori
- a priori
- a fortiori
- ad absurdum
- ad hominem
- ad infinitum
- ad interim
- ad nauseam
- in absentia
- in camera
- in loco parentis
- in situ
- in statu quo
- in toto
- in vitro
- inter alia
- more suo

Italicizing or quoting newly-used English words

8.2.10.1
When introducing new terms, non-English or technical terms are italicized, but terms composed of common English are set in quotation marks.
English whalers have given this the name “ice blink.”

The soil consisted of that igneous gravel called <i>tuff</i>.
8.2.10.2
English neologisms in works where a special vocabulary is a regular part of the narrative are not italicized. For example science fiction works may necessarily contain made-up English technology words, and those are not italicized.

Italics in names and titles

8.2.11.1
Place names, like pubs, bars, or buildings, are not quoted.
8.2.11.2
The names of publications, music, and art that can stand alone are italicized; additionally, the names of transport vessels are italicized. These include, but are not limited to:
- Periodicals like magazines, newspapers, and journals.
- Publications like books, novels, plays, and pamphlets, except “holy texts,” like the Bible or books within the Bible.
- Long poems and ballads, like the Iliad, that are book-length.
- Long musical compositions or audio, like operas, music albums, or radio shows.
- Long visual art, like films or a TV show series.
- Visual art, like paintings or sculptures.
- Transport vessels, like ships.
8.2.11.3
The names of short publications, music, or art, that cannot stand alone and are typically part of a larger collection or work, are quoted. These include, but are not limited to:
- Short musical compositions or audio, like pop songs, arias, or an episode in a radio series.
- Short prose like novellas, short stories, or short (i.e. not epic) poems.
- Chapter titles in a prose work.
- Essays or individual articles in a newspaper or journal.
- Short visual art, like short films or episodes in a TV series.

Examples

He read “Candide” while having a pint at the “King’s Head.”

He read <i epub:type="se:name.publication.book">Candide</i> while having a pint at the King’s Head.

Taxonomy

8.2.12.1
Binomial names (generic, specific, and subspecific) are italicized with an <i> element having the z3998:taxonomy semantic inflection.
A bonobo monkey is <i epub:type="z3998:taxonomy">Pan paniscus</i>.
8.2.12.2
Genus, tribe, subfamily, family, order, class, phylum or division, and kingdom names are capitalized but not italicized.
A bonobo monkey is in the phylum Chordata, class Mammalia, order Primates.
8.2.12.3
If a taxonomic name is the same as the common name, it is not italicized.
8.2.12.4
The second part of the binomial name follows the capitalization style of the source text. Modern usage requires lowercase, but older texts may set it in uppercase.

Exceptions

8.2.13.1
Epigraphs, bridgeheads, and some other types of heading matter are set in italics by default. Text that in a Roman-set context would be italicized (like non-English words or phrases, or titles of books) are thus set in Roman in that heading matter, to contrast against the default italics. However, if due to this rule the entire block would be set in Roman instead of italics, thus lending the block an unexpected appearance, then the contrasting Roman is discarded and the default italics are preserved.
This can usually be achieved by removing <i> elements (which have no semantic meaning and merely indicate the desire for italics) and moving their epub:type or xml:lang attributes to their parent element.

[epub|type~="epigraph"]{
	font-style: italic;
	/* ... */
}

[epub|type~="epigraph"] i{
	font-style: normal;
}

[epub|type~="epigraph"] cite{
	margin-top: 1em;
	font-style: normal;
	font-variant: small-caps;
}

[epub|type~="epigraph"] cite i{
	font-style: italic;
}

<blockquote epub:type="epigraph"> “En administration, toutes les sottises sont mères.” <cite>Maximes, fr <abbr epub:type="z3998:given-name">M. G.</abbr> De Levis.</cite> </blockquote>

Capitalization

8.3.1
In general, capitalization follows modern English style. Some very old works frequently capitalize nouns that today are no longer capitalized. These archaic capitalizations are removed, unless doing so would change the meaning of the work.
8.3.2
Titlecasing, or the capitalization of titles, follows the formula used in the se titlecase tool.
8.3.3
Text in all caps is almost never correct typography. Instead, such text is changed to the correct case and surrounded with a semantically-meaningful element like <em> (for emphasis), <strong> (for strong emphasis, like shouting) or <b> (for unsemantic formatting required by the text). <strong> and <b> are styled in small-caps by default in Standard Ebooks.
The sign read BOB’S RESTAURANT.

“CHARGE!” he cried.

The sign read <b>Bob’s Restaurant</b>.

“<strong>Charge!</strong>” he cried.
8.3.4
When something is addressed as an apostrophe, O is capitalized.
I carried the bodies into the sea, O walker in the sea!
8.3.5
Names followed by a generational suffix, like Junior or Senior, have the suffix uppercased if the suffix is part of the person’s name.
Occasionally, junior or senior may be used to refer to a younger or elder person having the same last name, but not necessarily the same first name. In these cases, the suffix is lowercased as it is not part of their name, but rather describing their generational relation.

He talked to Bob Smith Junior.

He talked to John Doe <abbr class="eoc">Jr.</abbr>

Madame Bovary junior was afraid of accidents for her husband.

Indentation

8.4.1
Paragraphs that directly follow another paragraph are indented by 1em.
8.4.2
The first line of body text in a section, or any text following a visible break in text flow (like a header, a scene break, a figure etc.), is not indented, with the exception of block quotations.
1. 8.4.2.1
 Body text following a block quotation is indented only if the text begins a new semantic paragraph. Otherwise, if the body text following a block quotation is semantically part of the paragraph preceding the block quotation, it is not indented. Such non-indented paragraphs have class="continued", which removes the default indentation.
 He sat down before a writing-table and, taking pen and ink, wrote on a slip of paper as follows:⁠— <blockquote epub:type="z3998:letter"> The Bishop of Barchester is dead. </blockquote> “There,” said he. “Just take that to the telegraph office at the railway station and give it in as it is.”
 
 He opened the cover in which the message was enclosed and, having read it, he took his pen and wrote on the back of it⁠— <blockquote epub:type="z3998:letter"> For the Earl of ⸻, <footer> With the Earl of ⸻’s compliments </footer> </blockquote> and sent it off again on its journey.

Headers

8.5.1
Titles or subtitles that are entirely non-English-language are not italicized. However, they do have an xml:lang attribute to assist screen readers in pronunciation. Titles or subtitles that are in English but contain non-English components have those components italicized according to the general rules for italics.
<h2 epub:type="title" xml:lang="la">Ex Oblivione</h2>

<hgroup>
	<h2 epub:type="ordinal z3998:roman">XI</h2>
	<p epub:type="title">The <i epub:type="se:name.vessel.ship">Nautilus</i></p>
</hgroup>

<hgroup>
	<h2 epub:type="ordinal z3998:roman">XXXV</h2>
	<p epub:type="title">Miss Thorne’s <i xml:lang="fr">Fête Champêtre</i></p>
</hgroup>

<hgroup>
	<h2 epub:type="ordinal z3998:roman">XI</h2>
	<p epub:type="title" xml:lang="la">Christus Nos Liberavit</p>
</hgroup>

Chapter headers

8.5.2.1
Epigraphs in chapters have the quote source set in small caps, without a leading em dash and without a trailing period.
<header> <h2 epub:type="ordinal z3998:roman">II</h2> <blockquote epub:type="epigraph"> “Desire no more than to thy lot may fall. …” <cite>—Chaucer.</cite> </blockquote> </header>

header [epub|type~="epigraph"] cite{ font-variant: small-caps; }

<header> <h2 epub:type="ordinal z3998:roman">II</h2> <blockquote epub:type="epigraph"> “Desire no more than to thy lot may fall. …” <cite>Chaucer</cite> </blockquote> </header>

Ligatures

Ligatures are two or more letters that are combined into a single letter, usually for stylistic purposes. In general they are not used in modern English spelling, and are replaced with their expanded characters.

Words in non-English languages like French may use ligatures to differentiate words or pronunciations. In these cases, ligatures are retained.

<p>Œdipus Rex</p>
<p>Archæology</p>

<p>Oedipus Rex</p>
<p>Archaeology</p>

Punctuation and spacing

8.7.1
Sentences are single-spaced.
8.7.2
Periods and commas are placed within quotation marks; i.e. American-style punctuation is used, not logical (AKA “British” or “new”) style.
Bosinney ventured: “It’s the first spring day”.

Bosinney ventured: “It’s the first spring day.”
1. 8.7.2.1
 If dialog ends in a semicolon, the semicolon is placed within the closing quotation mark. Otherwise, semicolons always go outside of quotation marks.
 “I’ve ask Him, and ask Him, but der help don’t come. I can do no more;” and a tempest of despairing sobs shook her gaunt frame.
 
 A premonition told him that this misfortune had befallen the little “Family”; he quickly drew on a coat and ran over to the “Ark.”
8.7.3
Ampersands are preceded by a no-break space (U+00A0).
The firm of Hawkins[nbsp]& Harker.
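Rule 8.7.3 is easy to apply mechanically. The following sketch assumes plain text or XHTML source in which the ampersand is preceded by ordinary white space; the function name is hypothetical and is not part of the se toolset.

```python
import re

NO_BREAK_SPACE = "\u00a0"


def bind_ampersands(text: str) -> str:
    """Replace any run of white space directly before an ampersand
    with a single no-break space (U+00A0)."""
    # \s also matches U+00A0, so running the function twice is harmless.
    return re.sub(r"\s+&", NO_BREAK_SPACE + "&", text)
```

Because only the white space before the ampersand is touched, the same call works on escaped source text such as `Hawkins &amp; Harker`.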
8.7.4
Some older works include spaces in common contractions; these spaces are removed.
Would n’t it be nice to go out? It ’s such a nice day.

Wouldn’t it be nice to go out? It’s such a nice day.

Quotation marks

8.7.5.1
“Curly” or typographer’s quotes, both single and double, are always used instead of straight quotes. Quotation follows “American-style” usage, which differs from the British-style usage also commonly found in both older and modern books.
“Don’t do it!” she shouted.
8.7.5.2
Quotation marks that are directly side-by-side are separated by a hair space (U+200A) character.
“[hairsp]‘Green?’ Is that what you said?” asked Dave.
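As an illustration of 8.7.5.2, the sketch below inserts a hair space between any two adjacent curly quotation marks. The function name is hypothetical, and the sketch assumes the text has already been converted to curly quotes.

```python
import re

HAIR_SPACE = "\u200a"
QUOTE_MARKS = "“”‘’"


def separate_adjacent_quotes(text: str) -> str:
    """Insert a hair space (U+200A) after a quotation mark that is
    directly followed by another quotation mark."""
    pattern = f"([{QUOTE_MARKS}])(?=[{QUOTE_MARKS}])"
    return re.sub(pattern, r"\1" + HAIR_SPACE, text)
```

The lookahead leaves the second mark unconsumed, so a run of three marks receives two hair spaces.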
8.7.5.3
Words with missing letters represent the missing letters with a right single quotation mark (’ or U+2019) character to indicate elision.
He had pork ’n’ beans for dinner
1. 8.7.5.3.1
 Elision is not to be confused with a glottal stop, which may sometimes occur in non-English languages like Hawaiian. Glottal stops that are not elided letters are represented with a turned comma (ʻ or U+02BB), not the similar-looking left single quotation mark (‘ or U+2018).
 ʻŌlelo Hawaiʻi
2. 8.7.5.3.2
 Rarely, in older texts some common last names are rendered using a left single quotation mark (‘ or U+2018) instead of a superscript c. This is a matter of typography, and is not the actual spelling of such names. These names are changed to their equivalent modern spelling.
 His friends were James M‘Donald and Sam M‘Daniel.
 
 His friends were James McDonald and Sam McDaniel.
8.7.5.4
Ditto marks are set with the right double quotation mark glyph (” or U+201D). This is not to be confused with the ditto mark glyph (〃 or U+3003), which is for non-Latin scripts only, or the quotation mark glyph (" or U+0022).
<table> <tbody> <tr> <td>3</td> <td>lbs.</td> </tr> <tr> <td>12</td> <td>”</td> </tr> </tbody> </table>
8.7.5.5
Some idiomatic phrases are not set with scare quotes:
- ... to a T.

Ellipses

8.7.6.1
The ellipsis glyph (… or U+2026) is used for ellipses, instead of consecutive or spaced periods.
8.7.6.2
When ellipses are used as suspension points (for example, to indicate dialog that pauses or trails off), the ellipses are not preceded by a comma.
Ellipses used to indicate missing words in a quotation require keeping surrounding punctuation, including commas, as that punctuation is in the original quotation.
8.7.6.3
A word joiner (U+2060), followed by a hair space (U+200A), followed by another word joiner (U+2060), is located before every ellipsis that does not begin a paragraph and that is not directly preceded by “.
8.7.6.4
A regular space is located after all ellipses that do not end a paragraph and that are not followed by punctuation.
8.7.6.5
A hair space (U+200A) glyph is located between an ellipsis and any punctuation that follows directly after the ellipsis, unless that punctuation is a quotation mark, in which case there is no space at all between the ellipsis and the quotation mark.
“I’m so hungry[wj][hairsp][wj]…[hairsp]! Let’s eat[wj][hairsp][wj]…”
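The spacing rules in 8.7.6.3 through 8.7.6.5 can be sketched as a small function that works on one paragraph of text at a time. This is only an illustration under stated assumptions: the function name is hypothetical, the set of “following punctuation” is limited to a few common marks, and the input is assumed to already use the ellipsis glyph.

```python
import re

WORD_JOINER = "\u2060"
HAIR_SPACE = "\u200a"


def space_ellipses(paragraph: str) -> str:
    """Apply ellipsis spacing to a single paragraph of plain text."""
    # Drop any existing spacing directly before an ellipsis.
    text = re.sub(r"[ \u00a0\u200a\u2060]+…", "…", paragraph)
    # 8.7.6.3: word joiner, hair space, word joiner before an ellipsis,
    # unless it starts the paragraph or directly follows “.
    text = re.sub(r"(?<=[^“])…", WORD_JOINER + HAIR_SPACE + WORD_JOINER + "…", text)
    # 8.7.6.5: hair space between an ellipsis and following punctuation.
    # Assumption: only these marks count; quotation marks get no space.
    text = re.sub(r"…(?=[!?,;:.])", "…" + HAIR_SPACE, text)
    # 8.7.6.4: regular space after an ellipsis that runs into a word.
    text = re.sub(r"…(?=\w)", "… ", text)
    return text
```

An ellipsis that is already followed by a regular space or by a quotation mark is left alone on that side, which matches the rules above.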

Dashes

There are many kinds of dashes, and the run-of-the-mill hyphen is often not the correct dash to use. In particular, hyphens are not used for things like date ranges, phone numbers, or negative numbers.

8.7.7.1
Dashes of all types do not have white space around them.
8.7.7.2
Figure dashes (‒ or U+2012) are used to indicate a dash in numbers that aren’t a range, like phone numbers.
His number is 555‒1234.
8.7.7.3
Hyphens (- or U+002D) are used to join words, including double-barrel names, or to separate syllables in a word. The Unicode hyphen (U+2010) is not used.
Pre- and post-natal.

The Smoot-Hawley act.
8.7.7.4
Minus sign glyphs (− or U+2212) are used to indicate negative numbers, and are used in mathematical equations instead of hyphens to represent the “subtraction” operator.
It was −5° out yesterday!

5 − 2 = 3
8.7.7.5
En dashes (– or U+2013) are used to indicate a numeric or date range; to indicate a relationship where two concepts are connected by the word “to,” for example a distance between locations or a range between numbers; or to indicate a connection in location between two places. En dashes are preceded and followed by the invisible word joiner glyph (U+2060).
We talked 2[wj]–[wj]3 days ago.

We took the Berlin[wj]–[wj]Munich train yesterday.

I saw the torpedo-boat in the Ems[wj]–[wj]Jade Canal.
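For numeric ranges, 8.7.7.5 can be sketched as a single substitution. The function name is hypothetical, and the sketch assumes the caller already knows that the text contains ranges: it cannot tell a range from a phone number, which takes a figure dash instead.

```python
import re

WORD_JOINER = "\u2060"
EN_DASH = "\u2013"


def join_numeric_ranges(text: str) -> str:
    """Set a hyphen or en dash between two digits as an en dash
    surrounded by word joiners (U+2060), with no white space."""
    return re.sub(
        r"(?<=\d)\s*[-\u2013]\s*(?=\d)",
        WORD_JOINER + EN_DASH + WORD_JOINER,
        text,
    )
```

Text that already has word joiners around the en dash does not match the pattern, so it passes through unchanged.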
8.7.7.6
Non-break hyphens (‑ or U+2011) are used when a single word is stretched out by a speaker for prosodic effect.
When you wa‑ake, you shall ha‑ave, all the pretty little hawsiz—

When adding non-breaking hyphens to stretch out words, beware that se typogrify will incorrectly convert them to regular hyphens!

Em dashes

Em dashes (— or U+2014) are typically used to offset parenthetical phrases.

8.7.7.7.1
Em dashes are preceded by the invisible word joiner glyph (U+2060).
8.7.7.7.2
Interruption in dialog is set by a single em dash, not two em dashes or a two-em dash.
“I wouldn’t go as far as that, not myself, but[wj]——”

“I wouldn’t go as far as that, not myself, but[wj]—”
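The word joiner required by 8.7.7.7.1 can be added with one substitution. In this sketch the function name is hypothetical, and an em dash that follows white space (as in a totally-obscured day of the month) is deliberately left alone.

```python
import re

WORD_JOINER = "\u2060"
EM_DASH = "\u2014"


def join_em_dashes(text: str) -> str:
    """Insert a word joiner (U+2060) before each em dash that directly
    follows a visible character and is not already joined."""
    # Skip dashes preceded by white space, a word joiner, or another em dash.
    return re.sub(r"(?<=[^\s\u2060\u2014])\u2014", WORD_JOINER + EM_DASH, text)
```

Because dashes that already follow a word joiner are skipped, the function can be run repeatedly over the same text.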

Partially-obscured words

A partially-obscured word is a word that the author chooses to not divulge by consistently obscuring some or all of it. This is not the same as an interruption in dialog, which may interrupt a word, but not obscure it in the same stylistic sense.

8.7.7.8.1
Em dashes are used for partially-obscured years and totally-obscured days of the month.
It was the year 19[wj]— in the town of Metropolis.

She arrived on May —, 1922.
8.7.7.8.2
A regular hyphen is used in partially-obscured years where only the last number is obscured, and in partially-obscured days of the month.
It was the year 192- in the town of Metropolis.

His birthday was August 1-, 1911.
8.7.7.8.3
A non-breaking hyphen (‑ or U+2011) is used when a single letter is obscured in a word.
He performed Mozart’s famous canon, “Leck mich im A‑sche.”
8.7.7.8.4
A two-em dash (⸺ or U+2E3A) preceded by a word joiner glyph (U+2060) is used in partially obscured words.
Sally J[wj]⸺ walked through town.
1. 8.7.7.8.4.1
 If both the start and end of a partially-obscured word are visible, a word joiner is placed on both sides of the two-em-dash.
 A bl[wj]⸺[wj]y murder!
8.7.7.8.5
A three-em dash (⸻ or U+2E3B) is used for completely obscured words.
It was night in the town of ⸻.

Numbers, measurements, and math

8.8.1
Coordinates are set with the prime (′ or U+2032) or double prime (″ or U+2033) glyphs, not single or double quotes.
<abbr>Lat.</abbr> 27° 0' <abbr epub:type="se:compass">N.</abbr>, <abbr>long.</abbr> 20° 1' <abbr class="eoc" epub:type="se:compass">W.</abbr>

<abbr>Lat.</abbr> 27° 0’ <abbr epub:type="se:compass">N.</abbr>, <abbr>long.</abbr> 20° 1’ <abbr class="eoc" epub:type="se:compass">W.</abbr>

<abbr>Lat.</abbr> 27° 0′ <abbr epub:type="se:compass">N.</abbr>, <abbr>long.</abbr> 20° 1′ <abbr class="eoc" epub:type="se:compass">W.</abbr>
8.8.2
Ordinals for Arabic numbers are as follows: st, nd, rd, th.
The 1st, 2d, 3d, 4th.

The 1st, 2nd, 3rd, 4th.
8.8.3
Numbers in a non-mathematical context are spelled out in the following cases:
- If they are from 0–100.
- If they are whole numbers from 0–100 and are made greater by being paired with words like hundred, thousand, million, and so on.
- If they begin a sentence.
- If they are simple fractions.
“They had a gun on the West Front⁠—a seventy-five,” said O’Keefe.

Allowing her twelve thousand miles of straight-line travel through Uranus’ frigid soupy atmosphere.

He died in the year 619.

The vote needed two-thirds majority.

The army consisted of 113,000 soldiers.

He reached out of the unlived depths of nineteen hundred years.
1. 8.8.3.1
 If a series of numbers is close together in a sentence, and one would be spelled out but another wouldn’t, spell out all numbers within that context to maintain visual consistency.
 There the Gulf Stream is 75 miles wide and two hundred ten meters deep.
 
 There the Gulf Stream is seventy-five miles wide and two hundred ten meters deep.
2. 8.8.3.2
 The plural form of spelled-out numbers is formed without an apostrophe. However the possessive or contracted form does include an apostrophe.
 There were, the other answered, half a dozen two four two’s.
 
 There were, the other answered, half a dozen two four twos.
 
 Twice two’s four, and a stone’s a stone.
 
 He was allowed a day or two’s shooting in September.
8.8.4
Numbers of four or more digits include a comma between each group of three digits, counted from the right.
“You will agree to do me service for the sum of 4000 guilders?”

“You will agree to do me service for the sum of 4,000 guilders?”
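Digit grouping as in 8.8.4, together with the exception for four-digit years in 8.8.10.1, can be sketched with Python’s built-in format specification. The function name and the is_year flag are hypothetical.

```python
def format_number(value: int, is_year: bool = False) -> str:
    """Group digits in threes with commas. Years with four digits or
    fewer are left ungrouped, following the rule for dates."""
    if is_year and abs(value) < 10_000:
        return str(value)
    # The "," format option inserts a comma every three digits.
    return f"{value:,}"
```

For example, `format_number(4000)` gives 4,000, while `format_number(1325, is_year=True)` leaves the year as 1325.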

Roman numerals

8.8.5.1
Roman numerals are set using uppercase ASCII, not the Unicode Roman numeral glyphs.
8.8.5.2
Roman numerals have the semantic inflection of z3998:roman.
8.8.5.3
Roman numerals are not followed by trailing periods, except for grammatical reasons.
8.8.5.4
Roman numerals are not followed by ordinal indicators.
Henry VIIIth had six wives.

Henry VIII had six wives.
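A source text that uses the Unicode Roman numeral glyphs can be converted to ASCII as required by 8.8.5.1. The sketch below relies on the compatibility decomposition that Unicode defines for the glyphs in U+2160–U+217F; the function name is hypothetical, and uppercasing the small-numeral glyphs is an assumption based on the rule’s wording.

```python
import unicodedata


def ascii_roman_numerals(text: str) -> str:
    """Replace Unicode Roman numeral glyphs (U+2160 to U+217F) with
    uppercase ASCII letters, leaving all other characters untouched."""
    converted = []
    for character in text:
        if "\u2160" <= character <= "\u217f":
            # NFKC expands a single glyph like U+2167 to the letters VIII.
            converted.append(unicodedata.normalize("NFKC", character).upper())
        else:
            converted.append(character)
    return "".join(converted)
```

Normalization is applied one character at a time so that other glyphs in the text, such as the ellipsis, are not altered.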

Fractions

8.8.6.1
Fractions are set in their appropriate Unicode glyph, if a glyph is available; for example ¼, ½, and ¾ (U+00BC–U+00BE), and the glyphs in the range U+2150–U+2189.
I need 1/4 cup of sugar.

I need ¼ cup of sugar.
8.8.6.2
If a fraction doesn’t have a corresponding Unicode glyph, it is composed using the fraction slash Unicode glyph (⁄ or U+2044) and superscript/subscript Unicode numbers. See this Wikipedia entry for more details.
Roughly 6/10 of a mile.

Roughly ⁶⁄₁₀ of a mile.
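Rules 8.8.6.1 and 8.8.6.2 together describe a lookup with a fallback, which can be sketched as follows. The function name is hypothetical, and the table of precomposed glyphs is deliberately partial.

```python
# A partial table of fractions that have their own Unicode glyph.
PRECOMPOSED = {
    (1, 2): "½", (1, 4): "¼", (3, 4): "¾",
    (1, 3): "⅓", (2, 3): "⅔",
    (1, 5): "⅕", (2, 5): "⅖", (3, 5): "⅗", (4, 5): "⅘",
    (1, 6): "⅙", (5, 6): "⅚",
    (1, 8): "⅛", (3, 8): "⅜", (5, 8): "⅝", (7, 8): "⅞",
}
SUPERSCRIPT = str.maketrans("0123456789", "⁰¹²³⁴⁵⁶⁷⁸⁹")
SUBSCRIPT = str.maketrans("0123456789", "₀₁₂₃₄₅₆₇₈₉")
FRACTION_SLASH = "\u2044"


def fraction_glyph(numerator: int, denominator: int) -> str:
    """Return the precomposed glyph if one is known, otherwise compose
    the fraction from superscript and subscript digits."""
    glyph = PRECOMPOSED.get((numerator, denominator))
    if glyph is not None:
        return glyph
    return (
        str(numerator).translate(SUPERSCRIPT)
        + FRACTION_SLASH
        + str(denominator).translate(SUBSCRIPT)
    )
```

The fraction is not reduced before lookup, so 6/10 stays as written in the source text rather than becoming ⅗.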
8.8.6.3
There is no space between a whole number and its fraction.
There are 365¼ days in a year.

Measurements

8.8.7.1
Dimension measurements are set using the Unicode multiplication glyph (× or U+00D7), not the ASCII letter x or X.
The board was 4 x 3 x 7 feet.

The board was 4 × 3 × 7 feet.
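The substitution in 8.8.7.1 can be sketched with a regular expression that only touches an x standing between two digits. The function name is hypothetical, and the sketch assumes that such an x is always a dimension marker.

```python
import re

MULTIPLICATION_SIGN = "\u00d7"


def set_dimensions(text: str) -> str:
    """Replace an ASCII x or X between two digits with the Unicode
    multiplication glyph, surrounded by regular spaces."""
    return re.sub(r"(?<=\d)\s*[xX]\s*(?=\d)", f" {MULTIPLICATION_SIGN} ", text)
```

Letters inside ordinary words are never matched, because the pattern requires a digit on both sides.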
8.8.7.2
Feet and inches in shorthand are set using the prime (′ or U+2032) or double prime (″ or U+2033) glyphs (not single or double quotes), with a no-break space (U+00A0) separating consecutive feet and inch measurements.
He was 6'[nbsp]1" in height.

He was 6’[nbsp]1” in height.

He was 6′[nbsp]1″ in height.
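Shorthand feet and inches written with straight or curly quotes can be converted to primes as described in 8.8.7.2. The sketch below is an illustration only: the function name is hypothetical, and it handles just the combined feet-and-inches form.

```python
import re

PRIME = "\u2032"
DOUBLE_PRIME = "\u2033"
NO_BREAK_SPACE = "\u00a0"
FEET_MARKS = "'\u2019"   # straight and curly single quotes
INCH_MARKS = '"\u201d'   # straight and curly double quotes

FEET_AND_INCHES = re.compile(rf"(\d+)[{FEET_MARKS}]\s*(\d+)[{INCH_MARKS}]")


def set_feet_and_inches(text: str) -> str:
    """Rewrite measurements like 6'1" with prime glyphs and a
    no-break space between the feet and the inches."""
    return FEET_AND_INCHES.sub(
        lambda m: f"{m.group(1)}{PRIME}{NO_BREAK_SPACE}{m.group(2)}{DOUBLE_PRIME}",
        text,
    )
```

A replacement function is used instead of a template string so that the no-break space can be written as an ordinary Python string escape.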
8.8.7.3
When forming a compound of a number and unit of measurement in which the measurement is abbreviated, the number and unit of measurement are separated with a no-break space (U+00A0), not a dash. For exceptions in money, see 8.8.9.
A 12-<abbr>mm</abbr> pistol.

A 12[nbsp]<abbr>mm</abbr> pistol.

Punctuation in abbreviated measurements

See here for general abbreviation rules that also apply to measurements.

8.8.7.4.1
Abbreviated SI units are set in lowercase without periods. They are not initialisms.
A 12[nbsp]<abbr>mm</abbr> pistol.
8.8.7.4.2
Abbreviated English, Imperial, or US customary units that are one word are set in lowercase with a trailing period. They are not initialisms.
We had two 9[nbsp]<abbr>ft.</abbr> sledges, of 41[nbsp]<abbr>lbs.</abbr> each.

The one exception is G (i.e. G-force), which is an initialism that is set without a period.

There’s a force of over a hundred thousand <abbr epub:type="z3998:initialism">G</abbr>’s.
8.8.7.4.3
Abbreviated English, Imperial, or US customary units that are more than one word (like hp for horse power or mph for miles per hour) are set in lowercase without periods. They are not initialisms.
He drove his 40[nbsp]<abbr>hp</abbr> car at 20[nbsp]<abbr>mph</abbr>.

Math

8.8.8.1
In works that are not math-oriented or that don’t have a significant amount of mathematical equations, equations are set using regular HTML and Unicode.
1. 8.8.8.1.1
 Operators and operands in mathematical equations are separated by a space.
 6−2+2=6
 
 6 − 2 + 2 = 6
2. 8.8.8.1.2
 Operators like subtraction (− or U+2212), multiplication (× or U+00D7), and equivalence (≡ or U+2261) are set using their corresponding Unicode glyphs, not a hyphen or x. Almost all mathematical operators have a corresponding special Unicode glyph.
 6 - 2 x 2 == 2
 
 6 − 2 × 2 ≡ 2
3. 8.8.8.1.3
 Simple in-line variables are set individually with the <var> tag.
 If the value of the labour = <var>x</var> and the force of demand = <var>y</var>, the exchangeable value of the commodity is <var>x</var><var>y</var>
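The spacing rule in 8.8.8.1.1 can be sketched as a substitution over a plain-text equation. The function name and the operator set are assumptions, and the sketch treats every operator as binary, so it is not suitable for expressions that contain a leading negative number.

```python
import re

# Assumption: a small set of binary operators, already set in Unicode.
OPERATORS = "\u2212+\u00d7\u00f7=\u2261"


def space_operators(equation: str) -> str:
    """Surround each operator with exactly one regular space."""
    pattern = f"\\s*([{re.escape(OPERATORS)}])\\s*"
    return re.sub(pattern, r" \1 ", equation).strip()
```

Existing white space around an operator is absorbed by the pattern, so equations that are already spaced come out unchanged.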
8.8.8.2
In works that are math-oriented or that have a significant amount of math, all variables, equations, and other mathematical objects are set using MathML.
1. 8.8.8.2.1
 When MathML is used in a file, the m namespace is declared at the top of the file and used for all subsequent MathML code, as follows:
 xmlns:m="http://www.w3.org/1998/Math/MathML"
 
 This namespace is declared and used even if there is just a single MathML equation in a file.
 
 <html xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops" epub:prefix="z3998: http://www.daisy.org/z3998/2012/vocab/structure/, se: https://standardebooks.org/vocab/1.0" xml:lang="en-GB"> ... <math xmlns="http://www.w3.org/1998/Math/MathML" alttext="x"> <ci>x</ci> </math> 
 
 <html xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops" xmlns:m="http://www.w3.org/1998/Math/MathML" epub:prefix="z3998: http://www.daisy.org/z3998/2012/vocab/structure/, se: https://standardebooks.org/vocab/1.0" xml:lang="en-GB"> ... <m:math alttext="x"> <m:ci>x</m:ci> </m:math> 
2. 8.8.8.2.2
 When possible, Content MathML is provided in an additional <m:annotation-xml> element. (This may not always be possible depending on the complexity of the work.)
  <m:math alttext="x + 1 = y"> <m:semantics> <m:mrow> <m:mi>x</m:mi> <m:mo>+</m:mo> <m:mn>1</m:mn> <m:mo>=</m:mo> <m:mi>y</m:mi> </m:mrow> <m:annotation-xml encoding="MathML-Content"> <m:apply> <m:eq/> <m:apply> <m:plus/> <m:ci>x</m:ci> <m:cn>1</m:cn> </m:apply> <m:ci>y</m:ci> </m:apply> </m:annotation-xml> </m:semantics> </m:math> 
3. 8.8.8.2.3
 Each <m:math> element has an alttext attribute.
 1. 8.8.8.2.3.1
 The alttext attribute describes the contents in the element in plain-text Unicode according to the rules in this specification.
 2. 8.8.8.2.3.2
 Operators in the alttext attribute are surrounded by a single space.
  <m:math alttext="x+1=y"> <m:apply> <m:eq/> <m:apply> <m:plus/> <m:ci>x</m:ci> <m:cn>1</m:cn> </m:apply> <m:ci>y</m:ci> </m:apply> </m:math> 
 
  <m:math alttext="x + 1 = y"> <m:apply> <m:eq/> <m:apply> <m:plus/> <m:ci>x</m:ci> <m:cn>1</m:cn> </m:apply> <m:ci>y</m:ci> </m:apply> </m:math> 
4. 8.8.8.2.4
 When using Presentation MathML, <m:mrow> is used to group subexpressions, but only when necessary. Many elements in MathML, like <m:math> and <m:mtd>, imply <m:mrow>, and redundant elements are not desirable. See this section of the MathML spec for more details.
  <m:math alttext="x"> <m:mrow> <m:mi>x</m:mi> </m:mrow> </m:math> 
 
  <m:math alttext="x"> <m:mi>x</m:mi> </m:math> 
5. 8.8.8.2.5
 If a Presentation MathML expression contains a function, the invisible Unicode function application glyph (U+2061) is used as an operator between the function name and its operand. This element looks exactly like the following, including the comment for readability: <m:mo>⁡<!--hidden U+2061 function application--></m:mo>. (Note that the preceding element contains an invisible Unicode character! It can be revealed with the se unicode-names tool.)
  <m:math alttext="f(x)"> <m:mi>f</m:mi> <m:mrow> <m:mo fence="true">(</m:mo> <m:mi>x</m:mi> <m:mo fence="true">)</m:mo> </m:mrow> </m:math> 
 
  <m:math alttext="f(x)"> <m:mi>f</m:mi> <m:mo>⁡<!--hidden U+2061 function application--></m:mo> <m:mrow> <m:mo fence="true">(</m:mo> <m:mi>x</m:mi> <m:mo fence="true">)</m:mo> </m:mrow> </m:math> 
6. 8.8.8.2.6
 Expressions grouped by parenthesis or brackets are wrapped in an <m:mrow> element, and fence characters are set using the <m:mo fence="true"> element. Separators are set using the <m:mo separator="true"> element. <m:mfenced>, which used to imply both fences and separators, is deprecated in the MathML spec and thus is not used.
  <m:math alttext="f(x,y)"> <m:mi>f</m:mi> <m:mo>⁡<!--hidden U+2061 function application--></m:mo> <m:mfenced> <m:mi>x</m:mi> <m:mi>y</m:mi> </m:mfenced> </m:math> 
 
  <m:math alttext="f(x,y)"> <m:mi>f</m:mi> <m:mo>⁡<!--hidden U+2061 function application--></m:mo> <m:mrow> <m:mo fence="true">(</m:mo> <m:mi>x</m:mi> <m:mo separator="true">,</m:mo> <m:mi>y</m:mi> <m:mo fence="true">)</m:mo> </m:mrow> </m:math> 
7. 8.8.8.2.7
 If a MathML variable includes an overline, it is set by combining the variable’s normal Unicode glyph and the Unicode overline glyph, ‾ (U+203E), in an <m:mover> element. In the alttext attribute, however, the overline is represented with the Unicode combining overline, ◌̅ (U+0305).
  <m:math alttext="x̅"> <m:mover> <m:mi>x</m:mi> <m:mo>‾</m:mo> </m:mover> </m:math> 
8.8.8.3
Ratios are expressed with the Unicode ratio character (∶ or U+2236) surrounded by spaces, not a colon. The ratio character is also used for logical comparisons in non-mathematical contexts, like analogies in running prose.
And so we get four names⁠—two for intellect, and two for opinion⁠—reason or mind, understanding, faith, perception of shadows⁠—which make a proportion⁠—being ∶ becoming ∶∶ intellect ∶ opinion⁠— and science ∶ belief ∶∶ understanding ∶ perception of shadows.

Money

8.8.9.1
Typographically-correct symbols are used for currency symbols.
The exchange rate was L2 for $1.

The exchange rate was £2 for $1.
8.8.9.2
Currency symbols are not abbreviations.

£sd shorthand

£sd shorthand is a way of denoting pre-decimal currencies (pounds, shillings, and pence) common in England and other parts of the world until the 1970s.

8.8.9.3.1
There is no white space between a number and an £sd currency symbol.
£ 14 8 s. 2 d. is known as a “tuppence.”

£14 8<abbr>s.</abbr> 2<abbr>d.</abbr> is known as a “tuppence.”
8.8.9.3.2
Abbreviated currencies used in £sd shorthand are wrapped in <abbr> elements.
£14 8s. 2d. is known as a “tuppence.”

£14 8<abbr>s.</abbr> 2<abbr>d.</abbr> is known as a “tuppence.”
8.8.9.3.3
Abbreviated currencies used in £sd shorthand are followed by periods.
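The three £sd rules above can be combined in a small helper. This is only a sketch: the function name format_lsd and its parameters are hypothetical and are not part of any Standard Ebooks tool. It assumes all three amounts are present and returns an XHTML fragment.

```python
def format_lsd(pounds: int, shillings: int, pence: int) -> str:
    """Hypothetical helper: build an £sd amount as an XHTML fragment.

    - no white space between a number and its currency symbol
    - abbreviated currencies are wrapped in <abbr>
    - abbreviated currencies are followed by periods
    """
    return f"£{pounds} {shillings}<abbr>s.</abbr> {pence}<abbr>d.</abbr>"


print(format_lsd(14, 8, 2))
```

A fuller version would probably omit parts that are zero or absent; that choice is left open here.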

Dates

8.8.10.1
Years with 4 digits are set without commas, but years with 5 or more digits include a comma after every third digit, counting from the right.
Tutankhamun ruled till 1,325 <abbr epub:type="se:era">BC</abbr>.

Tutankhamun ruled till 1325 <abbr epub:type="se:era">BC</abbr>, but ancient aliens built the pyramids in 12,633 <abbr epub:type="se:era">BC</abbr>.
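As an illustration of the year rule, the following sketch formats a year number. The name format_year is hypothetical, and the function assumes a positive integer year with the era handled separately.

```python
def format_year(year: int) -> str:
    """Hypothetical helper: apply the comma rule for years.

    Years of up to 4 digits are returned unchanged; years of 5 or more
    digits get a comma after every third digit, counting from the right.
    """
    if year >= 10000:
        return f"{year:,}"
    return str(year)


print(format_year(1325))
print(format_year(12633))
```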

Latinisms

For times, see 8.11.

8.9.1
Latinisms that can be found in a modern dictionary are not italicized, with some exceptions. Examples of Latinisms that are not italicized include e.g., i.e., ad hoc, viz., ibid., etc..
1. 8.9.1.1
 Exception: inst., the abbreviation of instante mense, is not italicized.
 The two bade adieu to their landlady upon Tuesday, the 4th <abbr xml:lang="la">inst.</abbr>, and departed to Euston Station with the avowed intention of catching the Liverpool express.
8.9.2
Whole passages of Latin language and Latinisms that aren’t found in a modern dictionary are italicized.
8.9.3
&c. is not used, and is replaced with etc..
8.9.4
For Ibid., see Endnotes.
8.9.5
Latinisms that are abbreviations are set in lowercase with periods between words and no spaces between them, except BC, AD, BCE, and CE, which are set without periods, in small caps, and wrapped with <abbr epub:type="se:era">:
abbr[epub|type~="se:era"]{ font-variant: all-small-caps; }

Julius Caesar was born around 100 <abbr epub:type="se:era">BC</abbr>.

Initials and abbreviations

8.10.1
Acronyms (terms made up of initials and pronounced as one word, like NASA, SCUBA, or NATO) are set in small caps, without periods, and are wrapped in an <abbr epub:type="z3998:acronym"> element with corresponding CSS.
[epub|type~="z3998:acronym"]{ font-variant: all-small-caps; }

He was hired by <abbr epub:type="z3998:acronym">NASA</abbr> last week.
8.10.2
Initialisms (terms made up of initials in which each initial is pronounced separately, like M.P., P.S., or U.S.S.R.) are set with periods and without spaces (with some exceptions that follow) and are wrapped in an <abbr epub:type="z3998:initialism"> element.
He was hired by the <abbr epub:type="z3998:initialism">U.S.</abbr> <abbr epub:type="z3998:initialism">F.B.I.</abbr> last week.
8.10.3
When an abbreviation that is not an acronym contains a terminal period, its <abbr> element has the additional eoc class (End of Clause) if the terminal period is also the last period in the clause. Such sentences do not have two consecutive periods.
She loved Italian food like pizza, pasta, <abbr class="eoc">etc.</abbr>

He lists his name alphabetically as Johnson, <abbr class="eoc" epub:type="z3998:given-name">R. A.</abbr>

His favorite hobby was <abbr epub:type="z3998:acronym">SCUBA</abbr>.
8.10.4
Initials of people’s names are each separated by periods and spaces. The group of initials is wrapped in an <abbr epub:type="z3998:*-name"> element. The correct semantic is selected from z3998:personal-name (a complete personal name including last name), z3998:given-name (a person's given, or first, name(s), and/or middle name), or z3998:surname (a person's last name). If it’s unclear whether a name is a first or last name, z3998:personal-name is used as a catchall.
<abbr epub:type="z3998:given-name">H. P.</abbr> Lovecraft described himself as an aged antiquarian.

William <abbr epub:type="z3998:given-name">H.</abbr> Taft was our twenty-seventh president.

<footer> <abbr epub:type="z3998:personal-name">A. A. C.</abbr> Dec 12, 1933 </footer>
8.10.5
Academic degrees are wrapped in an <abbr epub:type="z3998:name-title"> element. Degrees that consist of initials are set with a period between each initial. Degrees that consist of initials followed by abbreviated words are set with a hair space before the word.
Judith Douglas, <abbr class="eoc" epub:type="z3998:name-title">D.D.S.</abbr>

Abraham Van Helsing, <abbr epub:type="z3998:name-title">M.D.</abbr>, <abbr epub:type="z3998:name-title">D.hairspPh.</abbr>, <abbr epub:type="z3998:name-title">D.hairspLit.</abbr>, <abbr>etc.</abbr>, <abbr class="eoc">etc.</abbr>
1. 8.10.5.1
 Some degrees are exceptions:
 - LL.D. does not have a period in LL, because it indicates the plural Legum.
8.10.6
Postal codes and abbreviated US states are set in all caps, without periods or spaces, and are wrapped in an <abbr epub:type="z3998:place"> element.
Washington <abbr epub:type="z3998:place">DC</abbr>.
8.10.7
Abbreviations that are abbreviations of a single word, and that are not acronyms or initialisms (like Mr., Mrs., or lbs.) are set with <abbr>.
1. 8.10.7.1
 Abbreviations ending in a lowercase letter are set without spaces between the letters, and have a trailing period.
2. 8.10.7.2
 Abbreviations without lowercase letters are set without spaces and without a trailing period.
3. 8.10.7.3
 Abbreviations that describe the next word, like Mr., Mrs., Mt., and St., are set with a no-break space (U+00A0) between the abbreviation and its target.
 He called on <abbr>Mrs.</abbr>nbspJones yesterday.
8.10.8
Compass points are separated by periods and spaces. The group of points is wrapped in an <abbr epub:type="se:compass"> element.
He traveled <abbr epub:type="se:compass">S.</abbr>, <abbr epub:type="se:compass">N. W.</abbr>, then <abbr class="eoc" epub:type="se:compass">E. S. E.</abbr>

Exceptions

8.10.9.1
The following are not abbreviations, and are set without periods or spaces.
- A1
- BB, when referring to a BB gun or its projectiles.
- OK
- SOS
- SS, when referring to collars of SS.
8.10.9.2
The following are initialisms, but are set without periods or spaces:
- TV, i.e. television.
- AC and DC, when referring to electrical current.
- G, when used in the sense of G-force. Also see 8.8.7.4.2.
- Stock ticker symbols.
 She bought 125 shares of <abbr epub:type="z3998:initialism">XYZ</abbr> corporation.
8.10.9.3
The following are abbreviations, but are not initialisms. Unlike almost all other abbreviations, they are in all caps and only have a period at the end.
- MS. (manuscript)
- MSS. (manuscripts)
- M. (Monsieur)
- MM. (Messieurs)
<abbr>MM.</abbr>nbspGuy and Luc were putting the finishing touches on the <abbr>MS.</abbr> of their new novel.
8.10.9.4
A.B.C., when used in the sense of the alphabet, is not an abbreviation, and is set with periods between the letters. But other uses, like A.B.C. shops, are abbreviations. (The abbreviation in A.B.C. shop stands for “Aerated Bread Company.”)
She was learning her A.B.C.s

He stopped by the <abbr epub:type="z3998:initialism">A.B.C.</abbr> shop.
8.10.9.5
Company names and brand marks which may be abbreviations, but are stylized without periods by the brand, are kept in the style preferred by the brand.
He read an <abbr epub:type="z3998:initialism">AP</abbr> news wire story.

She called her colleague at <abbr epub:type="z3998:initialism">IBM</abbr>.
8.10.9.6
The abbreviations 1D, 2D, 3D, and 4D, meaning first, second, third, and fourth dimensions, are abbreviations but do not have a trailing period.
8.10.9.7
The words recto and verso are sometimes abbreviated with an initial and a superscript o. They are regular abbreviations, set without periods, and the o is superscripted with a <sup> element.
<abbr>Ch.</abbr> 1, <abbr>fol.</abbr> 2 <abbr>r<sup>o</sup></abbr>.

Times

8.11.1
Times in a.m. and p.m. format are set in lowercase, with periods, and without spaces.
8.11.2
a.m. and p.m. are wrapped in an <abbr> element.

Times as digits

8.11.3.1
Digits in times are separated by a colon, not a period or comma.
8.11.3.2
Times written in digits followed by a.m. or p.m. are set with a no-break space (U+00A0) between the digit and a.m. or p.m..
He called at 6:40nbsp<abbr class="eoc">a.m.</abbr>

Times as words

8.11.4.1
Words in a spelled-out time are separated by spaces, unless they appear before a noun, where they are separated by a hyphen.
He arrived at five thirty.

They took the twelve-thirty train.
8.11.4.2
Times written in words followed by a.m. or p.m. are set with a regular space between the time and a.m. or p.m..
She wasn’t up till seven <abbr class="eoc">a.m.</abbr>
8.11.4.3
Military times that are spelled out (for example, in dialog) are set with hyphens. Leading zeros are spelled out as oh.
He arrived at oh-nine-hundred.

Chemicals and compounds

8.12.1
Molecular compounds are set in Roman, without spaces, and wrapped in an <abbr epub:type="se:compound"> element.
He put extra <abbr epub:type="se:compound">NaCl</abbr> on his dinner.
8.12.2
Elements in a molecular compound are capitalized according to their listing in the periodic table.
8.12.3
Amounts of an element in a molecular compound are set in subscript with a <sub> element.
She drank eight glasses of <abbr epub:type="se:compound">H<sub>2</sub>O</abbr> a day.

Temperatures

8.13.1
The minus sign glyph (− or U+2212), not the hyphen glyph, is used to indicate negative numbers.
8.13.2
Either the degree glyph (° or U+00B0) or the word degrees is acceptable. Works that use both are normalized to use the dominant method.

Abbreviated units of temperature

8.13.3.1
Units of temperature measurement, like Fahrenheit or Celsius, may be abbreviated to F or C.
8.13.3.2
Units of temperature measurement do not have trailing periods.
8.13.3.3
If an abbreviated unit of temperature measurement is preceded by a number, the unit of measurement is first preceded by a hair space (U+200A).
8.13.3.4
Abbreviated units of measurement are set in small caps.
8.13.3.5
Abbreviated units of measurement are wrapped in an <abbr epub:type="se:temperature"> element.
[epub|type~="se:temperature"]{ font-variant: all-small-caps; }

It was −23.33° Celsius (or −10°hairsp<abbr epub:type="se:temperature">F</abbr>) last night.
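The temperature rules above can be combined in one place, as in the sketch below. The function name format_temperature and the constant names are hypothetical. It assumes the degree glyph rather than the word degrees, and an abbreviated unit such as F or C. The small caps styling is left to the CSS rule shown above.

```python
MINUS_SIGN = "\u2212"  # minus sign glyph, not the hyphen
DEGREE = "\u00b0"      # degree glyph
HAIR_SPACE = "\u200a"  # separates the number from the abbreviated unit


def format_temperature(value: float, unit: str) -> str:
    """Hypothetical helper: build an abbreviated temperature as XHTML.

    Negative values use U+2212, the unit has no trailing period, and the
    unit is wrapped in an <abbr> with the se:temperature semantic.
    """
    sign = MINUS_SIGN if value < 0 else ""
    return (
        f"{sign}{abs(value)}{DEGREE}{HAIR_SPACE}"
        f'<abbr epub:type="se:temperature">{unit}</abbr>'
    )


print(format_temperature(-10, "F"))
```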

Scansion

Scansion is the representation of the metrical stresses in lines of verse.

8.14.1
When scansion marks are next to, instead of above, letters, × (U+00D7) indicates an unstressed syllable and / (U+002F) indicates a stressed syllable. They are separated from each other with no-break spaces (U+00A0).
Several of his types, however, constantly occur; <abbr>e.g.</abbr> A and a variant (/ × | / ×) (/ × × | / ×); B and a variant (× / | × /) (× × / | × /); a variant of D (/ × | / × ×); E (/ × × | /). 
8.14.2
When scansion marks are above letters, a combining breve, ◌̆ (U+0306), is used to indicate an unstressed syllable and a combining vertical line above, ◌̍ (U+030D), is used to indicate a stressed syllable. Vertical lines are always above letters, not next to them. Indicating unstressed syllables is optional.
I̍f wĕ sha̍dŏws ha̍ve ŏffe̍ndĕd, / Thi̍nk bŭt thi̍s ănd a̍ll ĭs me̍ndĕd.
8.14.3
Lines of poetry listed on a single line (like in a quotation) are separated by a space, then a forward slash, then a space. Capitalization is preserved for each line.
The famous lines “Wake! For the Sun, who scatter’d into flight / The Stars before him from the Field of Night” are from The Rubáiyát of Omar Khayyám.

Legal cases and terms

8.15.1
Legal cases are set in italics.
8.15.2
Either versus or v. is acceptable in the name of a legal case; if using v., a period follows the v., and it is wrapped in an <abbr> element.
He prosecuted Johnson <abbr>v.</abbr> Smith.

Morse code

Any Morse code that appears in a book is changed to fit Standard Ebooks’ format.

American Morse Code

8.16.1.1
Middle dot glyphs (· or U+00B7) are used for the short mark or dot.
8.16.1.2
En dashes (– or U+2013) are used for the longer mark or short dash.
8.16.1.3
Em dashes (— or U+2014) are used for the long dash (the letter L).
8.16.1.4
If two en dashes are placed next to each other, a hair space (U+200A) is placed between them to keep the glyphs from merging into a longer dash.
8.16.1.5
American Morse Code alone has internal gaps between glyphs within the letters C, O, R, and Z. No-break spaces (U+00A0) are used for these gaps.
8.16.1.6
En spaces (U+2002) are used between letters.
8.16.1.7
Em spaces (U+2003) are used between words.
-- .. .. __ .. - - __ . . .. __ -.. .. . .- - My little old cat.

– – ·· ·· — ·· – – — · · · — –·· ·· · ·– – My little old cat.
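The glyph and spacing rules above can be expressed as a short conversion routine. This is a sketch under stated assumptions, not an existing tool: the function names and the input encoding are hypothetical. Each letter is assumed to be a string in which "." is a dot, "-" is a short dash, "_" is the long dash, and a space is an internal gap; a word is a list of letters, and a message is a list of words.

```python
HAIR_SPACE = "\u200a"
NO_BREAK_SPACE = "\u00a0"
EN_SPACE = "\u2002"
EM_SPACE = "\u2003"
EN_DASH = "\u2013"

# Assumed input encoding (hypothetical) mapped to the glyphs named in the rules.
GLYPHS = {
    ".": "\u00b7",        # middle dot for the short mark
    "-": EN_DASH,         # en dash for the short dash
    "_": "\u2014",        # em dash for the long dash
    " ": NO_BREAK_SPACE,  # internal gap inside a letter
}


def set_letter(letter: str) -> str:
    """Typeset one letter, adding a hair space between adjacent en dashes."""
    out = []
    for mark in letter:
        glyph = GLYPHS[mark]
        if glyph == EN_DASH and out and out[-1] == EN_DASH:
            out.append(HAIR_SPACE)
        out.append(glyph)
    return "".join(out)


def set_morse(words: list[list[str]]) -> str:
    """Join letters with en spaces and words with em spaces."""
    return EM_SPACE.join(
        EN_SPACE.join(set_letter(letter) for letter in word) for word in words
    )


print(set_morse([["--", ".."], ["_"]]))
```

Keeping the spacing characters as named constants makes the otherwise invisible differences between them easier to review.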

Citations

8.17.1
Citations are wrapped in a <cite> element.
8.17.2
Citations that are the source of a quote are preceded by a space and an em dash, within the <cite> element.
“The Moving Finger writes; and, having writ, moves on.” <cite>—The Rubaiyat of Omar Khayyam</cite>.
8.17.3
Citations within a <blockquote> element have the <cite> element as the last direct child of the <blockquote> parent.
<blockquote> “The Moving Finger writes; and, having writ, moves on.” <cite>—The Rubaiyat of Omar Khayyam</cite> </blockquote>

Verses and Chapters of the Bible

8.17.4.1
Citations of passages from the Bible include the name of the book, followed by the chapter number and the verse number. The chapter and the verse numbers are separated by a colon.
1. 8.17.4.1.1
 All chapter and verse numbers are written in Arabic numerals. Similarly, if a book being cited is a “numbered” book, the number is also written in Arabic numerals.
<blockquote> “Though I speak with the tongues of men and of angels, and have not charity, I am become as sounding brass, or a tinkling cymbal.” <cite>—I Corinthians XIII 1</cite> </blockquote>

<blockquote> “Though I speak with the tongues of men and of angels, and have not charity, I am become as sounding brass, or a tinkling cymbal.” <cite>—1 Corinthians 13:1</cite> </blockquote>
8.17.4.2
If an entire chapter, instead of a particular verse, is being cited, then the citation includes the name of the book followed by the chapter number.
“In the beginning God created the heaven and the earth” is the first verse of Genesis I.

“In the beginning God created the heaven and the earth” is the first verse of Genesis 1.
8.17.4.3
If a continuous range of verses is being cited, an en dash (– or U+2013) is placed between the verse numbers indicating the beginning and the end of the range.
Matthew 5:3–11.

Ranges may also span multiple chapters within the same book:

Matthew 5:1–7:29.
8.17.4.4
If a discontinuous group of verses in the same chapter is being cited, each distinct verse number is separated by a comma followed by a space.
Matthew 6:2, 16.
8.17.4.5
If there are multiple citations of the same book, each citation is separated by a semicolon followed by a space, and the name of the book is omitted after the first citation.
Matthew 5:3–11; 5:1–7:29; 6:2, 16
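The citation rules above can be sketched as a small formatter. The function names roman_to_int and bible_citation are hypothetical. The sketch assumes well-formed Roman numerals, and it covers only a single chapter with optional verses or verse ranges; ranges spanning chapters and multiple citations joined by semicolons are not handled.

```python
def roman_to_int(numeral: str) -> int:
    """Convert a well-formed uppercase Roman numeral to an integer."""
    values = {"I": 1, "V": 5, "X": 10, "L": 50, "C": 100, "D": 500, "M": 1000}
    total = 0
    # Pair each numeral with its successor; a smaller value before a larger
    # one is subtracted (as in IV), otherwise it is added.
    for current, following in zip(numeral, numeral[1:] + " "):
        value = values[current]
        if following in values and values[following] > value:
            total -= value
        else:
            total += value
    return total


def bible_citation(book: str, chapter: int, verses=()) -> str:
    """Format a citation; each verse is an int or a (start, end) tuple."""
    if not verses:
        return f"{book} {chapter}"  # whole-chapter citation
    parts = []
    for verse in verses:
        if isinstance(verse, tuple):
            parts.append(f"{verse[0]}\u2013{verse[1]}")  # en dash for ranges
        else:
            parts.append(str(verse))
    return f"{book} {chapter}:" + ", ".join(parts)


print(bible_citation("1 Corinthians", roman_to_int("XIII"), [1]))
```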

Non-Latin Scripts and Transliterations

8.18.1
Greek script is set in italics. All other scripts are not set in italics unless specially required by the text.

Greek

8.18.2.1
Rough breathing marks are set using their precomposed character, if available; for example, Ἁ, ἇ, and Ἧ. If a precomposed character is not available, ̔ (U+0314) is used when the mark must be combined with a character, and ʽ (U+02BD) is used in all other cases.
8.18.2.2
Smooth breathing marks are set with ’ (U+2019) in all cases.

Chinese

8.18.3.1
Wade-Giles is the preferred method of transliterating Chinese script. (See here for discussion.) Transliteration to Wade-Giles from Legge is permitted, but not required.
8.18.3.2
In Wade-Giles transliteration, rough breathing marks are set using ʽ (U+02BD).

Tables

For ditto marks, see 8.7.5.4.

8.19.1
<table> elements that are used to display tabular numerical data, for example columns of sums, have CSS styling for tabular numbers: font-variant-numeric: tabular-nums;.
table td:last-child{ text-align: right; font-variant-numeric: tabular-nums; }

<table> <tbody> <tr> <td>Amount 1</td> <td>100</td> </tr> <tr> <td>Amount 2</td> <td>300</td> </tr> <tr> <td>Total</td> <td>400</td> </tr> </tbody> </table>