From 46a98104de76f0bd0b88a3c06f016e2c2b08ac18 Mon Sep 17 00:00:00 2001 From: John MacFarlane Date: Thu, 9 Jul 2015 15:57:29 -0700 Subject: Updated test/smart_punct.txt. This incorporates the changes in jgm/commonmark.js#58. Sequences of hyphens are converted to sequences of em and en dashes, with no hyphens. Currently this test fails. --- test/smart_punct.txt | 84 +++++++++++++++++++++++++++++++++++++++++++++------- 1 file changed, 74 insertions(+), 10 deletions(-) (limited to 'test') diff --git a/test/smart_punct.txt b/test/smart_punct.txt index 14da741..ac13a26 100644 --- a/test/smart_punct.txt +++ b/test/smart_punct.txt @@ -1,5 +1,9 @@ ## Smart punctuation +Open quotes are matched with closed quotes. +The same method is used for matching openers and closers +as is used in emphasis parsing: + . "Hello," said the spider. "'Shelob' is my name." @@ -28,6 +32,10 @@ So is ‘pine.’

‘He said, “I want to go.”’

. +A single quote that isn't an open quote matched +with a close quote will be treated as an +apostrophe: + . Were you alive in the 70's? . @@ -40,12 +48,19 @@ Here is some quoted '`code`' and a "[quoted link](url)".

Here is some quoted ‘code’ and a “quoted link”.

. +Here the first `'` is treated as an apostrophe, not +an open quote, because the final single quote is matched +by the single quote before `jolly`: + . 'tis the season to be 'jolly' .

’tis the season to be ‘jolly’

. +An unmatched double quote will be interpreted as a +left double quote, to facilitate this style: + . "A paragraph with no closing quote. @@ -55,40 +70,89 @@ Here is some quoted '`code`' and a "[quoted link](url)".

“Second paragraph by same speaker, in fiction.”

. +Quotes that are escaped come out as literal straight +quotes: + . \"This is not smart.\" This isn\'t either. +5\'8\" .

"This is not smart." -This isn't either.

+This isn't either. +5'8"

. +Two hyphens form an en-dash, three an em-dash. + . -Some dashes: one---two --- -three---four --- five. +Some dashes: em---em +en--en +em --- em +en -- en +2--3 . -

Some dashes: one—two — -three—four — five.

+

Some dashes: em—em +en–en +em — em +en – en +2–3

. +A sequence of more than three hyphens is +parsed as a sequence of em and/or en dashes, +with no hyphens. If possible, a homogeneous +sequence of dashes is used (so, 10 hyphens += 5 en dashes, and 9 hyphens = 3 em dashes). +When a heterogeneous sequence must be used, +the em dashes come first, followed by the en +dashes, and as few en dashes as possible are +used (so, 7 hyphens = 2 em dashes an 1 en +dash). + . -Escaped dashes: \-- \-\-\-. -. -

Escaped dashes: -- ---.

+one- +two-- +three--- +four---- +five----- +six------ +seven------- +eight-------- +nine--------- +thirteen-------------. +. +

one- +two– +three— +four–– +five—– +six—— +seven—–– +eight–––– +nine——— +thirteen———––.

. +Hyphens can be escaped: + . -Dashes between numbers: 5--7, 255--66, 1987--1999. +Escaped hyphens: \-- \-\-\-. . -

Dashes between numbers: 5–7, 255–66, 1987–1999.

+

Escaped hyphens: -- ---.

. +Three periods form an ellipsis: + . Ellipses...and...and.... .

Ellipses…and…and….

. +Periods can be escaped if ellipsis-formation +is not wanted: + . No ellipses\.\.\. . -- cgit v1.2.3