cmark - My own fork of cmark for commonmark conversion

Age	Commit message	Author
2016-09-14	Allow tabs after setext header line.	John MacFarlane
	See jgm/commonmark.js#109
2016-09-13	Regenerated scanners.c.	John MacFarlane

2016-09-13	Don't let URI schemes start with spaces.	John MacFarlane

2016-09-13	Merge pull request #153 from gaborcsardi/patch-1	John MacFarlane
	autolink scheme can contain digits
2016-09-12	Fixed h2..h6 HTML blocks (jgm/CommonMark#430).	John MacFarlane
	Added regression test.
2016-09-12	autolink scheme can contain digits	Gábor Csárdi

2016-08-26	Fix nullary function declarations in cmark.h	Nick Wellnhofer
	Fixes strict prototypes warnings.
2016-07-16	Removed size_t and ssize_t defs for WIN32.	John MacFarlane

2016-07-15	Reformatted.	John MacFarlane

2016-07-14	Merge pull request #137 from foonathan/master	John MacFarlane
	CMake fixes
2016-07-13	Fix sourcepos for blockquotes.	John MacFarlane
	Fixes #142.
2016-07-13	Replaced check for `\n` with `S_is_line_end_char`.	John MacFarlane

2016-07-13	Empty list items cannot interrupt paragraphs (spec change).	John MacFarlane

2016-07-11	Fix mistaken sourcepos for atx headers.	John MacFarlane
	Closes #141.
2016-07-11	Removed "two blanks breaks out of a list" feature.	John MacFarlane

2016-07-11	Don't allow ordered lists to interrupt paragraphs unless...	John MacFarlane
	...they start with 1.
2016-07-03	Fix chunk_set_cstr with suffix of current string	Nick Wellnhofer
	It's possible that cmark_chunk_set_cstr is called with a substring (suffix) of the current string. Delay freeing of the chunk content to handle this case correctly. Fixes issue #139.
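The aliasing problem described in this commit can be sketched as follows. This is a minimal illustration, not cmark's code: the `chunk` struct and this `chunk_set_cstr` are hypothetical stand-ins. The point is only that the old storage is freed after the new copy is made, so the argument may safely point into the current string.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a chunk: a string slice that may or may
 * not own its storage. */
typedef struct {
  char *data;
  size_t len;
  int alloc; /* nonzero if data was allocated by chunk_set_cstr */
} chunk;

/* Replace the chunk's contents with a copy of str. str may point into
 * c->data (for example, a suffix of the current string), so the old
 * storage is released only after the copy has been made. */
static void chunk_set_cstr(chunk *c, const char *str) {
  char *old = c->alloc ? c->data : NULL;

  if (str == NULL) {
    c->data = NULL;
    c->len = 0;
    c->alloc = 0;
  } else {
    size_t len = strlen(str);
    char *copy = malloc(len + 1);
    if (copy == NULL)
      return; /* leave the chunk unchanged on allocation failure */
    memcpy(copy, str, len + 1);
    c->data = copy;
    c->len = len;
    c->alloc = 1;
  }
  free(old); /* safe now: str has already been copied */
}
```

Freeing `old` before the `memcpy` would read freed memory whenever `str` is a suffix of the chunk's own data.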
2016-07-02	Fixed ATX headers and thematic breaks to allow tabs as well as spaces.	John MacFarlane

2016-06-27	Change export install location	Jonathan Müller

2016-06-27	Export the targets on installation	Jonathan Müller
	This allows using them in other cmake projects.
2016-06-24	Reformatted.	John MacFarlane

2016-06-24	Removed redundant check.	John MacFarlane

2016-06-24	Changed `process_emphasis` to get better results in corner cases.	John MacFarlane
	This will need corresponding spec changes.
	The change is this: when considering matches between an interior delimiter run (one that can open and can close) and another delimiter run, we require that the sum of the lengths of the two delimiter runs mod 3 is not 0.
	Thus, for example, in
	    *a**b*
	    1 23 4
	delimiter 1 cannot match 2, since the sum of the lengths of the first delimiter run (1) and the second (1,2) == 3. Thus we get `<em>a**b</em>` instead of `<em>a</em><em>b</em>`.
	This gives better behavior on things like
	    *a**b**c*
	which previously got parsed as
	    <em>a</em><em>b</em><em>c</em>
	and now would be parsed as
	    <em>a<strong>b</strong>c</em>
	With this change we get four spec test failures, but in each case the output seems more "intuitive":
	```
	Example 386 (lines 6490-6494) Emphasis and strong emphasis
	foobarbaz
	--- expected HTML
	+++ actual HTML
	@@ -1 +1 @@
	-<p><em>foo</em><em>bar</em><em>baz</em></p>
	+<p><em>foo<strong>bar</strong>baz</em></p>
	Example 389 (lines 6518-6522) Emphasis and strong emphasis
	foobar
	--- expected HTML
	+++ actual HTML
	@@ -1 +1 @@
	-<p><em>foo</em><em>bar</em></p>
	+<p><em>foo<strong>bar</strong></em></p>
	Example 401 (lines 6620-6624) Emphasis and strong emphasis
	*foobarbaz*
	--- expected HTML
	+++ actual HTML
	@@ -1 +1 @@
	-<p><em><em>foo</em>bar</em>baz</p>
	+<p><strong>foo<em>bar</em>baz</strong></p>
	Example 442 (lines 6944-6948) Emphasis and strong emphasis
	foobar*
	--- expected HTML
	+++ actual HTML
	@@ -1 +1 @@
	-<p><em><em>foo</em>bar</em></p>
	+<p><strong>foobar</strong></p>
	```
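The matching rule from this commit message can be sketched as a small predicate. This is an illustration only: the `delim_run` struct and `runs_can_match` are hypothetical names and do not reproduce the actual `process_emphasis` code. It rejects a pairing when either run is interior (can both open and close) and the two lengths sum to a multiple of 3.

```c
#include <stdbool.h>

/* Hypothetical, simplified record for one delimiter run. */
typedef struct {
  int length;     /* number of delimiter characters in the run */
  bool can_open;  /* run may open emphasis */
  bool can_close; /* run may close emphasis */
} delim_run;

/* Return true if opener and closer may be paired. If either run is
 * interior (can both open and close), the sum of the two lengths must
 * not be a multiple of 3. */
static bool runs_can_match(const delim_run *opener, const delim_run *closer) {
  if (!opener->can_open || !closer->can_close)
    return false;
  bool interior = (opener->can_open && opener->can_close) ||
                  (closer->can_open && closer->can_close);
  if (interior && (opener->length + closer->length) % 3 == 0)
    return false;
  return true;
}
```

For the `*a**b*` case from the message, the first run (length 1, opener only) and the middle run (length 2, interior) sum to 3, so they are not paired; the first and last runs (both length 1, neither interior) are.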
2016-06-23	Removed position from delimiter struct.	John MacFarlane
	It is no longer needed; only the brackets struct needs it. Thanks to @robinst.
2016-06-23	Removed check for same mem allocator in S_can_contain.	John MacFarlane
	This is too strict, as it prevents the use of dynamically loaded extensions: see https://github.com/jgm/cmark/pull/123#discussion_r67231518. Documented in man page and public header that one should use the same memory allocator for every node in a tree.
2016-06-23	Ported robinst's changes to link parsing.	John MacFarlane
	See https://github.com/jgm/commonmark.js/pull/101.
	This uses a separate stack for brackets, instead of putting them on the delimiter stack. This avoids the need for looking through the delimiter stack for the next bracket. It also avoids a shortcut reference lookup when the reference text contains brackets. The change dramatically improved performance on the nested links pathological test for commonmark.js. It has a smaller but measurable effect here.
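The idea of a dedicated bracket stack can be sketched as below. The struct fields and function names are assumptions made for illustration and are not taken from cmark's source. Keeping brackets on their own linked stack means the most recent `[` is always at the top, with no scan through emphasis delimiters.

```c
#include <stdlib.h>

/* Hypothetical bracket record, kept separately from emphasis delimiters. */
typedef struct bracket {
  struct bracket *previous; /* next-older open bracket, or NULL */
  int position;             /* offset of the '[' in the input */
  int image;                /* nonzero if the bracket was "![" */
} bracket;

/* Push a new open bracket; returns 1 on success, 0 on allocation failure. */
static int push_bracket(bracket **top, int position, int image) {
  bracket *b = malloc(sizeof *b);
  if (b == NULL)
    return 0;
  b->previous = *top;
  b->position = position;
  b->image = image;
  *top = b;
  return 1;
}

/* Pop the most recent open bracket, if any. */
static void pop_bracket(bracket **top) {
  bracket *b = *top;
  if (b != NULL) {
    *top = b->previous;
    free(b);
  }
}
```

With this layout, handling a `]` only needs to inspect `*top`, which is a constant-time operation regardless of how many emphasis delimiters sit in between.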
2016-06-23	Revert "Better parsing of shortcut references."	John MacFarlane
	This reverts commit c069cb55bcadfd0f45890d846ff412b3c892eb87.
2016-06-22	Better parsing of shortcut references.	John MacFarlane
	We reuse the parser for reference labels, instead of just assuming that a slice of the link text will be a valid reference label. (It might contain interior brackets, for example.)
2016-06-22	cmark_reference_lookup: Return NULL if reference is null string.	John MacFarlane

2016-06-06	msvc: Fix warnings and errors	Vicent Marti

2016-06-06	cmark: Remove old include	Vicent Marti

2016-06-06	mem: Rename the new APIs	Vicent Marti

2016-06-06	mem: Add a `realloc` pointer to the memory handler	Vicent Marti

2016-06-06	Do not include `stdbool`	Vicent Marti

2016-06-06	node: Memory diet	Vicent Marti
	Reduce the storage size for the `cmark_code` struct
2016-06-06	buffer: revert to using a 32-bit bufsize_t	Vicent Marti

2016-06-06	node: Memory diet	Vicent Marti
	Save node information in flags instead of using one boolean for each property.
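A minimal sketch of the flags approach follows, assuming a hypothetical `node` struct and flag names (they are not copied from cmark). Each boolean property becomes one bit in a single integer field, so adding a property does not grow the struct.

```c
#include <stdint.h>

/* Hypothetical per-node properties, one bit each. */
enum {
  NODE_FLAG_OPEN = 1 << 0,
  NODE_FLAG_LAST_LINE_BLANK = 1 << 1
};

/* Hypothetical node with a single flags field instead of one
 * boolean member per property. */
typedef struct {
  uint16_t flags;
} node;

/* Set or clear one flag. */
static void node_set_flag(node *n, uint16_t flag, int on) {
  if (on)
    n->flags |= flag;
  else
    n->flags &= (uint16_t)~flag;
}

/* Return nonzero if the flag is set. */
static int node_has_flag(const node *n, uint16_t flag) {
  return (n->flags & flag) != 0;
}
```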
2016-06-06	cmark: Implement support for custom allocators	Vicent Marti

2016-06-06	config: Add SSIZE_T compat for Win32	Vicent Marti

2016-06-06	cmake: Global handler for OOM situations	Vicent Marti

2016-06-06	buffer: proper safety checks for unbounded memory	Vicent Marti
	The previous work for unbounded memory usage and overflows on the buffer API had several shortcomings:
	1. The total size of the buffer was limited by arbitrarily small precision on the storage type for buffer indexes (typedef'd as `bufsize_t`). This is not a good design pattern in secure applications, particularly since it requires the addition of helper functions to cast to/from the native `size` types and the custom type for the buffer, and check for overflows.
	2. The library was calling `abort` on overflow and memory allocation failures. This is not a good practice for production libraries, since it turns a potential RCE into a trivial, guaranteed DoS to the whole application that is linked against the library. It defeats the whole point of performing overflow or allocation checks when the checks will crash the library and the enclosing program anyway.
	3. The default size limits for buffers were essentially unbounded (capped to the precision of the storage type) and could lead to DoS attacks by simple memory exhaustion (particularly critical on 32-bit platforms). This is not a good practice for a library that handles arbitrary user input.
	Hence, this patchset provides slight (but in my opinion critical) improvements in this area, copying some of the patterns we've used in the past for high-throughput, security-sensitive Markdown parsers:
	1. The storage type for buffer sizes is now platform native (`ssize_t`). Ideally, this would be a `size_t`, but several parts of the code expect buffer indexes to be possibly negative. Either way, switching to a `size` type is a strict improvement, particularly on 64-bit platforms. All the helpers that assured that values cannot escape the `size` range have been removed, since they are superfluous.
	2. The overflow checks have been removed. Instead, the maximum size for a buffer has been set to a safe value for production usage (32 MB) that can be proven not to overflow in practice. Users that need to parse particularly large Markdown documents can increase this value. A static, compile-time check has been added to ensure that the maximum buffer size cannot overflow on any growth operations.
	3. The library no longer aborts on buffer overflow. The CMark library now follows the convention of other Markdown implementations (such as Hoedown and Sundown) and silently handles buffer overflows and allocation failures by dropping data from the buffer. The result is that pathological Markdown documents that try to exploit the library will instead generate truncated (but valid, and safe) outputs.
	All tests after these small refactorings have been verified to pass.
	---
	NOTE: Regarding 32-bit overflows, generating test cases that crash the library is trivial (any input document larger than 2 GB will crash CMark), but most Python implementations have issues with large strings to begin with, so a test case cannot be added to the pathological tests suite, since it's written in Python.
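The pattern this commit message describes (a fixed maximum size, a compile-time sanity check, and dropping data instead of aborting) might look roughly like the sketch below. All names here (`strbuf`, `strbuf_put`, `BUF_MAX_SIZE`) are hypothetical and the code is not cmark's buffer implementation; `ssize_t` is taken from the POSIX header `<sys/types.h>`.

```c
#include <stdlib.h>
#include <string.h>
#include <sys/types.h> /* ssize_t (POSIX) */

typedef ssize_t bufsize_t;

/* Hypothetical cap: 32 MB, as mentioned in the commit message. */
#define BUF_MAX_SIZE ((bufsize_t)32 * 1024 * 1024)

/* Compile-time check: with at least 4 bytes, doubling a size that is
 * below the cap cannot overflow bufsize_t. */
typedef char bufsize_check[(sizeof(bufsize_t) >= 4) ? 1 : -1];

typedef struct {
  unsigned char *ptr;
  bufsize_t size;  /* bytes in use */
  bufsize_t asize; /* bytes allocated */
} strbuf;

/* Append up to len bytes. Data that would exceed the cap, or that
 * cannot be allocated, is silently dropped. Returns the number of
 * bytes actually appended. */
static bufsize_t strbuf_put(strbuf *buf, const unsigned char *data,
                            bufsize_t len) {
  if (len <= 0)
    return 0;
  bufsize_t room = BUF_MAX_SIZE - buf->size;
  if (len > room)
    len = room; /* truncate instead of aborting */
  if (len == 0)
    return 0;

  bufsize_t needed = buf->size + len;
  if (needed > buf->asize) {
    bufsize_t new_asize = buf->asize ? buf->asize : 64;
    while (new_asize < needed)
      new_asize *= 2;
    if (new_asize > BUF_MAX_SIZE)
      new_asize = BUF_MAX_SIZE;
    unsigned char *p = realloc(buf->ptr, (size_t)new_asize);
    if (p == NULL)
      return 0; /* allocation failed: drop the data */
    buf->ptr = p;
    buf->asize = new_asize;
  }
  memcpy(buf->ptr + buf->size, data, (size_t)len);
  buf->size += len;
  return len;
}
```

A caller that cares about truncation can compare the return value with `len`; a caller that does not simply gets a shorter buffer.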
2016-06-06	Fix character type detection in commonmark.c	Nick Wellnhofer
	- Implement cmark_isalpha.
	- Check for ASCII character before implicit cast to char.
	- Use internal ctype functions in commonmark.c.
	Fixes test failures on Windows and undefined behavior.
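To illustrate why the ASCII check matters, here is a minimal sketch. `safe_isalpha` is a hypothetical name, not cmark's `cmark_isalpha`. It rejects anything outside the ASCII range before narrowing, so a code point such as U+0141 is not truncated to 0x41 (`'A'`) and misclassified.

```c
#include <stdint.h>

/* Hypothetical helper: classify a Unicode code point as an ASCII
 * letter. The range check comes first, so non-ASCII code points are
 * never narrowed to a char-sized value. */
static int safe_isalpha(int32_t c) {
  if (c < 0 || c > 0x7F)
    return 0;
  unsigned char ch = (unsigned char)c;
  return (ch >= 'a' && ch <= 'z') || (ch >= 'A' && ch <= 'Z');
}
```

Passing negative or out-of-range values to the standard `<ctype.h>` functions is undefined behavior, which is one reason to use a helper like this on decoded code points.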
2016-06-02	commonmark renderer: fixed code block as first in list item.	John MacFarlane
	We don't want a blank line before a code block when it's the first thing in a list item.
2016-06-01	renderer: no_linebreaks instead of no_wrap.	John MacFarlane
	We generally want this option to prohibit any breaking in things like headers (not just wraps, but softbreaks).
2016-06-01	Coerce realurllen to int.	John MacFarlane
	This is an alternate solution for pull request #132, which introduced a new warning on the comparison:
	latex.c:191:20: warning: comparison of integers of different signs: 'size_t' (aka 'unsigned long') and 'bufsize_t' (aka 'int') [-Wsign-compare]
	    if (realurllen == link_text->as.literal.len &&
	        ~~~~~~~~~~ ^  ~~~~~~~~~~~~~~~~~~~~~~~~~
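The kind of fix described here can be sketched as follows. `url_matches_text` is a hypothetical function written for illustration, and `bufsize_t` is assumed to be `int` as the warning text indicates. The `size_t` result of `strlen` is coerced to `int` once, so both sides of the comparison are signed. The sketch assumes the URL length fits in an `int`.

```c
#include <stddef.h>
#include <string.h>

typedef int bufsize_t; /* as reported in the warning above */

/* Hypothetical helper: return nonzero if url and the first text_len
 * bytes of text are identical. The strlen result is coerced to int so
 * the comparison with text_len does not mix signed and unsigned. */
static int url_matches_text(const char *url, const char *text,
                            bufsize_t text_len) {
  int url_len = (int)strlen(url); /* assumes the length fits in int */
  return url_len == text_len &&
         memcmp(url, text, (size_t)text_len) == 0;
}
```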
2016-06-01	Merge pull request #130 from MathieuDuponchelle/fix_unused_variable	John MacFarlane
	inlines: Remove unused variable "link_text"
2016-06-01	Merge pull request #132 from BenedictC/master	John MacFarlane
	Changed type from int to size_t to fix implicit type conversion warning
2016-06-01	- Changed type from int to size_t to fix implicit type conversion warning	Benedict Cohen

2016-06-01	inlines: Remove unused variable "link_text"	Mathieu Duponchelle

2016-05-26	Add 2016 to copyright	Kevin Burke
	I thought I had an outdated version of the binary because it printed 2015 for the version string.