cmark - My own fork of cmark for commonmark conversion

Age	Commit message (Collapse)	Author
2020-01-23	Rearrange struct cmark_node	Nick Wellnhofer
	Introduce multi-purpose data/len members in struct cmark_node. This is mainly used to store literal text for inlines, code and HTML blocks. Move the content strbuf for blocks from cmark_node to cmark_parser. When finalizing nodes that allow inlines (paragraphs and headings), detach the strbuf and store the block content in the node's data/len members. Free the block content after processing inlines. Reduces size of struct cmark_node by 8 bytes.
2019-03-28	Remove leftover includes of memory.h.	John MacFarlane
	Closes #290.
2016-12-09	Correctly initialize chunk in S_process_line (#170)	Nick Wellnhofer
	The `alloc` member wasn't initialized. This also allows to add an assertion in `chunk_rtrim` which doesn't work for alloced chunks.
2016-10-11	Ran 'make format' to reformat code.	John MacFarlane

2016-07-03	Fix chunk_set_cstr with suffix of current string	Nick Wellnhofer
	It's possible that cmark_chunk_set_cstr is called with a substring (suffix) of the current string. Delay freeing of the chunk content to handle this case correctly. Fixes issue #139.
2016-06-24	Reformatted.	John MacFarlane

2016-06-06	msvc: Fix warnings and errors	Vicent Marti

2016-06-06	cmark: Implement support for custom allocators	Vicent Marti

2016-06-06	cmake: Global handler for OOM situations	Vicent Marti

2016-06-06	buffer: proper safety checks for unbounded memory	Vicent Marti
	The previous work for unbounded memory usage and overflows on the buffer API had several shortcomings: 1. The total size of the buffer was limited by arbitrarily small precision on the storage type for buffer indexes (typedef'd as `bufsize_t`). This is not a good design pattern in secure applications, particualarly since it requires the addition of helper functions to cast to/from the native `size` types and the custom type for the buffer, and check for overflows. 2. The library was calling `abort` on overflow and memory allocation failures. This is not a good practice for production libraries, since it turns a potential RCE into a trivial, guaranteed DoS to the whole application that is linked against the library. It defeats the whole point of performing overflow or allocation checks when the checks will crash the library and the enclosing program anyway. 3. The default size limits for buffers were essentially unbounded (capped to the precision of the storage type) and could lead to DoS attacks by simple memory exhaustion (particularly critical in 32-bit platforms). This is not a good practice for a library that handles arbitrary user input. Hence, this patchset provides slight (but in my opinion critical) improvements on this area, copying some of the patterns we've used in the past for high throughput, security sensitive Markdown parsers: 1. The storage type for buffer sizes is now platform native (`ssize_t`). Ideally, this would be a `size_t`, but several parts of the code expect buffer indexes to be possibly negative. Either way, switching to a `size` type is an strict improvement, particularly in 64-bit platforms. All the helpers that assured that values cannot escape the `size` range have been removed, since they are superfluous. 2. The overflow checks have been removed. Instead, the maximum size for a buffer has been set to a safe value for production usage (32mb) that can be proven not to overflow in practice. Users that need to parse particularly large Markdown documents can increase this value. A static, compile-time check has been added to ensure that the maximum buffer size cannot overflow on any growth operations. 3. The library no longer aborts on buffer overflow. The CMark library now follows the convention of other Markdown implementations (such as Hoedown and Sundown) and silently handles buffer overflows and allocation failures by dropping data from the buffer. The result is that pathological Markdown documents that try to exploit the library will instead generate truncated (but valid, and safe) outputs. All tests after these small refactorings have been verified to pass. --- NOTE: Regarding 32 bit overflows, generating test cases that crash the library is trivial (any input document larger than 2gb will crash CMark), but most Python implementations have issues with large strings to begin with, so a test case cannot be added to the pathological tests suite, since it's written in Python.
2016-04-09	Reformatted.	John MacFarlane

2016-01-18	Automatic code reformat.	John MacFarlane

2015-12-28	Reformat sources.	John MacFarlane

2015-12-19	Use fully qualified versions of constants.	John MacFarlane

2015-08-10	Don't include static in CMARK_INLINE.	John MacFarlane
	ALso don't set CMARK_INLINE to __inline if we're compiling under MSVC in cplusplus mode.
2015-08-10	Fix MSVC inline errors when cmark is included in other sources that don't ↵	Kevin Wojniak
	have the same set of disabled warnings
2015-07-27	Use clang-format, llvm style, for formatting.	John MacFarlane
	* Reformatted all source files. * Added 'format' target to Makefile. * Removed 'astyle' target. * Updated .editorconfig.
2015-06-07	Helper to safely call strlen	Nick Wellnhofer

2015-06-07	Convert code base to strbuf_t	Nick Wellnhofer
	There are probably a couple of places I missed. But this will only be a problem if we use a 64-bit bufsize_t at some point. Then, we'll get warnings from -Wshorten-64-to-32.
2015-06-06	astyle formatting changes.	John MacFarlane

2015-05-30	Fix for UBSAN note	Jeroen Ooms

2015-05-14	Allow NULL value in string setters	Nick Wellnhofer
	Supersedes pull request #34.
2015-05-14	Store link URL and title as cmark_chunk	Nick Wellnhofer

2014-12-29	Added cmark_ prefix to functions in cmark_ctype.	John MacFarlane

2014-12-29	Added cmark_ctype.h with locale-independent isspace, ispunct, etc.	John MacFarlane
	Otherwise cmark's behavior varies unpredictably with the locale. `is_punctuation` in utf8.h has also been adjusted so that everything that counts all ASCII symbol characters count as punctuation, even though some are not in P* character classes.
2014-12-15	Re-added cmark_ prefix to strbuf and chunk.	John MacFarlane
	Reverts 225d720.
2014-12-13	Removed cmark_ prefix on chunk and strbuf.	John MacFarlane
	This isn't needed any more since we don't expose these in the API.
2014-11-28	Always define short name macros for private symbols	Nick Wellnhofer
	Since chunk.h and buffer.h are private now, there's no need to optionally disable the short name macros.
2014-11-18	Add node constructor and accessors to the public API	Nick Wellnhofer
	The approach I'm taking is to copy inline literals internally to NULL-terminated C strings if requested by an accessor. This allows to return a 'const char *' that doesn't have to be freed by the caller.
2014-11-16	Cast void pointers explicitly	Nick Wellnhofer
	Needed for C++ compatibility.
2014-11-16	Rename include guards for consistency and to avoid reserved identifiers	Craig Barnes

2014-11-12	Prefix names in chunk.h	Nick Wellnhofer

2014-11-12	Prefix names in buffer.h	Nick Wellnhofer

2014-10-06	- Use of calloc instead of malloc	tchetch
	- Test for NULL after allocation
2014-09-09	Rename to strbuf	Vicent Marti

2014-09-09	Add chunk.h	Vicent Marti