Skip to content

Releases: lennartpollvogt/markdown-to-data

v2.0.1

13 Aug 18:06

Choose a tag to compare

Bug fixes

  • Handle header content in list and blockquote items during merging (#3)
  • Spacer argument in to_md_parser sets too many (+1) lines (#4)

Full Changelog: v2.0.0...v2.0.1

Release Notes - v2.0.0

03 Aug 09:11

Choose a tag to compare

Breaking Changes

All parsed markdown elements now include start_line and end_line fields.

What Changed

Line number tracking has been added to all parsed markdown elements. Each element in md_list now includes the line numbers where it appears in the original markdown.

Example:

# Input markdown
markdown = """# Header
Some text"""

# Output now includes line numbers
[
    {'header': {'level': 1, 'content': 'Header'}, 'start_line': 1, 'end_line': 1},
    {'paragraph': 'Some text', 'start_line': 2, 'end_line': 2}
]

Migration Required

This is a breaking change. Code that depends on the exact structure of returned dictionaries will need updates.

Before:

{'header': {'level': 1, 'content': 'Title'}}

After:

{'header': {'level': 1, 'content': 'Title'}, 'start_line': 1, 'end_line': 1}

Updated

  • All documentation examples now show line number fields
  • Test suite updated for new structure
  • Version bumped to 2.0.0 following semantic versioning

Full Changelog: v1.0.0...v2.0.0

v1.0.0

15 Dec 20:33

Choose a tag to compare

Release v1.0.0 - Enhanced Structure and Features

Major Changes

  • Complete restructuring of internal data representation for better consistency and usability
  • Added comprehensive task list support with checkbox states
  • Headers now use a standardized format: {'header': {'level': 1, 'content': 'text'}}
  • Lists have been enhanced with explicit nesting and task support: {'list': {'type': 'ul', 'items': [...], 'task': state}}
  • Tables now use a column-based structure for better data manipulation and can easily used with pandas
  • Blockquotes support proper nesting with a consistent structure
  • Code blocks include improved language detection and formatting

Technical Updates

  • Enhanced test coverage

Documentation

  • Completely revised README with updated examples
  • Added comprehensive examples for all supported elements
  • Improved documentation of data structures

Breaking Changes of v1.0.0

  • Changed data structure format for headers, lists, and tables
  • Added for property classified_lines of class Markdown the indent of each line (goal: add parsing to HTML in further development + be more indent sensitive)

For detailed usage examples and documentation, please refer to the updated README.md.

Full Changelog: v0.1.0...v1.0.0

v0.1.0 - Initial release

18 Nov 18:16

Choose a tag to compare

Initial release of markdown-to-data

This is the initial release of the library markdown-to-data. This means it was published on PyPI to provide it on the Index for easy access and usability.

Have fun exploring the use cases!

Full Changelog: https://github.com/lennartpollvogt/markdown-to-data/commits/v0.1.0