Hybrid parser outputs an ever-increasing number of empty tables on each successive page #632

@dhdaines

When running the hybrid parser on multiple pages, for example this document:

hybrid_multipage.pdf

camelot -p all -f csv -o foo.csv hybrid hybridtest.pdf

camelot outputs an extra, empty file for the second page, and the number of empty tables keeps growing on each page after that. In the case of a 59-page document (which I definitely cannot share), the final page has 58 empty tables. As you can imagine, this gets really unwieldy with Excel output.
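To confirm the pattern on your own document, something like the following sketch may help. It counts empty tables per page. The helper name `empty_tables_per_page` is hypothetical, and it assumes each table exposes a `.page` number and a `.df` with an `.empty` flag (as camelot's table objects and pandas DataFrames do); whether the spurious tables actually show up as `df.empty` is also an assumption. Stand-in objects are used here so the snippet runs without camelot or a PDF.

```python
from collections import Counter
from types import SimpleNamespace


def empty_tables_per_page(tables):
    """Count tables whose DataFrame is empty, grouped by page number."""
    counts = Counter()
    for table in tables:
        # Assumption: each table has `.page` and `.df`, and `.df.empty`
        # is True for the spurious tables described in this issue.
        if table.df.empty:
            counts[table.page] += 1
    return dict(counts)


def fake_table(page, empty):
    """Hypothetical stand-in for a parsed table, for demonstration only."""
    return SimpleNamespace(page=page, df=SimpleNamespace(empty=empty))


# Simulated result matching the reported symptom: page n has n - 1 empty tables.
demo = [
    fake_table(1, False),
    fake_table(2, True), fake_table(2, False),
    fake_table(3, True), fake_table(3, True), fake_table(3, False),
]
print(empty_tables_per_page(demo))
```

With camelot installed, the list returned by `camelot.read_pdf("hybridtest.pdf", pages="all", flavor="hybrid")` could presumably be passed in place of `demo`, assuming your version supports the hybrid flavor through that call.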

The fix is really, really simple; a PR will be provided momentarily :-)

Labels: bug
