Missing whitespace from cells

Apologies in advance if this is a duplicate issue.

I need to parse a PDF document where some cells contain two separate numbers. See the image below:

<img width="377" height="112" alt="Image" src="https://github.com/user-attachments/assets/2238a37d-aa5f-498e-a547-9e9e3598ad8b" />

I used the following example code:

```

using (PdfDocument document = PdfDocument.Open("./document2.pdf", options))
{
    var page = ObjectExtractor.Extract(document, 1);
    var ea = new SpreadsheetExtractionAlgorithm();
    IReadOnlyList<Table> tables = ea.Extract(page); 
    var table = tables[0];
    var rows = table.Rows;
    using var streamWriter = new StreamWriter("./myjson.json");
    new JSONWriter().Write(streamWriter, table);
}

```

This produces the following (incorrect) result:

<img width="333" height="140" alt="Image" src="https://github.com/user-attachments/assets/bf28beb3-fa24-4ff1-ba06-3b7447703126" />

When I use [Camelot (Python)](https://github.com/camelot-dev/camelot) I get the following (correct) result:

<img width="297" height="74" alt="Image" src="https://github.com/user-attachments/assets/e60690f9-76ef-489d-a67e-9062c992d85f" />

Is this a bug or am I doing something wrong?

A working solution in .NET would be ideal. I appreciate any help.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Missing whitespace from cells #51

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Missing whitespace from cells #51

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions