Slow table parsing for huge tables

`from docx.table import _Cell, Table
from docx.oxml.table import CT_Tbl            


elif isinstance(child, CT_Tbl):
      # table -> JSON
      # DEBUG
      table_obj = Table(child, parent)
      list_table = [[k.text for k in j.cells] for j in table_obj.rows]
      str_table = self.list_to_md_table(list_table)
      yield str_table`

This is my current code reads Word tables and converts them to JSON, but performance degrades significantly when handling large tables — for example, a 9000-row × 10-column table takes too long to parse.

Is there a way to optimize or accelerate this process? Any suggestions for improving efficiency would be greatly appreciated! 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Slow table parsing for huge tables #1516

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Slow table parsing for huge tables #1516

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions