added 2 new options to make ParseXLSX more efficient with Large Multi-tab Workbooks #98

Open
cboleary wants to merge 4 commits into doy:master from cboleary:master

cboleary commented Apr 2, 2021

This is an attempt to fix a performance issue with the way ParseXLSX handles large spreadsheets.
If you use the ReadData method, it parses every worksheet in the workbook, which can consume a lot of time,
especially if you just want to find out the worksheet tab names.

Added options:
--just_find_sheet_names
If just_find_sheet_names is set, the code quickly returns the worksheetInfo without parsing each worksheet.
and
--sheet_filter: a comma-separated string of worksheet names to parse
If sheet_filter is not set, the module operates as it did before, parsing EVERY worksheet in the workbook.
If it's set, the module parses only the worksheets listed, which can be far more efficient.
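
The selection behavior described above can be sketched independently of the module. This is a Python illustration only, not the PR's Perl code: `read_workbook`, `Worksheet`, and `parse_sheet` are hypothetical names, and it assumes that skipped worksheets are still returned as empty shells (name only, no cells) rather than omitted.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Worksheet:
    """Hypothetical stand-in for a parsed worksheet."""
    name: str
    cells: List[list] = field(default_factory=list)  # empty list = unparsed shell


def read_workbook(
    sheet_names: List[str],
    parse_sheet: Callable[[str], List[list]],
    just_find_sheet_names: bool = False,
    sheet_filter: Optional[str] = None,
) -> List[Worksheet]:
    """Return one Worksheet per tab, parsing only the tabs that were requested.

    parse_sheet is the expensive step; it is called only for selected sheets.
    """
    # Turn "A, B" into {"A", "B"}; None means "no filter, parse everything".
    wanted = None
    if sheet_filter:
        wanted = {n.strip() for n in sheet_filter.split(",") if n.strip()}

    result = []
    for name in sheet_names:
        skip = just_find_sheet_names or (wanted is not None and name not in wanted)
        if skip:
            # Assumption: skipped sheets come back as shells with no cells.
            result.append(Worksheet(name))
        else:
            result.append(Worksheet(name, parse_sheet(name)))
    return result


if __name__ == "__main__":
    tabs = ["Summary", "Raw", "Notes"]
    sheets = read_workbook(tabs, lambda n: [[n, "data"]], sheet_filter="Raw")
    for ws in sheets:
        print(ws.name, len(ws.cells))
```

With neither option set, every tab goes through `parse_sheet`, matching the original behavior; with `just_find_sheet_names` set, `parse_sheet` is never called.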

cboleary added 3 commits April 3, 2021 06:32
…tested)

I needed to return a shell of a WorkSheet with an empty list for the Cells Attribute