Replies: 10 comments
-
What is the file size of your CSV files?
-
How many rows are in the substream that starts with the Generate Rows transform and feeds into the Cartesian join? It should contain only one row; otherwise the Cartesian join will multiply the row counts of both streams.
-
Generate Rows creates a single row containing just the filename of the Excel file that we need to create, so that all 4 streams can write into the same Excel file. The Cartesian join combines that one row from "Generate Rows" with one row from "Get File Names", so I don't think that is the issue. Is there a way to free up memory once the first sheet has been written to the Excel file?
-
The first CSV has 35,000 rows.
-
As I said earlier: adding a parameter to your pipeline so that it writes your sheets one at a time would eliminate the need for blocking transforms. You would only need a single pipeline (more or less the one you have right now, but with an additional parameter). You would have to read your input files multiple times, but the whole process would be less resource-heavy, cause fewer headaches, and maybe even run faster.
-
I would also check the field definitions of the output fields of those text file inputs. It could be that you have String fields defined with the maximum length, which might cause the writer to try to create 2 GB text fields inside the Excel file.
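One way to act on this suggestion is to measure how long the values in each column really are, so explicit field lengths can be set on the text file inputs. The following is only a minimal sketch under stated assumptions: the class name `CsvFieldLengths` and its method are hypothetical and not part of Apache Hop, the split is naive (no quoted fields containing the delimiter), and the header line is measured like any other line.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Apache Hop.
class CsvFieldLengths {

    /** Returns the longest observed value length per column, reading one line at a time. */
    static List<Integer> maxLengths(Reader source, String delimiter) {
        List<Integer> max = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            while ((line = reader.readLine()) != null) {
                // Naive split: assumes no quoted fields that contain the delimiter.
                String[] values = line.split(Pattern.quote(delimiter), -1);
                for (int i = 0; i < values.length; i++) {
                    if (i == max.size()) {
                        max.add(0); // first time this column index is seen
                    }
                    max.set(i, Math.max(max.get(i), values[i].length()));
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return max;
    }

    public static void main(String[] args) {
        String sample = "id,name,comment\n1,Ann,ok\n22,Bartholomew,needs review\n";
        // Prints the maximum length seen in each of the three columns.
        System.out.println(maxLengths(new StringReader(sample), ","));
    }
}
```

Because only one line is held in memory at a time, the same scan could be pointed at a `FileReader` for the real 35,000-row file without much heap pressure.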
-
Have you enabled the “Stream XLSX data” option?
Perhaps we should enable it by default.
-
I have tried this and I am getting the error below:
2025/10/15 06:39:36 - Generate Report 2.0 - ERROR: Error writing line :java.lang.IllegalArgumentException: Attempting to write a row[0] in the range [0,0] that is already written to disk.
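As far as I can tell, the wording of this error matches the message Apache POI's streaming (SXSSF) sheet raises when a row is created at an index that has already been flushed out of its in-memory window, or that already exists in the file being appended to. The toy model below uses only the Java standard library to illustrate that behaviour. It is a sketch, not POI's implementation: the class `SlidingWindowSheet`, its window size, and its eviction rule are assumptions made for illustration.

```java
import java.util.TreeMap;

// Toy model of a streaming sheet writer with a bounded in-memory row window.
// Hypothetical class for illustration; this is not Apache POI code.
class SlidingWindowSheet {
    private final int windowSize;                                  // max rows kept in memory
    private final TreeMap<Integer, String> window = new TreeMap<>();
    private int lastFlushedRow = -1;                               // -1: nothing flushed yet

    SlidingWindowSheet(int windowSize) {
        if (windowSize < 1) {
            throw new IllegalArgumentException("windowSize must be >= 1");
        }
        this.windowSize = windowSize;
    }

    /** Creates a row; indexes at or below the last flushed row are rejected. */
    void createRow(int rowNum, String content) {
        if (rowNum <= lastFlushedRow) {
            throw new IllegalArgumentException(
                "Attempting to write a row[" + rowNum + "] in the range [0,"
                    + lastFlushedRow + "] that is already written to disk.");
        }
        window.put(rowNum, content);
        // Evict the lowest-numbered rows once the window is full; they can no longer be edited.
        while (window.size() > windowSize) {
            lastFlushedRow = window.pollFirstEntry().getKey();
        }
    }

    int lastFlushedRow() { return lastFlushedRow; }

    int rowsInMemory() { return window.size(); }

    public static void main(String[] args) {
        SlidingWindowSheet sheet = new SlidingWindowSheet(1);
        sheet.createRow(0, "header");
        sheet.createRow(1, "first data row");    // row 0 leaves the window here
        try {
            sheet.createRow(0, "header again");  // e.g. a second writer restarting at row 0
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

In this model, the failure appears as soon as a second writer goes back to row 0 after the first has moved on. If the same mechanism is at work in the pipeline, the streams writing into the shared workbook may be starting at a row that a previous write has already committed. That is a hypothesis to check against a reproducible sample, not a confirmed diagnosis.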
-
I confirm that for XLSX generation with POI, it's better to use streaming. @vkydhyani, is it possible for you to share a sample project to help us reproduce this?
-
Apache Hop version?
2.15
Java version?
Tried on both 17 and 25
Operating system
Windows
What happened?
Hi Team,
I am getting a Java heap space error while trying to merge 4 CSV files into one Excel file.
The system configuration is PROD-like: 32 GB RAM, 4 cores, 8 processors.
Error:
Exception: java.lang.OutOfMemoryError thrown from the UncaughtExceptionHandler in thread "Timer-1"
Exception in thread "HopGuiPipelineGraph: redraw timer" Exception in thread "pool-2-thread-1" java.lang.OutOfMemoryError: Java heap space
2025/10/14 09:50:30 - Generate Additional Sheet.0 - ERROR: UnexpectedError:
Exception in thread "Timer-4" java.lang.OutOfMemoryError: Java heap space
java.lang.OutOfMemoryError: Java heap space
at java.base/java.util.HashMap$KeySet.toArray(HashMap.java:1002)
at org.apache.commons.vfs2.impl.DefaultFileMonitor.run(DefaultFileMonitor.java:476)
at java.base/java.lang.Thread.runWith(Thread.java:1487)
at java.base/java.lang.Thread.run(Thread.java:1474)
2025/10/14 09:50:30 - Generate Additional Sheet.0 - ERROR: java.lang.OutOfMemoryError: Java heap space
Can you please suggest what is wrong here?
Issue Priority
Priority: 3
Issue Component
Component: Transforms
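A note on the numbers above: the `Java heap space` limit is set by the JVM's maximum heap (for example via `-Xmx`), not by the 32 GB of physical RAM. A small check like the following sketch prints what the running JVM is actually allowed to use. The class name `HeapCheck` is hypothetical, and where the heap limit is configured for this particular Hop installation is not shown in the thread.

```java
// Hypothetical helper for illustration; uses only java.lang.Runtime.
class HeapCheck {

    /** Converts a byte count to whole mebibytes. */
    static long toMiB(long bytes) {
        return bytes / (1024L * 1024L);
    }

    /** Maximum heap the running JVM will attempt to use, in MiB. */
    static long maxHeapMiB() {
        return toMiB(Runtime.getRuntime().maxMemory());
    }

    public static void main(String[] args) {
        Runtime rt = Runtime.getRuntime();
        System.out.println("Max heap (MiB):  " + maxHeapMiB());
        System.out.println("Committed (MiB): " + toMiB(rt.totalMemory()));
        System.out.println("Free (MiB):      " + toMiB(rt.freeMemory()));
    }
}
```

Running it with the same Java options the Hop launch script uses would show whether the maximum heap is far below the available RAM.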