Skip to content

Commit 65f7a96

Browse files
committed
NRL-1187 read all files
1 parent 31413b5 commit 65f7a96

File tree

1 file changed

+5
-1
lines changed
  • terraform/account-wide-infrastructure/modules/glue/src

1 file changed

+5
-1
lines changed

terraform/account-wide-infrastructure/modules/glue/src/pipeline.py

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,11 @@ def run(self):
3838
def extract(self):
3939
"""Extract JSON data from S3"""
4040
self.logger.info(f"Extracting data from {self.source_path} as JSON")
41-
return self.spark.read.schema(self.schema).json(self.source_path)
41+
return (
42+
self.spark.read.option("recursiveFileLookup", "true")
43+
.schema(self.schema)
44+
.json(self.source_path)
45+
)
4246

4347
def transform(self, dataframe):
4448
"""Apply a list of transformations on the dataframe"""

0 commit comments

Comments
 (0)