Skip to content

FileSystem to be chosen based on the file system of Input Path A.#15

Open
petr-pokorny-absa wants to merge 1 commit intomasterfrom
feature/support-s3a-paths
Open

FileSystem to be chosen based on the file system of Input Path A.#15
petr-pokorny-absa wants to merge 1 commit intomasterfrom
feature/support-s3a-paths

Conversation

@petr-pokorny-absa
Copy link
Collaborator

This pull request makes a targeted change to the way the Hadoop FileSystem is instantiated in the ArgsParser utility. Instead of using the default file system from the Hadoop configuration, it now determines the file system based on the input path, which improves compatibility with multiple file system schemes (e.g., local, HDFS, S3).

Overview

Release Notes

  • Use correct fs instance when validating input parameter

@github-actions
Copy link

github-actions bot commented Feb 4, 2026

☂️ Python Coverage

current status: ✅

Overall Coverage

Lines Covered Coverage Threshold Status
3 3 100% 70% 🟢

New Files

No new covered files...

Modified Files

No covered modified files...

updated for commit: 81dcfd7 by action🐍

@github-actions
Copy link

github-actions bot commented Feb 4, 2026

JaCoCo code coverage report - scala 2.12.17

Overall Project 94.57% 🍏
Files changed 100% 🍏

File Coverage
ArgsParser.scala 100% 🍏

@github-actions
Copy link

github-actions bot commented Feb 4, 2026

JaCoCo code coverage report - scala 2.11.12

Overall Project 95.6% 🍏
Files changed 100% 🍏

File Coverage
ArgsParser.scala 100% 🍏

val config = spark.sparkContext.hadoopConfiguration
val fs = FileSystem.get(config)
val fsFilePath = new Path(args.inputA)
val fs = fsFilePath.getFileSystem(config)
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

plase create a separate fs instance per inputs and for output - it can be different fs - e.g. hdfs and s3 bucket

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants