Detect directory paths in object get without recursive flag by Copilot · Pull Request #3114 · PelicanPlatform/pelican

Copilot · 2026-02-12T16:58:54Z

Remove redundant recursiveFromQuery check
Add specific ParameterError type validation in test
Restore recursiveFromQuery logic with simplified condition
Verify compilation
Run tests
Final code review

Changes Made

Issue 1: Simplified recursiveFromQuery check (handle_http.go:1242)

The original suggestion was to remove the redundant tj.rejectCollections check in the condition, not to remove the entire block. The corrected logic:

if recursiveFromQuery {
    tj.rejectCollections = false
}

This ensures that when a URL query parameter forces recursive mode, we unconditionally clear rejectCollections.
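As a minimal sketch of why the extra condition was redundant, the two forms below leave the flag in the same state for every combination of inputs. The `transferJob` struct and both function names are hypothetical stand-ins for illustration, not the actual types in handle_http.go.

```go
package main

import "fmt"

// transferJob is a hypothetical stand-in for the client's transfer job type.
type transferJob struct {
	rejectCollections bool
}

// clearGuarded mirrors the original condition, including the redundant check.
func clearGuarded(tj *transferJob, recursiveFromQuery bool) {
	if recursiveFromQuery && tj.rejectCollections {
		tj.rejectCollections = false
	}
}

// clearSimplified mirrors the simplified condition.
func clearSimplified(tj *transferJob, recursiveFromQuery bool) {
	if recursiveFromQuery {
		tj.rejectCollections = false
	}
}

func main() {
	// Walk all four input combinations and compare the resulting flag.
	for _, fromQuery := range []bool{false, true} {
		for _, reject := range []bool{false, true} {
			a := &transferJob{rejectCollections: reject}
			b := &transferJob{rejectCollections: reject}
			clearGuarded(a, fromQuery)
			clearSimplified(b, fromQuery)
			fmt.Println(fromQuery, reject, a.rejectCollections == b.rejectCollections)
		}
	}
}
```

Assigning false to a flag that is already false is a no-op, so dropping the second operand changes nothing observable.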

Issue 2: More specific error validation (fed_long_test.go:1042)

Added proper error type checking:

var pe *error_codes.PelicanError
require.True(t, errors.As(getErr, &pe), "Error should be wrapped in PelicanError")
assert.Equal(t, 1000, pe.Code(), "Should be Parameter error code")

Original prompt

This section details the original issue you should resolve

<issue_title>Ungraceful Handling of Object Retrieval with Directory Path</issue_title>
<issue_description>When a directory path is passed to the pelican object get command without the recursive flag, it is not handled gracefully. Here is the current behavior:

Standard flow through Cache:
An error is printed, and an empty file is created at the destination.

Direct read through Origin:
The command runs successfully. A file is created at the destination, containing HTML of a table listing the contents of the requested directory.

</issue_description>

Comments on the Issue (you are @copilot in this section)
@brianaydemir I was able to reproduce this just now, including when going through a cache (which was not the case when this issue was originally written).

@brianaydemir Testing with a v7.22.0 client against a v7.23.0-rc.2 origin and cache, a `pelican object get` on a path that maps to a directory on a POSIX file system results in an HTML file produced by XRootD that contains a listing of the directory's contents. This happens when going through the cache and when reading directly from the origin.

@brianaydemir Closing as not planned. I don't want to change how we configure XRootD by default, e.g., by turning off its directory listings, and in the absence of reports from users, it's not clear it's worth the effort to update the client and caches to check for `get`s on directories.

@bbockelm Eh -- this is simpler than you might think, especially after some work done last fall to make sure PROPFIND functions.
In all cases for downloading an object in non-recursive mode, it appears we do at least one HEAD (to see if the cache is alive or to get the final size of the object if we didn't try to sort cache attempts). If the HEAD comes back with a non-zero Age, then it must be an object.

If we get to this stanza:

Worst case: do a separate HEAD request to get the size

we now need to determine if it's a collection as well as a guess for size -- we can definitely do a PROPFIND instead.</comment_new>

Fixes Ungraceful Handling of Object Retrieval with Directory Path #1706


client/handle_http.go

Add a WithRejectCollections TransferOption that causes downloadObject to verify (via PROPFIND) that the remote path is not a WebDAV collection before attempting the download. When the remote path is a directory, the transfer fails early with a clear error:

    remote object is a directory; use recursive mode to download directories

The CLI's object get command sets WithRejectCollections(!isRecursive) so that non-recursive downloads of directories produce a helpful error instead of downloading an HTML directory listing.

To avoid an unnecessary PROPFIND round-trip, the collection check is skipped when:

- sortAttempts already received a non-zero Age header from a cache (cached objects are never directories), or
- a previous downloadObject call for the same job already confirmed the path is not a collection (cached in TransferJob.notCollection).

If the URL contains a ?recursive query parameter that forces recursive mode, rejectCollections is cleared in NewTransferJob so the check is not applied.

Closes #1706

All items addressed.

turetske · 2026-03-02T17:50:05Z

client/handle_http.go


+	// If the URL query forced recursive mode, clear rejectCollections since the user
+	// explicitly indicated they want recursive behavior via the URL.
+	if recursiveFromQuery && tj.rejectCollections {


@copilot The check of tj.rejectCollections is redundant; no matter what it is, it will end up as false.

Fixed in commit 0b70906. Removed the redundant check - when recursiveFromQuery is true, recursive is already true, so rejectCollections would never have been set to true in the first place.

turetske · 2026-03-02T17:54:20Z

client/fed_long_test.go

+				_, getErr = client.DoGet(fed.Ctx, downloadURL, destPath, false, opts...)
+			}
+
+			require.Error(t, getErr, "Expected an error when passing a dir path to object get command without recursive option set")


@copilot I believe we can be more specific? Isn't this a pelican parameter error?

Fixed in commit 0b70906. Now validates that the error is specifically a ParameterError with code 1000:

var pe *error_codes.PelicanError
require.True(t, errors.As(getErr, &pe), "Error should be wrapped in PelicanError")
assert.Equal(t, 1000, pe.Code(), "Should be Parameter error code")

- Remove redundant recursiveFromQuery check in handle_http.go (line 1242)
  The check was unnecessary because recursiveFromQuery implies recursive=true, and rejectCollections is only set when NOT recursive
- Add specific ParameterError validation in fed_long_test.go (line 1042)
  Now verifies error is wrapped as PelicanError with code 1000 (Parameter error)

Co-authored-by: turetske <498529+turetske@users.noreply.github.com>

turetske · 2026-03-02T19:11:02Z

client/handle_http.go

-	// If the URL query forced recursive mode, clear rejectCollections since the user
-	// explicitly indicated they want recursive behavior via the URL.
-	if recursiveFromQuery && tj.rejectCollections {
-		tj.rejectCollections = false


@copilot Is this correct? I suggested just removing the tj.rejectCollections check but keeping recursiveFromQuery. Basically, if recursiveFromQuery, then tj.rejectCollections = false.

You're right, I misunderstood. Fixed in commit 23647a9. Now keeping the logic but with the simplified condition:

if recursiveFromQuery {
    tj.rejectCollections = false
}

This removes the redundant tj.rejectCollections check while preserving the behavior of clearing rejectCollections when the URL query forces recursive mode.

Keep the logic to clear rejectCollections when recursiveFromQuery is true, but remove the redundant check of tj.rejectCollections in the condition. Now simply:

if recursiveFromQuery {
    tj.rejectCollections = false
}

Co-authored-by: turetske <498529+turetske@users.noreply.github.com>

turetske · 2026-03-02T21:12:22Z

@bbockelm Passing this back to you

Copilot AI assigned Copilot and bbockelm Feb 12, 2026

Copilot started work on behalf of bbockelm February 12, 2026 16:59

Copilot AI changed the title ~~[WIP] Fix ungraceful handling of object retrieval with directory path~~ Detect directory paths in object get without recursive flag Feb 12, 2026

Copilot AI requested a review from bbockelm February 12, 2026 17:11

Copilot finished work on behalf of bbockelm February 12, 2026 17:11

bbockelm previously requested changes Feb 21, 2026


Copilot started work on behalf of bbockelm February 21, 2026 17:04

Copilot AI requested a review from bbockelm February 21, 2026 17:13

Copilot finished work on behalf of bbockelm February 21, 2026 17:13

bbockelm force-pushed the copilot/fix-directory-path-retrieval branch from d26575b to 8f8a4a8 February 21, 2026 23:42

bbockelm added the bug and client labels Feb 21, 2026

bbockelm force-pushed the copilot/fix-directory-path-retrieval branch from 8f8a4a8 to b828f65 February 22, 2026 02:13

bbockelm marked this pull request as ready for review February 22, 2026 03:00

bbockelm requested a review from turetske February 22, 2026 03:00

turetske self-assigned this Feb 26, 2026

turetske requested changes Mar 2, 2026


Copilot started work on behalf of turetske March 2, 2026 17:54

Copilot AI requested a review from turetske March 2, 2026 18:00

Copilot finished work on behalf of turetske March 2, 2026 18:00

turetske requested changes Mar 2, 2026


Copilot started work on behalf of turetske March 2, 2026 19:11

Copilot AI requested a review from turetske March 2, 2026 19:15

Copilot finished work on behalf of turetske March 2, 2026 19:15

turetske approved these changes Mar 2, 2026

Copilot wants to merge 3 commits into main from copilot/fix-directory-path-retrieval