[Fixes #13459] Improve sync_geonode_datasets logging #13477

Merged: giohappy merged 2 commits into master from ISSUE_13459 on Oct 21, 2025

@sijandh35 (Contributor)

Fixes #13459

Ported the changes from the PR related to this issue.

Checklist

Reviewing is a process done by project maintainers, mostly on a volunteer basis. We try to keep the overhead as small as possible and appreciate if you help us to do so by completing the following items. Feel free to ask in a comment if you have troubles with any of them.

For all pull requests:

  • Confirm you have read the contribution guidelines
  • You have sent a Contribution Licence Agreement (CLA) as necessary (not required for small changes, e.g., fixing typos in the documentation)
  • Make sure the first PR targets the master branch, eventual backports will be managed later. This can be ignored if the PR is fixing an issue that only happens in a specific branch, but not in newer ones.

The following are required only for core and extension modules (they are welcomed, but not required, for contrib modules):

  • There is a ticket in https://github.com/GeoNode/geonode/issues describing the issue/improvement/feature (a notable exemption is, changes not visible to end-users)
  • The issue connected to the PR must have Labels and Milestone assigned
  • PR for bug fixes and small new features are presented as a single commit
  • PR title must be in the form "[Fixes #<issue_number>] Title of the PR"
  • New unit tests have been added covering the changes, unless there is an explanation on why the tests are not necessary/implemented

Submitting the PR does not require you to check all items, but by the time it gets merged, they should be either satisfied or inapplicable.

@sijandh35 self-assigned this on Sep 2, 2025
@cla-bot added the cla-signed label (CLA Bot: community license agreement signed) on Sep 2, 2025
@gemini-code-assist bot (Contributor) left a comment

Summary of Changes

Hello @sijandh35, I'm Gemini Code Assist [1]! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the logging capabilities of the sync_geonode_datasets management command. It introduces a more reliable and independent logger setup, integrates this improved logging throughout the command's execution, and provides a new option for verbose debug output. These changes aim to make the command's operation more transparent and its error reporting more informative.

Highlights

  • Refactored Logger Setup: The setup_logger utility function has been refactored to be more self-contained and robust. It now explicitly removes any existing handlers from a logger before configuring new ones, ensuring consistent logging behavior. It also removes a previous dependency on Django's settings.LOGGING for basic command logging.
  • Improved sync_geonode_datasets Logging: The sync_geonode_datasets management command now utilizes the refactored setup_logger and replaces all direct print statements with structured logging.info and logging.warning calls. This provides more consistent and configurable output for command execution.
  • Added Debug Logging Option: A new --debug command-line argument has been added to sync_geonode_datasets, allowing users to enable debug-level logging for more verbose output during command execution, which can be helpful for troubleshooting.
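  • Enhanced Error Reporting: Error handling within sync_geonode_datasets has been improved. When an error occurs and --ignore-errors is not set, the command now logs it with logger.error and stack_info=True so that stack information appears in the logs, making issues easier to diagnose. Note that stack_info=True records the call stack at the logging call; the exception traceback itself is attached with exc_info=True or logger.exception.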
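
The logger setup and the --debug option described in the highlights could look roughly like the sketch below. It is not the code from this PR: the setup_logger signature, the logger name, the format string, and the use of plain argparse (instead of a Django management command's add_arguments hook) are all assumptions made for illustration.

```python
import argparse
import logging
import sys


def setup_logger(name="sync_geonode_datasets", debug=False):
    """Return a logger that has exactly one stream handler attached.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    logger = logging.getLogger(name)
    # Iterate over a copy so that removing handlers does not skip entries.
    for old_handler in logger.handlers[:]:
        logger.removeHandler(old_handler)
    handler = logging.StreamHandler(sys.stdout)
    handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
    logger.addHandler(handler)
    logger.setLevel(logging.DEBUG if debug else logging.INFO)
    # Keep records away from ancestor loggers to avoid duplicate output.
    logger.propagate = False
    return logger


# Stand-in for the command-line parsing done by a management command.
parser = argparse.ArgumentParser()
parser.add_argument("--debug", action="store_true", help="Enable debug-level logging")
options = parser.parse_args(["--debug"])

logger = setup_logger(debug=options.debug)
logger.debug("Debug logging is enabled")
```

Because the helper builds its own handler, calling it twice for the same logger name still leaves a single handler, so messages are not duplicated.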

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@sijandh35 requested review from etj and giohappy on September 2, 2025 14:17

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request significantly improves the logging for the sync_geonode_datasets management command by replacing print calls with a configurable logger. The refactoring of setup_logger in command_utils.py is a good move, making it more self-contained and avoiding modifications to global settings. I have a couple of suggestions to make the code even more robust: one to prevent a potential bug when removing logger handlers, and another to improve how exceptions are logged for better diagnostics.

Comment on lines +10 to +11
for old_handler in logger.handlers:
    logger.removeHandler(old_handler)

high

Modifying a list while iterating over it can lead to unpredictable behavior, such as skipping some items. In this case, not all handlers might be removed if there is more than one, which could lead to duplicate log messages. It's safer to iterate over a copy of the list.

Suggested change
- for old_handler in logger.handlers:
-     logger.removeHandler(old_handler)
+ for old_handler in logger.handlers[:]:
+     logger.removeHandler(old_handler)
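
The effect described in this comment can be reproduced with the standard library alone. The sketch below is only an illustration: the helper names and the demo logger names are made up, and NullHandler stands in for whatever handlers a real logger would carry.

```python
import logging


def remove_handlers_in_place(logger):
    # Problematic: logger.handlers shrinks while the loop walks over it.
    for handler in logger.handlers:
        logger.removeHandler(handler)


def remove_handlers_from_copy(logger):
    # Safe: the slice is a snapshot, so every handler is visited once.
    for handler in logger.handlers[:]:
        logger.removeHandler(handler)


def logger_with_handlers(name, count):
    """Return a logger carrying exactly `count` fresh NullHandler objects."""
    logger = logging.getLogger(name)
    logger.handlers.clear()
    for _ in range(count):
        logger.addHandler(logging.NullHandler())
    return logger


in_place = logger_with_handlers("demo.in_place", 3)
remove_handlers_in_place(in_place)
print(len(in_place.handlers))  # 1: the second handler was skipped

from_copy = logger_with_handlers("demo.from_copy", 3)
remove_handlers_from_copy(from_copy)
print(len(from_copy.handlers))  # 0
```

With three handlers, the in-place loop removes the first and third and never visits the second, which is how a stale handler can survive and later duplicate output.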

Comment on lines 77 to +86
  exception_type, error, traceback = sys.exc_info()
- print(exception_type, error, traceback)
+ logger.info(exception_type, error, traceback)

medium

Using logger.info with sys.exc_info() objects will not produce a readable traceback in the logs. It's better to use logger.exception() which automatically captures and formats the exception information, including the traceback. This will make debugging much easier and simplifies the code by removing the need for sys.exc_info().

Suggested change
- exception_type, error, traceback = sys.exc_info()
- print(exception_type, error, traceback)
- logger.info(exception_type, error, traceback)
+ logger.exception(f"An error occurred while processing layer: {layer.alternate}")
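
A small standalone sketch of the suggested approach follows. The logger name, the in-memory stream, and the layer name are placeholders chosen for the demonstration; only logger.exception itself comes from the suggestion.

```python
import io
import logging

# Capture log output in memory so that it can be inspected afterwards.
stream = io.StringIO()
logger = logging.getLogger("demo.exception")
for old_handler in logger.handlers[:]:
    logger.removeHandler(old_handler)
logger.addHandler(logging.StreamHandler(stream))
logger.setLevel(logging.INFO)
logger.propagate = False

layer_name = "geonode:example_layer"  # hypothetical layer name

try:
    raise ValueError("simulated failure")
except ValueError:
    # Logs at ERROR level and appends the formatted traceback of the
    # exception currently being handled.
    logger.exception("An error occurred while processing layer: %s", layer_name)

print(stream.getvalue())
```

By contrast, in logger.info(exception_type, error, traceback) the first argument is used as the message and the remaining two as %-style format arguments. Since the message contains no placeholders, formatting fails and the logging module reports a logging error on stderr instead of emitting the record.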

@codecov bot commented on Sep 2, 2025

Codecov Report

❌ Patch coverage is 23.52941% with 26 lines in your changes missing coverage. Please review.
✅ Project coverage is 73.61%. Comparing base (df342c3) to head (d5e2a80).
⚠️ Report is 53 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master   #13477      +/-   ##
==========================================
- Coverage   73.61%   73.61%   -0.01%     
==========================================
  Files         918      918              
  Lines       53703    53710       +7     
  Branches     6126     6128       +2     
==========================================
+ Hits        39536    39538       +2     
- Misses      12586    12592       +6     
+ Partials     1581     1580       -1     

@giohappy removed their request for review on October 20, 2025 08:09

  )
  return
- print(f"There are {len(dataset_errors)} layers which could not be updated because of errors")
+ logger.info(f"There are {len(dataset_errors)} layers which could not be updated because of errors")
Contributor

This line is always printed, even if there are no errors. It is the explicit example of a misleading log line given in the issue.

Contributor Author

Updated.
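
The thread does not show how the line was changed. One way to avoid the misleading message, sketched here under the assumption that dataset_errors is a list of failed dataset names, is to log the count only when the list is non-empty. The function name log_error_summary, the logger name, and the success message are hypothetical.

```python
import io
import logging

# Capture log output in memory so that it can be inspected afterwards.
stream = io.StringIO()
logger = logging.getLogger("demo.summary")
for old_handler in logger.handlers[:]:
    logger.removeHandler(old_handler)
logger.addHandler(logging.StreamHandler(stream))
logger.setLevel(logging.INFO)
logger.propagate = False


def log_error_summary(dataset_errors):
    """Log the error count only when at least one dataset failed."""
    if not dataset_errors:
        logger.info("All datasets were updated without errors")
        return
    logger.warning(
        "There are %d layers which could not be updated because of errors",
        len(dataset_errors),
    )


log_error_summary([])
log_error_summary(["geonode:layer_a", "geonode:layer_b"])
print(stream.getvalue())
```

Logging the failure count at warning level rather than info also makes it easier to filter for runs that need attention.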

@sijandh35 requested a review from etj on October 21, 2025 06:10
@giohappy merged commit d99c837 into master on Oct 21, 2025
14 of 16 checks passed
@giohappy deleted the ISSUE_13459 branch on October 21, 2025 11:04

Labels

cla-signed (CLA Bot: community license agreement signed)
master

Development

Successfully merging this pull request may close these issues.

Improve sync_geonode_datasets logging

3 participants