This repository was archived by the owner on Oct 13, 2025. It is now read-only.

MediaDC shows too many photos in summary and non-existent photos in duplicates list #271

@thomasmerz

Describe the bug
I let MediaDC run on my Nextcloud and it found some duplicates (yeah!), but when I try to delete one or both of them, I get:

An error occurred while deleting the file

It also shows that it has scanned:

Photos 17152 files

But there are only about 16400 photos, plus some files that are not photos (shell scripts, text files, …)!

By accident, I found many errors like the following in appdata_oczqbo00w7z0/mediadc/logs, which seem to be related:

--- Logging error ---
Traceback (most recent call last):
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/logging/__init__.py", line 1100, in emit
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/logging/__init__.py", line 943, in format
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/logging/__init__.py", line 678, in format
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/logging/__init__.py", line 368, in getMessage
TypeError: %d format: a real number is required, not str
Call stack:
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/main.py", line 43, in <module>
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/python/task.py", line 164, in process_task
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/python/task.py", line 189, in process_image_task
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/nc_py_api/files.py", line 84, in fs_nodes_info
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/nc_py_api/files.py", line 84, in <listcomp>
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/nc_py_api/files.py", line 305, in db_record_to_fs_node
  File "/var/www/html/data/appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/nc_py_api/files.py", line 254, in get_file_full_path
Message: 'get_file_full_path: cant parse: %d'
Arguments: ('1abb73880decc812e1db1f6f065ead61',)
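The TypeError in the traceback is consistent with a %d placeholder receiving a string argument. Below is a minimal Python sketch that reproduces this class of error, assuming the logger call looks roughly like the message and argument shown above. The logger name and the helper function are hypothetical, and this is not the actual nc_py_api code.

```python
import io
import logging

# Capture log output in memory so it can be inspected afterwards.
stream = io.StringIO()
logger = logging.getLogger("mediadc_repro")  # hypothetical logger name
logger.addHandler(logging.StreamHandler(stream))
logger.setLevel(logging.DEBUG)
logger.propagate = False

# String argument taken from the "Arguments:" line in the log above.
file_id = "1abb73880decc812e1db1f6f065ead61"


def format_ok(fmt, arg):
    """Return True if %-formatting fmt with arg succeeds (hypothetical helper)."""
    try:
        fmt % (arg,)
        return True
    except TypeError:
        return False


# %d requires a number, so a str argument fails; %s accepts any object.
bad_ok = format_ok("get_file_full_path: cant parse: %d", file_id)
good_ok = format_ok("get_file_full_path: cant parse: %s", file_id)

# logging does not raise here: it reports "--- Logging error ---" on stderr
# and the record is never written to the handler's stream.
logger.error("get_file_full_path: cant parse: %d", file_id)
after_bad = stream.getvalue()

# With %s the message is written as intended.
logger.error("get_file_full_path: cant parse: %s", file_id)
after_good = stream.getvalue()
```

Because the logging module swallows the formatting exception, the task keeps running but the intended "cant parse" message is lost. Using %s (or converting the value first) would log the id as-is.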

To Reproduce
Steps to reproduce the behavior:

  1. Start a task on a folder with photos.
  2. When it has finished, compare the reported number of photos with the output of find . -type f -iname "*.jpg" | wc -l and find . -type f -iname "*.png" | wc -l
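For comparison, the same count can be sketched in Python. This is only an approximation of the two find commands above: the function name and the default extension list are assumptions, and it counts regular files whose names end in the given extensions, ignoring case.

```python
from collections import Counter
from pathlib import Path


def count_by_extension(root, extensions=(".jpg", ".png")):
    """Count regular files under root whose names end with one of the extensions.

    Mirrors find's -type f and -iname "*.ext": symlinks are skipped and the
    match is case-insensitive.
    """
    counts = Counter()
    for path in Path(root).rglob("*"):
        if path.is_symlink() or not path.is_file():
            continue
        name = path.name.lower()
        for ext in extensions:
            if name.endswith(ext):
                counts[ext] += 1
                break
    return counts


if __name__ == "__main__":
    # Run from the folder that was scanned by the task.
    result = count_by_extension(".")
    for ext, n in sorted(result.items()):
        print(ext, n)
    print("total", sum(result.values()))
```

Note that these patterns do not cover other image extensions such as .jpeg, so the totals are only a lower bound on the number of image files in the folder.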

Expected behavior
MediaDC should not report that it has scanned more photos than actually exist.

Screenshots

Log files in the appdata directory mentioned above:

.rw-r--r-- www-data@fpm_app www-data@fpm_app 40 MB 2 hours ago --  04-06-2025_11-34-48.log
.rw-r--r-- www-data@fpm_app www-data@fpm_app 40 MB an hour ago --  04-06-2025_11-56-27.log
.rw-r--r-- www-data@fpm_app www-data@fpm_app  0 B  an hour ago --  04-06-2025_11-56-34.log

Server:

  • OS: Docker
  • CPU: x86
  • Nextcloud version: Nextcloud Hub 10 (31.0.5)
  • Database configuration: mariadb:10.5
  • Python version: appdata_oczqbo00w7z0/mediadc/binaries/mediadc_musllinux_amd64/libpython3.10.so.1.0
  • MediaDC version: mediadc: 0.4.0
  • cloud_py_api version: cloud_py_api: 0.2.0

Additional context
The photos are on an external storage. I had manually deleted some very old photos and some very new "duplicate" photos there, outside of Nextcloud.
