-
Notifications
You must be signed in to change notification settings - Fork 75
Description
Hello,
I have encountered some problems related to diamond while annotating genomes with bakta latest version (1.12), installed via conda.
In the logs, the diamond error codes are -6, -7 and -11. I saw that -6 and -11 were already reported for previous bakta versions.
File "/home/mibi/miniconda3/envs/bakta/bin/bakta", line 10, in <module>
sys.exit(main())
~~~~^^
File "/home/mibi/miniconda3/envs/bakta/lib/python3.13/site-packages/bakta/main.py", line 285, in main
cdss_psc, cdss_pscc, cdss_not_found = psc.search(cdss_not_found)
~~~~~~~~~~^^^^^^^^^^^^^^^^
File "/home/mibi/miniconda3/envs/bakta/lib/python3.13/site-packages/bakta/psc.py", line 65, in search
raise Exception(f'diamond error! error code: {proc.returncode}')
Exception: diamond error! error code: -6File "/home/mibi/miniconda3/envs/bakta/bin/bakta", line 10, in <module>
sys.exit(main())
~~~~^^
File "/home/mibi/miniconda3/envs/bakta/lib/python3.13/site-packages/bakta/main.py", line 332, in main
pseudo_candidates = feat_cds.predict_pseudo_candidates(hypotheticals)
File "/home/mibi/miniconda3/envs/bakta/lib/python3.13/site-packages/bakta/features/cds.py", line 621, in predict_pseudo_candidates
raise Exception(f'diamond error! error code: {proc.returncode}')
Exception: diamond error! error code: -7Traceback (most recent call last):
File "/home/mibi/miniconda3/envs/bakta/bin/bakta", line 10, in <module>
sys.exit(main())
~~~~^^
File "/home/mibi/miniconda3/envs/bakta/lib/python3.13/site-packages/bakta/main.py", line 285, in main
cdss_psc, cdss_pscc, cdss_not_found = psc.search(cdss_not_found)
~~~~~~~~~~^^^^^^^^^^^^^^^^
File "/home/mibi/miniconda3/envs/bakta/lib/python3.13/site-packages/bakta/psc.py", line 65, in search
raise Exception(f'diamond error! error code: {proc.returncode}')
Exception: diamond error! error code: -11I'm annotating a batch of an hundred genomes (downloaded from NCBI) with a for cycle in the terminal, and I'm noticing that these errors seem to occur randomly, they didn't seem to be linked to a particular genome. In fact, if I try to annotate the single genome or if the cycle is relaunched, the annotation may end successfully. I noticed that sometimes the diamond process seems to hang when detecting pseudogenes. It occupies RAM but no CPU, hinderingthe completion of the annotation, so I have to kill it and restart.
I also tried the previous version of bakta (1.11.4) installed via conda with python=3.11 and diamond=2.1.18, but without success. Instead I got a different error:
Traceback (most recent call last):
File "/home/francesco/miniconda3/envs/bakta_1.11.4/bin/bakta", line 10, in <module>
sys.exit(main())
^^^^^^
File "/home/francesco/miniconda3/envs/bakta_1.11.4/lib/python3.11/site-packages/bakta/main.py", line 343, in main
pfam_hits = feat_cds.predict_pfam(hypotheticals)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/francesco/miniconda3/envs/bakta_1.11.4/lib/python3.11/site-packages/bakta/features/cds.py", line 371, in predict_pfam
aa_identifier = hit.name.decode()
^^^^^^^^^^^^^^^
AttributeError: 'str' object has no attribute 'decode'. Did you mean: 'encode'?I don't know if this can be somehow fixed since the errors seem to happen randomly, but maybe this report could be useful!
Thanks.