Skip to content

Add MAGs-taxonomy-annotation workflow#1104

Open
SantaMcCloud wants to merge 3 commits intogalaxyproject:mainfrom
SantaMcCloud:mags_taxa_anno_wf_add
Open

Add MAGs-taxonomy-annotation workflow#1104
SantaMcCloud wants to merge 3 commits intogalaxyproject:mainfrom
SantaMcCloud:mags_taxa_anno_wf_add

Conversation

@SantaMcCloud
Copy link

FOR CONTRIBUTOR:

  • I have read the Adding workflows guidelines
  • License permits unrestricted use (educational + commercial)
  • Please also take note of the reviewer guidelines below to facilitate a smooth review process.

FOR REVIEWERS:

  • .dockstore.yml: file is present and aligned with creator metadata in workflow. ORCID identifiers are strongly encouraged in creator metadata. The .dockstore.yml file is required to run tests
  • Workflow is sufficiently generic to be used with lab data and does not hardcode sample names, reference data and can be run without reading an accompanying tutorial.
  • In workflow: annotation field contains short description of what the workflow does. Should start with This workflow does/runs/performs … xyz … to generate/analyze/etc …
  • In workflow: workflow inputs and outputs have human readable names (spaces are fine, no underscore, dash only where spelling dictates it), no abbreviation unless it is generally understood. Altering input or output labels requires adjusting these labels in the the workflow-tests.yml file as well
  • In workflow: name field should be human readable (spaces are fine, no underscore, dash only where spelling dictates it), no abbreviation unless generally understood
  • Workflow folder: prefer dash (-) over underscore (_), prefer all lowercase. Folder becomes repository in iwc-workflows organization and is included in TRS id
  • Readme explains what workflow does, what are valid inputs and what outputs users can expect. If a tutorial or other resources exist they can be linked. If a similar workflow exists in IWC readme should explain differences with existing workflow and when one might prefer one workflow over another
  • Changelog contains appropriate entries
  • Large files (> 100 KB) are uploaded to zenodo and location urls are used in test file

@github-actions
Copy link

Test Results (powered by Planemo)

Test Summary

Test State Count
Total 1
Passed 0
Error 0
Failure 1
Skipped 0
Failed Tests
  • ❌ MAGs-taxonomy-annotation.ga_0

    Problems:

    • Expected output [MultiQC HTML report] not found in results.
      

    Workflow invocation details

    • Invocation Messages

    • Steps
      • Step 1: Sequence collection:

        • step_state: scheduled
      • Step 2: Unlabelled step (toolshed.g2.bx.psu.edu/repos/iuc/gtdbtk_classify_wf/gtdbtk_classify_wf/2.5.2+galaxy1):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gtdbtk:2.5.2--pyh1f0d9b5_0

            Command Line:

            • mkdir input_dir && mkdir output_dir && ln -s '/tmp/tmpy3d6hil1/files/e/f/0/dataset_ef05543e-a313-4216-9e30-2c11e2d947ca.dat' input_dir/'50contig_reads_binette_bin1.fasta.fasta' && ln -s '/tmp/tmpy3d6hil1/files/9/2/2/dataset_922ef39f-bb4c-4dbc-a5f9-c69deb0efdf6.dat' input_dir/'50contig_reads_binette_bin2.fasta.fasta' && ln -s '/tmp/tmpy3d6hil1/files/a/3/6/dataset_a36bc4a7-8021-47fd-981d-36633f3165ff.dat' input_dir/'50contig_reads_binette_bin3.fasta.fasta' && ln -s '/tmp/tmpy3d6hil1/files/4/9/7/dataset_49737e06-edf0-4500-8557-2dc77a4fced5.dat' input_dir/'50contig_reads_binette_bin4.fasta.fasta' && export GTDBTK_DATA_PATH=/cvmfs/data.galaxyproject.org/managed/gtdbtk_database_versioned/full_database_release_220_downloaded_2024-10-19 && gtdbtk classify_wf --genome_dir input_dir --extension '.fasta' --out_dir output_dir --cpus "${GALAXY_SLOTS:-4}" --min_perc_aa 10  --min_af 0.65  --skip_ani_screen

            Exit Code:

            • 0

            Standard Error:

            • ==> Processed 0/4 genomes (0%) |               | [?genome/s, ETA ?]
              ==> Processed 2/4 genomes (50%) |███████▌       | [ 2.95genome/s, ETA 00:00]
              ==> Processed 4/4 genomes (100%) |███████████████| [ 5.14genome/s, ETA 00:00]
                                                                                           
              
              ==> Processed 4/4 genomes (100%) |███████████████| [ 5.14genome/s, ETA 00:00]
                                                                                           
              
              ==> Processed 0/4 genomes (0%) |               | [?genome/s, ETA ?]
              ==> Processed 1/4 genomes (25%) |███▊           | [ 2.32genome/s, ETA 00:01]
              ==> Processed 2/4 genomes (50%) |███████▌       | [ 2.08genome/s, ETA 00:00]
              ==> Processed 4/4 genomes (100%) |███████████████| [ 3.73genome/s, ETA 00:00]
                                                                                           
              
              ==> Processed 4/4 genomes (100%) |███████████████| [ 3.73genome/s, ETA 00:00]
                                                                                           
              
              ==> Processed 0/4 genomes (0%) |               | [?genome/s, ETA ?]
              ==> Processed 1/4 genomes (25%) |███▊           | [ 1.28genome/s, ETA 00:02]
                                                                                          
              
              ==> Processed 4/4 genomes (100%) |███████████████| [ 1.28genome/s, ETA 00:00]
                                                                                           
              
              ==> Processed 0/4 genomes (0%) |               | [?genome/s, ETA ?]
                                                                                 
              
              ==> Processed 0/4 genomes (0%) |               | [?genome/s, ETA ?]
                                                                                 
              
              ==> Processed 0/22 markers (0%) |               | [?marker/s, ETA ?]
              ==> Processed 2/22 markers (9%) |█▎             | [16.68marker/s, ETA 00:01]
              ==> Processed 5/22 markers (23%) |███▍           | [21.07marker/s, ETA 00:00]
              ==> Processed 9/22 markers (41%) |██████▏        | [25.55marker/s, ETA 00:00]
              ==> Processed 13/22 markers (59%) |████████▊      | [28.61marker/s, ETA 00:00]
              ==> Processed 18/22 markers (82%) |████████████▎  | [31.35marker/s, ETA 00:00]
                                                                                            
              
              ==> Processed 0/107236 sequences (0%) |               | [?sequence/s, ETA ?]
              ==> Processed 58/107236 sequences (0%) |               | [574.52sequence/s, ETA 03:06]
              ==> Processed 116/107236 sequences (0%) |               | [571.76sequence/s, ETA 03:07]
              ==> Processed 174/107236 sequences (0%) |               | [548.73sequence/s, ETA 03:15]
              ==> Processed 232/107236 sequences (0%) |               | [556.84sequence/s, ETA 03:12]
              ==> Processed 290/107236 sequences (0%) |               | [560.67sequence/s, ETA 03:10]
              ==> Processed 347/107236 sequences (0%) |               | [553.62sequence/s, ETA 03:13]
              ==> Processed 404/107236 sequences (0%) |               | [556.27sequence/s, ETA 03:12]
              ==> Processed 461/107236 sequences (0%) |               | [558.26sequence/s, ETA 03:11]
              ==> Processed 517/107236 sequences (0%) |               | [556.02sequence/s, ETA 03:11]
              ==> Processed 573/107236 sequences (1%) |               | [555.39sequence/s, ETA 03:12]
              ==> Processed 631/107236 sequences (1%) |               | [557.65sequence/s, ETA 03:11]
              ==> Processed 687/107236 sequences (1%) |               | [555.25sequence/s, ETA 03:11]
              ==> Processed 744/107236 sequences (1%) |               | [556.68sequence/s, ETA 03:11]
              ==> Processed 802/107236 sequences (1%) |               | [558.43sequence/s, ETA 03:10]
              ==> Processed 858/107236 sequences (1%) |               | [552.45sequence/s, ETA 03:12]
              ==> Processed 915/107236 sequences (1%) |▏              | [554.12sequence/s, ETA 03:11]
              ==> Processed 972/107236 sequences (1%) |▏              | [554.72sequence/s, ETA 03:11]
              ==> Processed 1028/107236 sequences (1%) |▏              | [549.91sequence/s, ETA 03:13]
              ==> Processed 1085/107236 sequences (1%) |▏              | [550.98sequence/s, ETA 03:12]
              ==> Processed 1142/107236 sequences (1%) |▏              | [552.12sequence/s, ETA 03:12]
              ==> Processed 1200/107236 sequences (1%) |▏              | [554.44sequence/s, ETA 03:11]
              ==> Processed 1258/107236 sequences (1%) |▏              | [556.32sequence/s, ETA 03:10]
              ==> Processed 1314/107236 sequences (1%) |▏              | [553.69sequence/s, ETA 03:11]
              ==> Processed 1372/107236 sequences (1%) |▏              | [556.34sequence/s, ETA 03:10]
              ==> Processed 1429/107236 sequences (1%) |▏              | [557.69sequence/s, ETA 03:09]
              ==> Processed 1485/107236 sequences (1%) |▏              | [555.65sequence/s, ETA 03:10]
              ==> Processed 1542/107236 sequences (1%) |▏              | [556.91sequence/s, ETA 03:09]
              ==> Processed 1600/107236 sequences (1%) |▏              | [558.34sequence/s, ETA 03:09]
              ==> Processed 1656/107236 sequences (2%) |▏              | [551.77sequence/s, ETA 03:11]
              ==> Processed 1713/107236 sequences (2%) |▏              | [553.30sequence/s, ETA 03:10]
              ==> Processed 1770/107236 sequences (2%) |▏              | [554.97sequence/s, ETA 03:10]
              ==> Processed 1826/107236 sequences (2%) |▎              | [549.40sequence/s, ETA 03:11]
              ==> Processed 1883/107236 sequences (2%) |▎              | [551.20sequence/s, ETA 03:11]
              ==> Processed 1941/107236 sequences (2%) |▎              | [553.30sequence/s, ETA 03:10]
              ==> Processed 1997/107236 sequences (2%) |▎              | [547.03sequence/s, ETA 03:12]
              ==> Processed 2054/107236 sequences (2%) |▎              | [548.96sequence/s, ETA 03:11]
              ==> Processed 2111/107236 sequences (2%) |▎              | [550.82sequence/s, ETA 03:10]
              ==> Processed 2167/107236 sequences (2%) |▎              | [547.63sequence/s, ETA 03:11]
              ==> Processed 2224/107236 sequences (2%) |▎              | [549.13sequence/s, ETA 03:11]
              ==> Processed 2282/107236 sequences (2%) |▎              | [551.58sequence/s, ETA 03:10]
              ==> Processed 2338/107236 sequences (2%) |▎              | [551.57sequence/s, ETA 03:10]
              ==> Processed 2395/107236 sequences (2%) |▎              | [553.15sequence/s, ETA 03:09]
              ==> Processed 2452/107236 sequences (2%) |▎              | [554.06sequence/s, ETA 03:09]
              ==> Processed 2508/107236 sequences (2%) |▎              | [553.16sequence/s, ETA 03:09]
              ==> Processed 2565/107236 sequences (2%) |▎              | [554.33sequence/s, ETA 03:08]
              ==> Processed 2621/107236 sequences (2%) |▎              | [553.64sequence/s, ETA 03:08]
              ==> Processed 2677/107236 sequences (2%) |▎              | [549.36sequence/s, ETA 03:10]
              ==> Processed 2735/107236 sequences (3%) |▍              | [551.47sequence/s, ETA 03:09]
              ==> Processed 2791/107236 sequences (3%) |▍              | [549.67sequence/s, ETA 03:10]
              ==> Processed 2848/107236 sequences (3%) |▍              | [551.22sequence/s, ETA 03:09]
              ==> Processed 2906/107236 sequences (3%) |▍              | [553.27sequence/s, ETA 03:08]
              ==> Processed 2962/107236 sequences (3%) |▍              | [550.15sequence/s, ETA 03:09]
              ==> Processed 3019/107236 sequences (3%) |▍              | [552.07sequence/s, ETA 03:08]
              ==> Processed 3076/107236 sequences (3%) |▍              | [553.21sequence/s, ETA 03:08]
              ==> Processed 3132/107236 sequences (3%) |▍              | [551.68sequence/s, ETA 03:08]
              ==> Processed 3190/107236 sequences (3%) |▍              | [554.34sequence/s, ETA 03:07]
              ==> Processed 3247/107236 sequences (3%) |▍              | [555.68sequence/s, ETA 03:07]
              ==> Processed 3303/107236 sequences (3%) |▍              | [554.41sequence/s, ETA 03:07]
              ==> Processed 3360/107236 sequences (3%) |▍              | [555.05sequence/s, ETA 03:07]
              ==> Processed 3417/107236 sequences (3%) |▍              | [556.37sequence/s, ETA 03:06]
              ==> Processed 3473/107236 sequences (3%) |▍              | [553.90sequence/s, ETA 03:07]
              ==> Processed 3530/107236 sequences (3%) |▍              | [555.02sequence/s, ETA 03:06]
              ==> Processed 3587/107236 sequences (3%) |▌              | [556.35sequence/s, ETA 03:06]
              ==> Processed 3643/107236 sequences (3%) |▌              | [555.53sequence/s, ETA 03:06]
              ==> Processed 3701/107236 sequences (3%) |▌              | [557.07sequence/s, ETA 03:05]
              ==> Processed 3758/107236 sequences (4%) |▌              | [558.22sequence/s, ETA 03:05]
              ==> Processed 3814/107236 sequences (4%) |▌              | [552.41sequence/s, ETA 03:07]
              ==> Processed 3873/107236 sequences (4%) |▌              | [555.61sequence/s, ETA 03:06]
              ==> Processed 3930/107236 sequences (4%) |▌              | [556.18sequence/s, ETA 03:05]
              ==> Processed 3987/107236 sequences (4%) |▌              | [556.66sequence/s, ETA 03:05]
              ==> Processed 4044/107236 sequences (4%) |▌              | [557.88sequence/s, ETA 03:04]
              ==> Processed 4101/107236 sequences (4%) |▌              | [558.59sequence/s, ETA 03:04]
              ==> Processed 4158/107236 sequences (4%) |▌              | [559.14sequence/s, ETA 03:04]
              ==> Processed 4215/107236 sequences (4%) |▌              | [559.79sequence/s, ETA 03:04]
              ==> Processed 4272/107236 sequences (4%) |▌              | [560.70sequence/s, ETA 03:03]
              ==> Processed 4329/107236 sequences (4%) |▌              | [560.78sequence/s, ETA 03:03]
              ==> Processed 4386/107236 sequences (4%) |▌              | [561.57sequence/s, ETA 03:03]
              ==> Processed 4443/107236 sequences (4%) |▌              | [561.23sequence/s, ETA 03:03]
              ==> Processed 4500/107236 sequences (4%) |▋              | [557.70sequence/s, ETA 03:04]
              ==> Processed 4557/107236 sequences (4%) |▋              | [558.53sequence/s, ETA 03:03]
              ==> Processed 4613/107236 sequences (4%) |▋              | [557.93sequence/s, ETA 03:03]
              ==> Processed 4670/107236 sequences (4%) |▋              | [558.25sequence/s, ETA 03:03]
              ==> Processed 4727/107236 sequences (4%) |▋              | [559.22sequence/s, ETA 03:03]
              ==> Processed 4783/107236 sequences (4%) |▋              | [556.16sequence/s, ETA 03:04]
              ==> Processed 4841/107236 sequences (5%) |▋              | [557.71sequence/s, ETA 03:03]
              ==> Processed 4898/107236 sequences (5%) |▋              | [558.60sequence/s, ETA 03:03]
              ==> Processed 4954/107236 sequences (5%) |▋              | [555.45sequence/s, ETA 03:04]
              ==> Processed 5011/107236 sequences (5%) |▋              | [556.83sequence/s, ETA 03:03]
              ==> Processed 5068/107236 sequences (5%) |▋              | [557.37sequence/s, ETA 03:03]
              ==> Processed 5124/107236 sequences (5%) |▋              | [555.77sequence/s, ETA 03:03]
              ==> Processed 5181/107236 sequences (5%) |▋              | [557.01sequence/s, ETA 03:03]
              ==> Processed 5239/107236 sequences (5%) |▋              | [558.48sequence/s, ETA 03:02]
              ==> Processed 5295/107236 sequences (5%) |▋              | [556.76sequence/s, ETA 03:03]
              ==> Processed 5353/107236 sequences (5%) |▋              | [558.84sequence/s, ETA 03:02]
              ==> Processed 5411/107236 sequences (5%) |▊              | [560.78sequence/s, ETA 03:01]
              ==> Processed 5468/107236 sequences (5%) |▊              | [556.21sequence/s, ETA 03:02]
              ==> Processed 5525/107236 sequences (5%) |▊              | [557.35sequence/s, ETA 03:02]
              ==> Processed 5584/107236 sequences (5%) |▊              | [559.61sequence/s, ETA 03:01]
              ==> Processed 5641/107236 sequences (5%) |▊              | [555.51sequence/s, ETA 03:02]
              ==> Processed 5699/107236 sequences (5%) |▊              | [557.87sequence/s, ETA 03:02]
              ==> Processed 5756/107236 sequences (5%) |▊              | [558.86sequence/s, ETA 03:01]
              ==> Processed 5812/107236 sequences (5%) |▊              | [555.10sequence/s, ETA 03:02]
              ==> Processed 5869/107236 sequences (5%) |▊              | [555.90sequence/s, ETA 03:02]
              ==> Processed 5926/107236 sequences (6%) |▊              | [556.48sequence/s, ETA 03:02]
              ==> Processed 5982/107236 sequences (6%) |▊              | [556.06sequence/s, ETA 03:02]
              ==> Processed 6039/107236 sequences (6%) |▊              | [557.27sequence/s, ETA 03:01]
              ==> Processed 6096/107236 sequences (6%) |▊              | [558.02sequence/s, ETA 03:01]
              ==> Processed 6152/107236 sequences (6%) |▊              | [557.67sequence/s, ETA 03:01]
              ==> Processed 6208/107236 sequences (6%) |▊              | [557.44sequence/s, ETA 03:01]
              ==> Processed 6264/107236 sequences (6%) |▉              | [557.37sequence/s, ETA 03:01]
              ==> Processed 6321/107236 sequences (6%) |▉              | [557.70sequence/s, ETA 03:00]
              ==> Processed 6378/107236 sequences (6%) |▉              | [558.27sequence/s, ETA 03:00]
              ==> Processed 6434/107236 sequences (6%) |▉              | [557.61sequence/s, ETA 03:00]
              ==> Processed 6490/107236 sequences (6%) |▉              | [557.58sequence/s, ETA 03:00]
              ==> Processed 6547/107236 sequences (6%) |▉              | [558.47sequence/s, ETA 03:00]
              ==> Processed 6603/107236 sequences (6%) |▉              | [555.81sequence/s, ETA 03:01]
              ==> Processed 6659/107236 sequences (6%) |▉              | [555.76sequence/s, ETA 03:00]
              ==> Processed 6716/107236 sequences (6%) |▉              | [557.16sequence/s, ETA 03:00]
              ==> Processed 6772/107236 sequences (6%) |▉              | [552.65sequence/s, ETA 03:01]
              ==> Processed 6829/107236 sequences (6%) |▉              | [554.04sequence/s, ETA 03:01]
              ==> Processed 6886/107236 sequences (6%) |▉              | [555.42sequence/s, ETA 03:00]
              ==> Processed 6942/107236 sequences (6%) |▉              | [554.16sequence/s, ETA 03:00]
              ==> Processed 6999/107236 sequences (7%) |▉              | [555.22sequence/s, ETA 03:00]
              ==> Processed 7056/107236 sequences (7%) |▉              | [556.52sequence/s, ETA 03:00]
              ==> Processed 7112/107236 sequences (7%) |▉              | [555.86sequence/s, ETA 03:00]
              ==> Processed 7169/107236 sequences (7%) |█              | [557.06sequence/s, ETA 02:59]
              ==> Processed 7226/107236 sequences (7%) |█              | [558.10sequence/s, ETA 02:59]
              ==> Processed 7282/107236 sequences (7%) |█              | [557.62sequence/s, ETA 02:59]
              ==> Processed 7338/107236 sequences (7%) |█              | [556.22sequence/s, ETA 02:59]
              ==> Processed 7395/107236 sequences (7%) |█              | [557.44sequence/s, ETA 02:59]
              ==> Processed 7451/107236 sequences (7%) |█              | [557.47sequence/s, ETA 02:58]
              ==> Processed 7509/107236 sequences (7%) |█              | [559.22sequence/s, ETA 02:58]
              ==> Processed 7566/107236 sequences (7%) |█              | [560.10sequence/s, ETA 02:57]
              ==> Processed 7623/107236 sequences (7%) |█              | [557.98sequence/s, ETA 02:58]
              ==> Processed 7680/107236 sequences (7%) |█              | [559.01sequence/s, ETA 02:58]
              ==> Processed 7737/107236 sequences (7%) |█              | [560.05sequence/s, ETA 02:57]
              ==> Processed 7794/107236 sequences (7%) |█              | [558.58sequence/s, ETA 02:58]
              ==> Processed 7851/107236 sequences (7%) |█              | [559.27sequence/s, ETA 02:57]
              ==> Processed 7907/107236 sequences (7%) |█              | [557.88sequence/s, ETA 02:58]
              ==> Processed 7963/107236 sequences (7%) |█              | [557.60sequence/s, ETA 02:58]
              ==> Processed 8020/107236 sequences (7%) |█              | [558.70sequence/s, ETA 02:57]
              ==> Processed 8076/107236 sequences (8%) |█▏             | [558.78sequence/s, ETA 02:57]
              ==> Processed 8132/107236 sequences (8%) |█▏             | [558.59sequence/s, ETA 02:57]
              ==> Processed 8189/107236 sequences (8%) |█▏             | [559.54sequence/s, ETA 02:57]
              ==> Processed 8245/107236 sequences (8%) |█▏             | [559.52sequence/s, ETA 02:56]
              ==> Processed 8303/107236 sequences (8%) |█▏             | [560.71sequence/s, ETA 02:56]
              ==> Processed 8360/107236 sequences (8%) |█▏             | [561.52sequence/s, ETA 02:56]
              ==> Processed 8417/107236 sequences (8%) |█▏             | [559.72sequence/s, ETA 02:56]
              ==> Processed 8474/107236 sequences (8%) |█▏             | [559.84sequence/s, ETA 02:56]
              ==> Processed 8531/107236 sequences (8%) |█▏             | [560.81sequence/s, ETA 02:56]
              ==> Processed 8588/107236 sequences (8%) |█▏             | [557.82sequence/s, ETA 02:56]
              ==> Processed 8644/107236 sequences (8%) |█▏             | [556.88sequence/s, ETA 02:57]
              ==> Processed 8701/107236 sequences (8%) 
              ..
              A 00:10]
              ==> Processed 101485/107236 sequences (95%) |██████████████▏| [554.95sequence/s, ETA 00:10]
              ==> Processed 101541/107236 sequences (95%) |██████████████▏| [549.11sequence/s, ETA 00:10]
              ==> Processed 101600/107236 sequences (95%) |██████████████▏| [552.54sequence/s, ETA 00:10]
              ==> Processed 101659/107236 sequences (95%) |██████████████▏| [555.43sequence/s, ETA 00:10]
              ==> Processed 101715/107236 sequences (95%) |██████████████▏| [552.21sequence/s, ETA 00:09]
              ==> Processed 101773/107236 sequences (95%) |██████████████▏| [554.42sequence/s, ETA 00:09]
              ==> Processed 101830/107236 sequences (95%) |██████████████▏| [555.48sequence/s, ETA 00:09]
              ==> Processed 101886/107236 sequences (95%) |██████████████▎| [553.29sequence/s, ETA 00:09]
              ==> Processed 101943/107236 sequences (95%) |██████████████▎| [554.30sequence/s, ETA 00:09]
              ==> Processed 102001/107236 sequences (95%) |██████████████▎| [555.94sequence/s, ETA 00:09]
              ==> Processed 102057/107236 sequences (95%) |██████████████▎| [552.62sequence/s, ETA 00:09]
              ==> Processed 102114/107236 sequences (95%) |██████████████▎| [554.02sequence/s, ETA 00:09]
              ==> Processed 102171/107236 sequences (95%) |██████████████▎| [555.37sequence/s, ETA 00:09]
              ==> Processed 102227/107236 sequences (95%) |██████████████▎| [554.15sequence/s, ETA 00:09]
              ==> Processed 102285/107236 sequences (95%) |██████████████▎| [556.34sequence/s, ETA 00:08]
              ==> Processed 102341/107236 sequences (95%) |██████████████▎| [556.67sequence/s, ETA 00:08]
              ==> Processed 102397/107236 sequences (95%) |██████████████▎| [554.38sequence/s, ETA 00:08]
              ==> Processed 102455/107236 sequences (96%) |██████████████▎| [556.08sequence/s, ETA 00:08]
              ==> Processed 102511/107236 sequences (96%) |██████████████▎| [556.19sequence/s, ETA 00:08]
              ==> Processed 102567/107236 sequences (96%) |██████████████▎| [555.57sequence/s, ETA 00:08]
              ==> Processed 102624/107236 sequences (96%) |██████████████▎| [556.69sequence/s, ETA 00:08]
              ==> Processed 102680/107236 sequences (96%) |██████████████▎| [555.88sequence/s, ETA 00:08]
              ==> Processed 102736/107236 sequences (96%) |██████████████▎| [556.09sequence/s, ETA 00:08]
              ==> Processed 102793/107236 sequences (96%) |██████████████▍| [557.34sequence/s, ETA 00:07]
              ==> Processed 102849/107236 sequences (96%) |██████████████▍| [557.17sequence/s, ETA 00:07]
              ==> Processed 102905/107236 sequences (96%) |██████████████▍| [555.84sequence/s, ETA 00:07]
              ==> Processed 102963/107236 sequences (96%) |██████████████▍| [557.56sequence/s, ETA 00:07]
              ==> Processed 103019/107236 sequences (96%) |██████████████▍| [555.72sequence/s, ETA 00:07]
              ==> Processed 103075/107236 sequences (96%) |██████████████▍| [555.41sequence/s, ETA 00:07]
              ==> Processed 103133/107236 sequences (96%) |██████████████▍| [556.91sequence/s, ETA 00:07]
              ==> Processed 103189/107236 sequences (96%) |██████████████▍| [552.76sequence/s, ETA 00:07]
              ==> Processed 103245/107236 sequences (96%) |██████████████▍| [553.07sequence/s, ETA 00:07]
              ==> Processed 103302/107236 sequences (96%) |██████████████▍| [554.68sequence/s, ETA 00:07]
              ==> Processed 103358/107236 sequences (96%) |██████████████▍| [553.33sequence/s, ETA 00:07]
              ==> Processed 103416/107236 sequences (96%) |██████████████▍| [555.88sequence/s, ETA 00:06]
              ==> Processed 103473/107236 sequences (96%) |██████████████▍| [557.20sequence/s, ETA 00:06]
              ==> Processed 103529/107236 sequences (97%) |██████████████▍| [554.71sequence/s, ETA 00:06]
              ==> Processed 103587/107236 sequences (97%) |██████████████▍| [556.84sequence/s, ETA 00:06]
              ==> Processed 103645/107236 sequences (97%) |██████████████▍| [558.39sequence/s, ETA 00:06]
              ==> Processed 103701/107236 sequences (97%) |██████████████▌| [558.09sequence/s, ETA 00:06]
              ==> Processed 103758/107236 sequences (97%) |██████████████▌| [559.10sequence/s, ETA 00:06]
              ==> Processed 103814/107236 sequences (97%) |██████████████▌| [558.08sequence/s, ETA 00:06]
              ==> Processed 103870/107236 sequences (97%) |██████████████▌| [557.80sequence/s, ETA 00:06]
              ==> Processed 103927/107236 sequences (97%) |██████████████▌| [558.75sequence/s, ETA 00:05]
              ==> Processed 103985/107236 sequences (97%) |██████████████▌| [559.98sequence/s, ETA 00:05]
              ==> Processed 104042/107236 sequences (97%) |██████████████▌| [559.33sequence/s, ETA 00:05]
              ==> Processed 104100/107236 sequences (97%) |██████████████▌| [560.81sequence/s, ETA 00:05]
              ==> Processed 104157/107236 sequences (97%) |██████████████▌| [561.47sequence/s, ETA 00:05]
              ==> Processed 104214/107236 sequences (97%) |██████████████▌| [561.19sequence/s, ETA 00:05]
              ==> Processed 104271/107236 sequences (97%) |██████████████▌| [561.99sequence/s, ETA 00:05]
              ==> Processed 104328/107236 sequences (97%) |██████████████▌| [562.50sequence/s, ETA 00:05]
              ==> Processed 104385/107236 sequences (97%) |██████████████▌| [560.31sequence/s, ETA 00:05]
              ==> Processed 104442/107236 sequences (97%) |██████████████▌| [561.10sequence/s, ETA 00:04]
              ==> Processed 104499/107236 sequences (97%) |██████████████▌| [560.84sequence/s, ETA 00:04]
              ==> Processed 104556/107236 sequences (98%) |██████████████▋| [559.80sequence/s, ETA 00:04]
              ==> Processed 104612/107236 sequences (98%) |██████████████▋| [558.82sequence/s, ETA 00:04]
              ==> Processed 104668/107236 sequences (98%) |██████████████▋| [558.81sequence/s, ETA 00:04]
              ==> Processed 104724/107236 sequences (98%) |██████████████▋| [556.77sequence/s, ETA 00:04]
              ==> Processed 104782/107236 sequences (98%) |██████████████▋| [558.29sequence/s, ETA 00:04]
              ==> Processed 104838/107236 sequences (98%) |██████████████▋| [558.18sequence/s, ETA 00:04]
              ==> Processed 104895/107236 sequences (98%) |██████████████▋| [559.31sequence/s, ETA 00:04]
              ==> Processed 104952/107236 sequences (98%) |██████████████▋| [559.85sequence/s, ETA 00:04]
              ==> Processed 105009/107236 sequences (98%) |██████████████▋| [560.59sequence/s, ETA 00:03]
              ==> Processed 105067/107236 sequences (98%) |██████████████▋| [562.31sequence/s, ETA 00:03]
              ==> Processed 105126/107236 sequences (98%) |██████████████▋| [564.61sequence/s, ETA 00:03]
              ==> Processed 105183/107236 sequences (98%) |██████████████▋| [562.23sequence/s, ETA 00:03]
              ==> Processed 105240/107236 sequences (98%) |██████████████▋| [562.57sequence/s, ETA 00:03]
              ==> Processed 105297/107236 sequences (98%) |██████████████▋| [563.22sequence/s, ETA 00:03]
              ==> Processed 105354/107236 sequences (98%) |██████████████▋| [561.38sequence/s, ETA 00:03]
              ==> Processed 105411/107236 sequences (98%) |██████████████▋| [562.08sequence/s, ETA 00:03]
              ==> Processed 105468/107236 sequences (98%) |██████████████▊| [562.60sequence/s, ETA 00:03]
              ==> Processed 105525/107236 sequences (98%) |██████████████▊| [555.08sequence/s, ETA 00:03]
              ==> Processed 105583/107236 sequences (98%) |██████████████▊| [556.60sequence/s, ETA 00:02]
              ==> Processed 105640/107236 sequences (99%) |██████████████▊| [557.60sequence/s, ETA 00:02]
              ==> Processed 105696/107236 sequences (99%) |██████████████▊| [556.47sequence/s, ETA 00:02]
              ==> Processed 105753/107236 sequences (99%) |██████████████▊| [557.67sequence/s, ETA 00:02]
              ==> Processed 105811/107236 sequences (99%) |██████████████▊| [558.92sequence/s, ETA 00:02]
              ==> Processed 105867/107236 sequences (99%) |██████████████▊| [555.73sequence/s, ETA 00:02]
              ==> Processed 105925/107236 sequences (99%) |██████████████▊| [557.21sequence/s, ETA 00:02]
              ==> Processed 105982/107236 sequences (99%) |██████████████▊| [558.03sequence/s, ETA 00:02]
              ==> Processed 106038/107236 sequences (99%) |██████████████▊| [558.18sequence/s, ETA 00:02]
              ==> Processed 106095/107236 sequences (99%) |██████████████▊| [558.53sequence/s, ETA 00:02]
              ==> Processed 106152/107236 sequences (99%) |██████████████▊| [559.55sequence/s, ETA 00:01]
              ==> Processed 106208/107236 sequences (99%) |██████████████▊| [558.09sequence/s, ETA 00:01]
              ==> Processed 106266/107236 sequences (99%) |██████████████▊| [559.50sequence/s, ETA 00:01]
              ==> Processed 106322/107236 sequences (99%) |██████████████▊| [558.33sequence/s, ETA 00:01]
              ==> Processed 106379/107236 sequences (99%) |██████████████▉| [559.02sequence/s, ETA 00:01]
              ==> Processed 106436/107236 sequences (99%) |██████████████▉| [559.66sequence/s, ETA 00:01]
              ==> Processed 106492/107236 sequences (99%) |██████████████▉| [559.55sequence/s, ETA 00:01]
              ==> Processed 106550/107236 sequences (99%) |██████████████▉| [560.86sequence/s, ETA 00:01]
              ==> Processed 106608/107236 sequences (99%) |██████████████▉| [561.80sequence/s, ETA 00:01]
              ==> Processed 106665/107236 sequences (99%) |██████████████▉| [560.20sequence/s, ETA 00:01]
              ==> Processed 106723/107236 sequences (100%) |██████████████▉| [562.03sequence/s, ETA 00:00]
              ==> Processed 106781/107236 sequences (100%) |██████████████▉| [563.66sequence/s, ETA 00:00]
              ==> Processed 106838/107236 sequences (100%) |██████████████▉| [561.84sequence/s, ETA 00:00]
              ==> Processed 106895/107236 sequences (100%) |██████████████▉| [560.74sequence/s, ETA 00:00]
              ==> Processed 106952/107236 sequences (100%) |██████████████▉| [561.51sequence/s, ETA 00:00]
              ==> Processed 107009/107236 sequences (100%) |██████████████▉| [558.47sequence/s, ETA 00:00]
              ==> Processed 107066/107236 sequences (100%) |██████████████▉| [559.59sequence/s, ETA 00:00]
              ==> Processed 107124/107236 sequences (100%) |██████████████▉| [560.74sequence/s, ETA 00:00]
              ==> Processed 107181/107236 sequences (100%) |██████████████▉| [559.84sequence/s, ETA 00:00]
                                                                                                          
              
              ==> .
              ==> Step 1 of 9: Starting pplacer.
              ==> Running pplacer dev analysis on output_dir/align/gtdbtk.bac120.user_msa.fasta.gz....
              ==> Step 2 of 9: Pre-masking sequences.                                                 
              ==> Step 3 of 9: Determining figs.     
              ==> Step 4 of 9: Allocating memory for internal nodes.
              ==> Step 5 of 9: Caching likelihood information on reference tree.
              ==> Step 5 of 9: Caching likelihood information on reference tree (2.30/5.10 GB, 45.15%).
              ==> Step 6 of 9: Pulling exponents.                                                      
              ==> Step 7 of 9: Preparing the edges for baseball.
              ==> Step 8 of 9: Placing genomes.                 
              ==> Step 9 of 9: placing genome 1 of 1 (100.00%).
                                                               
              
              ==> .
              ==> Step 1 of 9: Starting pplacer.
              ==> Running pplacer dev analysis on output_dir/classify/intermediate_results/pplacer/tree_7/user_msa_file.fasta....
              ==> Step 2 of 9: Pre-masking sequences.                                                                            
              ==> Step 3 of 9: Determining figs.     
              ==> Step 4 of 9: Allocating memory for internal nodes.
              ==> Step 5 of 9: Caching likelihood information on reference tree.
              ==> Step 5 of 9: Caching likelihood information on reference tree (2.31/4.69 GB, 49.20%).
              ==> Step 6 of 9: Pulling exponents.                                                      
              ==> Step 7 of 9: Preparing the edges for baseball.
              ==> Step 8 of 9: Placing genomes.                 
              ==> Step 9 of 9: placing genome 1 of 1 (100.00%).
                                                               
              
              ==> Processed 0/1 genomes (0%) |               | [?genome/s, ETA ?]
                                                                                 
              
              ==> Processed 0/18 comparisons (0%) |               | [?comparison/s, ETA ?]
              ==> Processed 1/18 comparisons (6%) |▊              | [ 4.34comparison/s, ETA 00:03]
              ==> Processed 2/18 comparisons (11%) |█▋             | [ 4.75comparison/s, ETA 00:03]
              ==> Processed 3/18 comparisons (17%) |██▌            | [ 4.80comparison/s, ETA 00:03]
              ==> Processed 4/18 comparisons (22%) |███▎           | [ 4.67comparison/s, ETA 00:02]
              ==> Processed 5/18 comparisons (28%) |████▏          | [ 4.69comparison/s, ETA 00:02]
              ==> Processed 6/18 comparisons (33%) |█████          | [ 4.90comparison/s, ETA 00:02]
              ==> Processed 7/18 comparisons (39%) |█████▊         | [ 4.94comparison/s, ETA 00:02]
              ==> Processed 8/18 comparisons (44%) |██████▋        | [ 4.93comparison/s, ETA 00:02]
              ==> Processed 9/18 comparisons (50%) |███████▌       | [ 5.01comparison/s, ETA 00:01]
              ==> Processed 10/18 comparisons (56%) |████████▎      | [ 5.02comparison/s, ETA 00:01]
              ==> Processed 11/18 comparisons (61%) |█████████▏     | [ 5.16comparison/s, ETA 00:01]
              ==> Processed 12/18 comparisons (67%) |██████████     | [ 5.19comparison/s, ETA 00:01]
              ==> Processed 13/18 comparisons (72%) |██████████▊    | [ 5.26comparison/s, ETA 00:00]
              ==> Processed 14/18 comparisons (78%) |███████████▋   | [ 5.43comparison/s, ETA 00:00]
              ==> Processed 15/18 comparisons (83%) |████████████▌  | [ 5.54comparison/s, ETA 00:00]
              ==> Processed 16/18 comparisons (89%) |█████████████▎ | [ 5.61comparison/s, ETA 00:00]
              ==> Processed 17/18 comparisons (94%) |██████████████▏| [ 5.52comparison/s, ETA 00:00]
              ==> Processed 18/18 comparisons (100%) |███████████████| [ 5.67comparison/s, ETA 00:00]
                                                                                                     
              
              ==> Processed 18/18 comparisons (100%) |███████████████| [ 5.67comparison/s, ETA 00:00]
                                                                                                     

            Standard Output:

            • [2026-02-13 20:48:10] INFO: GTDB-Tk v2.5.2
              [2026-02-13 20:48:10] INFO: gtdbtk classify_wf --genome_dir input_dir --extension .fasta --out_dir output_dir --cpus 1 --min_perc_aa 10 --min_af 0.65 --skip_ani_screen
              [2026-02-13 20:48:11] INFO: Using GTDB-Tk reference data version r220: /cvmfs/data.galaxyproject.org/managed/gtdbtk_database_versioned/full_database_release_220_downloaded_2024-10-19
              [2026-02-13 20:48:11] WARNING: You are not using the reference data intended for this release: r226
              [2026-02-13 20:48:14] INFO: Identifying markers in 4 genomes with 1 threads.
              [2026-02-13 20:48:14] TASK: Running Prodigal V2.6.3 to identify genes.
              [2026-02-13 20:48:15] INFO: Completed 4 genomes in 0.81 seconds (4.94 genomes/second).
              [2026-02-13 20:48:15] TASK: Identifying TIGRFAM protein families.
              [2026-02-13 20:48:16] INFO: Completed 4 genomes in 1.13 seconds (3.54 genomes/second).
              [2026-02-13 20:48:16] TASK: Identifying Pfam protein families.
              [2026-02-13 20:48:17] INFO: Completed 4 genomes in 0.83 seconds (4.85 genomes/second).
              [2026-02-13 20:48:17] INFO: Annotations done using HMMER 3.4 (Aug 2023).
              [2026-02-13 20:48:17] TASK: Summarising identified marker genes.
              [2026-02-13 20:48:17] INFO: Completed 4 genomes in 0.00 seconds (1,589.35 genomes/second).
              [2026-02-13 20:48:17] INFO: Done.
              [2026-02-13 20:48:32] INFO: Aligning markers in 4 genomes with 1 CPUs.
              [2026-02-13 20:48:32] INFO: Processing 4 genomes identified as bacterial.
              [2026-02-13 20:54:26] INFO: Read concatenated alignment for 107,235 GTDB genomes.
              [2026-02-13 20:54:26] TASK: Generating concatenated alignment for each marker.
              [2026-02-13 20:54:27] INFO: Completed 4 genomes in 0.01 seconds (723.87 genomes/second).
              [2026-02-13 20:54:27] TASK: Aligning 22 identified markers using hmmalign 3.4 (Aug 2023).
              [2026-02-13 20:54:28] INFO: Completed 22 markers in 0.69 seconds (32.06 markers/second).
              [2026-02-13 20:54:28] TASK: Masking columns of bacterial multiple sequence alignment using canonical mask.
              [2026-02-13 20:57:41] INFO: Completed 107,236 sequences in 3.21 minutes (33,382.99 sequences/minute).
              [2026-02-13 20:57:41] INFO: Masked bacterial alignment from 41,084 to 5,035 AAs.
              [2026-02-13 20:57:41] INFO: 0 bacterial user genomes have amino acids in <10.0% of columns in filtered MSA.
              [2026-02-13 20:57:41] INFO: Creating concatenated alignment for 107,236 bacterial GTDB and user genomes.
              [2026-02-13 20:58:13] INFO: Creating concatenated alignment for 1 bacterial user genomes.
              [2026-02-13 20:58:13] INFO: Done.
              [2026-02-13 20:58:21] WARNING: pplacer requires ~55 GB of RAM to fully load the bacterial tree into memory. However, 16.37 GB was detected. This may affect pplacer performance, or fail if there is insufficient swap space.
              [2026-02-13 20:58:21] TASK: Placing 1 bacterial genomes into backbone reference tree with pplacer using 1 CPUs (be patient).
              [2026-02-13 20:58:21] INFO: pplacer version: dev
              [2026-02-13 20:58:48] INFO: Calculating RED values based on reference tree.
              [2026-02-13 20:58:48] INFO: 1 out of 1 have an class assignments. Those genomes will be reclassified.
              [2026-02-13 20:58:48] TASK: Placing 1 bacterial genomes into class-level reference tree 7 (1/1) with pplacer using 1 CPUs (be patient).
              [2026-02-13 20:59:12] INFO: Calculating RED values based on reference tree.
              [2026-02-13 20:59:13] TASK: Traversing tree to determine classification method.
              [2026-02-13 20:59:13] INFO: Completed 1 genome in 0.00 seconds (3,724.96 genomes/second).
              [2026-02-13 20:59:13] TASK: Calculating average nucleotide identity using skani (v0.3.0).
              [2026-02-13 20:59:16] INFO: Completed 18 comparisons in 3.35 seconds (5.37 comparisons/second).
              [2026-02-13 20:59:17] INFO: 1 genome(s) have been classified using skani and pplacer.
              [2026-02-13 20:59:17] WARNING: 3 of 4 genomes have a warning (see summary file).
              [2026-02-13 20:59:17] INFO: Note that Tk classification mode is insufficient for publication of new taxonomic designations. New designations should be based on one or more de novo trees, an example of which can be produced by Tk in de novo mode.
              [2026-02-13 20:59:17] INFO: Done.
              [2026-02-13 20:59:17] INFO: Removing intermediate files.
              [2026-02-13 20:59:17] INFO: Intermediate files removed.
              [2026-02-13 20:59:17] INFO: Done.
              

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              advanced {"force": false, "full_tree": false, "min_af": "0.65", "min_perc_aa": "10", "output_process_log": false}
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              gtdbtk_db "full_database_release_220_downloaded_2024-10-19"
      • Step 3: Unlabelled step (cat1):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/coreutils:8.31--h14c3975_0

            Command Line:

            • cat '/tmp/tmpy3d6hil1/files/3/8/6/dataset_386e43eb-f286-4ec3-b8cc-136ac511f80a.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/6/outputs/dataset_5f3b6140-9e9a-41aa-9c17-84b7d2fc518e.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              queries []
      • Step 4: Unlabelled step (Cut1):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/perl:5.26--2

            Command Line:

            • perl '/tmp/tmpy3d6hil1/galaxy-dev/tools/filters/cutWrapper.pl' '/tmp/tmpy3d6hil1/files/5/f/3/dataset_5f3b6140-9e9a-41aa-9c17-84b7d2fc518e.dat' 'c2' T '/tmp/tmpy3d6hil1/job_working_directory/000/7/outputs/dataset_aaba84c2-6339-46ce-8769-5c753cd3978e.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "tsv"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              columnList "c2"
              dbkey "?"
              delimiter "T"
      • Step 5: Unlabelled step (toolshed.g2.bx.psu.edu/repos/nml/collapse_collections/collapse_dataset/5.1.0):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.1.0--2

            Command Line:

            • (   cat "/tmp/tmpy3d6hil1/files/5/f/3/dataset_5f3b6140-9e9a-41aa-9c17-84b7d2fc518e.dat" ;   ) > /tmp/tmpy3d6hil1/job_working_directory/000/8/outputs/dataset_5a99aea1-3318-4615-98c5-560523bae4ba.dat

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              filename {"__current_case__": 1, "add_name": false}
              one_header false
      • Step 6: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • env -i $(which awk) --sandbox -v FS='	' -v OFS='	' --re-interval -f '/tmp/tmpy3d6hil1/job_working_directory/000/9/configs/tmpgt6w7avy' '/tmp/tmpy3d6hil1/files/a/a/b/dataset_aaba84c2-6339-46ce-8769-5c753cd3978e.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/9/outputs/dataset_a6b4640f-476d-436f-bc5e-4d78d3d205ee.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              code "!seen[$0]++"
              dbkey "?"
              variables []
      • Step 7: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • env -i $(which awk) --sandbox -v FS='	' -v OFS='	' --re-interval -f '/tmp/tmpy3d6hil1/job_working_directory/000/10/configs/tmp9imcck8r' '/tmp/tmpy3d6hil1/files/5/a/9/dataset_5a99aea1-3318-4615-98c5-560523bae4ba.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/10/outputs/dataset_dd813b4e-24a2-442b-9559-49c5d19ba78f.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              code ` "NR == 0
              dbkey "?"
              variables []
      • Step 8: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_column/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • awk -v OFS="\t" -v FS="\t" --re-interval --sandbox '{ $1 = gensub( /^.*;/, "", "g", $1 ) ; print $0 ; }' '/tmp/tmpy3d6hil1/files/a/6/b/dataset_a6b4640f-476d-436f-bc5e-4d78d3d205ee.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/11/outputs/dataset_6f2e3762-3e6a-4232-a685-2b9210b57d60.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              replacements [{"__index__": 0, "column": "1", "find_pattern": "^.*;", "replace_pattern": null}]
      • Step 9: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_column/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • awk -v OFS="\t" -v FS="\t" --re-interval --sandbox '{ $2 = gensub( /^.*;/, "", "g", $2 ) ; print $0 ; }' '/tmp/tmpy3d6hil1/files/d/d/8/dataset_dd813b4e-24a2-442b-9559-49c5d19ba78f.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/12/outputs/dataset_c845f2e4-bebf-4612-86e1-67a833b4205f.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              replacements [{"__index__": 0, "column": "2", "find_pattern": "^.*;", "replace_pattern": null}]
      • Step 10: Unlabelled step (Remove beginning1):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/coreutils:8.31--h14c3975_0

            Command Line:

            • tail -n '+2' '/tmp/tmpy3d6hil1/files/6/f/2/dataset_6f2e3762-3e6a-4232-a685-2b9210b57d60.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/13/outputs/dataset_0e11182e-ce5d-4ebd-863b-814702c3299d.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              num_lines "1"
      • Step 11: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • env -i $(which awk) --sandbox -v FS='	' -v OFS='	' --re-interval -f '/tmp/tmpy3d6hil1/job_working_directory/000/14/configs/tmpdkkwypd6' '/tmp/tmpy3d6hil1/files/6/f/2/dataset_6f2e3762-3e6a-4232-a685-2b9210b57d60.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/14/outputs/dataset_b47b63a3-0d1a-4e5c-8ede-56457b2ebcaf.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              code "BEGIN {\n rank[\"d__\"]=\"domain\";\n rank[\"p__\"]=\"phylum\";\n rank[\"c__\"]=\"class\";\n rank[\"o__\"]=\"order\";\n rank[\"f__\"]=\"family\";\n rank[\"g__\"]=\"genus\";\n rank[\"s__\"]=\"species\";\n}\nNR==1 {\n print $0 \"\\t\" \"rank\";\n next\n}\n{\n prefix = substr($1, 1, 3);\n\n if (prefix in rank)\n print $0 \"\\t\" rank[prefix];\n else\n print $0 \"\\t\" \"unclassified\";\n}"
              dbkey "?"
              variables []
      • Step 12: Unlabelled step (toolshed.g2.bx.psu.edu/repos/iuc/gtdb_to_taxdump/gtdb_to_taxdump/0.1.9+galaxy0):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gtdb_to_taxdump:0.1.9--pyhcf36b3e_0

            Command Line:

            • mkdir output &&  ln -s '/cvmfs/data.galaxyproject.org/byhand/gtdbtk_database_metadata_versioned/_release_220_downloaded_2024-09-10/bac'* 'bac_meta.tsv' && ln -s '/cvmfs/data.galaxyproject.org/byhand/gtdbtk_database_metadata_versioned/_release_220_downloaded_2024-09-10/ar'* 'ar_meta.tsv' &&  ncbi-gtdb_map.py -q gtdb_taxonomy '/tmp/tmpy3d6hil1/files/0/e/1/dataset_0e11182e-ce5d-4ebd-863b-814702c3299d.dat' 'bac_meta.tsv' 'ar_meta.tsv' --completeness 50.0 --contamination 5.0 --fraction 0.9 --max-tips 100 --column 1   -o output --procs ${GALAXY_SLOTS:-8}

            Exit Code:

            • 0

            Standard Error:

            • 2026-02-13 21:00:28,778 - Loading: bac_meta.tsv
              2026-02-13 21:02:22,099 -   Entries lacking an NCBI taxonomy: 0
              2026-02-13 21:02:22,099 -   Completeness-filtered entries: 17
              2026-02-13 21:02:22,099 -   Contamination-filtered entries: 10353
              2026-02-13 21:02:22,099 -   Entries used: 574012
              2026-02-13 21:02:22,099 - Loading: ar_meta.tsv
              2026-02-13 21:02:24,527 -   Entries lacking an NCBI taxonomy: 0
              2026-02-13 21:02:24,527 -   Completeness-filtered entries: 1
              2026-02-13 21:02:24,527 -   Contamination-filtered entries: 401
              2026-02-13 21:02:24,527 -   Entries used: 12075
              2026-02-13 21:02:24,527 - Reading in queries: /tmp/tmpy3d6hil1/files/0/e/1/dataset_0e11182e-ce5d-4ebd-863b-814702c3299d.dat
              2026-02-13 21:02:24,527 - No. of queries: 2
              2026-02-13 21:02:24,527 - No. of de-rep queries: 2
              2026-02-13 21:02:24,527 - Batching queries...
              2026-02-13 21:02:24,527 -   No. of batches: 1
              2026-02-13 21:02:24,527 -   Queries per batch: 2
              2026-02-13 21:02:24,527 - Querying taxonomies...
              2026-02-13 21:02:24,529 - PID10: Finished! Queries=2, Hits=1, No-Hits=1
              2026-02-13 21:02:24,529 - File written: output/taxonomy_map_summary.tsv
              

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              add_prefix {"__current_case__": 1, "is_select": "n"}
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              column "1"
              completeness "50.0"
              contamination "5.0"
              data {"__current_case__": 0, "is_select": "cached", "metadata": "GTDB-Tk Database Version 220 Metadata from 09-10-2024 _release_220_downloaded_2024-09-10"}
              dbkey "?"
              fraction "0.9"
              header false
              max_tips "100"
              mode "gtdb"
              no_prefix false
      • Step 13: Unlabelled step (Cut1):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/perl:5.26--2

            Command Line:

            • perl '/tmp/tmpy3d6hil1/galaxy-dev/tools/filters/cutWrapper.pl' '/tmp/tmpy3d6hil1/files/5/f/9/dataset_5f984072-add0-4189-b30f-66072c93a94c.dat' 'c2' T '/tmp/tmpy3d6hil1/job_working_directory/000/16/outputs/dataset_c0d2b636-80af-440e-90c7-535bd47b7b3c.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "tabular"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              columnList "c2"
              dbkey "?"
              delimiter "T"
      • Step 14: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_column/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • awk -v OFS="\t" -v FS="\t" --re-interval --sandbox '{ $1 = gensub( /[a-z]__/, "", "g", $1 ) ; print $0 ; }' '/tmp/tmpy3d6hil1/files/c/0/d/dataset_c0d2b636-80af-440e-90c7-535bd47b7b3c.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/17/outputs/dataset_ffd8c194-b00b-4379-ba61-e9a4d5e995b4.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              replacements [{"__index__": 0, "column": "1", "find_pattern": "[a-z]__", "replace_pattern": null}]
      • Step 15: Unlabelled step (toolshed.g2.bx.psu.edu/repos/iuc/name2taxid/name2taxid/0.20.0+galaxy0):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/taxonkit:0.20.0--h9ee0642_1

            Command Line:

            • mkdir -p ../home/.taxonkit &&  ln -s '/cvmfs/data.galaxyproject.org/byhand/ncbi_taxonomy/2024-06-05/names.dmp' 'names.dmp' && ln -s '/cvmfs/data.galaxyproject.org/byhand/ncbi_taxonomy/2024-06-05/merged.dmp' 'merged.dmp' && ln -s '/cvmfs/data.galaxyproject.org/byhand/ncbi_taxonomy/2024-06-05/nodes.dmp' 'nodes.dmp' && ln -s '/cvmfs/data.galaxyproject.org/byhand/ncbi_taxonomy/2024-06-05/delnodes.dmp' 'delnodes.dmp' &&  taxonkit name2taxid --data-dir '.' --name-field '1'   '/tmp/tmpy3d6hil1/files/f/f/d/dataset_ffd8c194-b00b-4379-ba61-e9a4d5e995b4.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/18/outputs/dataset_0a3faace-b1a8-439d-b881-da5cd89b67b5.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              data {"__current_case__": 0, "is_select": "cached", "ncbi": "2024-06-05"}
              dbkey "?"
              name_field "1"
              sci_name false
              show_rank false
      • Step 16: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • env -i $(which awk) --sandbox -v FS='	' -v OFS='	' --re-interval -f '/tmp/tmpy3d6hil1/job_working_directory/000/19/configs/tmpnsl_l2wp' '/tmp/tmpy3d6hil1/files/0/a/3/dataset_0a3faace-b1a8-439d-b881-da5cd89b67b5.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/19/outputs/dataset_da059792-86df-4eb5-b9d5-20295b99eae2.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              code "NR==1{$2=\"TaxID\"} {print}"
              dbkey "?"
              variables []
      • Step 17: Unlabelled step (Paste1):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/coreutils:8.31--h14c3975_0

            Command Line:

            • paste '/tmp/tmpy3d6hil1/files/5/f/9/dataset_5f984072-add0-4189-b30f-66072c93a94c.dat' '/tmp/tmpy3d6hil1/files/d/a/0/dataset_da059792-86df-4eb5-b9d5-20295b99eae2.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/20/outputs/dataset_06758650-a787-4a4f-8690-43187d81952e.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              delimiter "T"
      • Step 18: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_column/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • awk -v OFS="\t" -v FS="\t" --re-interval --sandbox '{ $1 = gensub( /gtdb_taxonomy/, "classification", "g", $1 ) ; $1 = gensub( //, "", "g", $1 ) ; print $0 ; }' '/tmp/tmpy3d6hil1/files/0/6/7/dataset_06758650-a787-4a4f-8690-43187d81952e.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/21/outputs/dataset_dc02e7bb-8e3d-4d6b-a565-047cade6af46.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              dbkey "?"
              replacements [{"__index__": 0, "column": "1", "find_pattern": "gtdb_taxonomy", "replace_pattern": "classification"}, {"__index__": 1, "column": "1", "find_pattern": null, "replace_pattern": null}]
      • Step 19: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_easyjoin_tool/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/mulled-v2-c5660771860859a51697ce13d5d74251dc4c8eb6:71ac32d991a15b36d275b8338fe9a13ee2a68f51-0

            Command Line:

            • cp '/tmp/shed_dir/toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/ab83aa685821/text_processing/sort-header' ./ && chmod +x sort-header && perl '/tmp/shed_dir/toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/ab83aa685821/text_processing/easyjoin'  -t $'\t' --header -e '0' -o auto  -1 '2' -2 '1' '/tmp/tmpy3d6hil1/files/c/8/4/dataset_c845f2e4-bebf-4612-86e1-67a833b4205f.dat' '/tmp/tmpy3d6hil1/files/d/c/0/dataset_dc02e7bb-8e3d-4d6b-a565-047cade6af46.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/22/outputs/dataset_571ba3fd-40cb-4dfc-8f6c-db644b160b1d.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              column1 "2"
              column2 "1"
              dbkey "?"
              empty_string_filler "0"
              header true
              ignore_case false
              jointype " "
      • Step 20: Unlabelled step (Cut1):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/perl:5.26--2

            Command Line:

            • perl '/tmp/tmpy3d6hil1/galaxy-dev/tools/filters/cutWrapper.pl' '/tmp/tmpy3d6hil1/files/5/7/1/dataset_571ba3fd-40cb-4dfc-8f6c-db644b160b1d.dat' 'c2,c1,c25,c26' T '/tmp/tmpy3d6hil1/job_working_directory/000/23/outputs/dataset_d26fa979-315e-40a4-a14f-c292a46a5d79.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "tsv"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              columnList "c2,c1,c25,c26"
              dbkey "?"
              delimiter "T"
      • Step 21: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_easyjoin_tool/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/mulled-v2-c5660771860859a51697ce13d5d74251dc4c8eb6:71ac32d991a15b36d275b8338fe9a13ee2a68f51-0

            Command Line:

            • cp '/tmp/shed_dir/toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/ab83aa685821/text_processing/sort-header' ./ && chmod +x sort-header && perl '/tmp/shed_dir/toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/ab83aa685821/text_processing/easyjoin'  -t $'\t' --header -e '0' -o auto  -1 '1' -2 '2' '/tmp/tmpy3d6hil1/files/b/4/7/dataset_b47b63a3-0d1a-4e5c-8ede-56457b2ebcaf.dat' '/tmp/tmpy3d6hil1/files/d/2/6/dataset_d26fa979-315e-40a4-a14f-c292a46a5d79.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/24/outputs/dataset_6b18fc97-1533-4bb7-a285-e85ec7076b3a.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              column1 "1"
              column2 "2"
              dbkey "?"
              empty_string_filler "0"
              header true
              ignore_case false
              jointype " "
      • Step 22: Unlabelled step (toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy3):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/gawk:5.3.1

            Command Line:

            • env -i $(which awk) --sandbox -v FS='	' -v OFS='	' --re-interval -f '/tmp/tmpy3d6hil1/job_working_directory/000/25/configs/tmp4aox0fs5' '/tmp/tmpy3d6hil1/files/6/b/1/dataset_6b18fc97-1533-4bb7-a285-e85ec7076b3a.dat' > '/tmp/tmpy3d6hil1/job_working_directory/000/25/outputs/dataset_3dda887c-a740-4f1a-a45d-22f5dcee73ef.dat'

            Exit Code:

            • 0

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              code "NR == 1 { $0 = \"BinID\\tRank\\tGTDB_name\\tNCBI_name\\tNCBI_TaxID\"; } { print }"
              dbkey "?"
              variables []
      • Step 23: Unlabelled step (toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.33+galaxy0):

        • step_state: scheduled

        • Jobs
          • Job 1:

            • Job state is ok

            Container:

            • quay.io/biocontainers/multiqc:1.33--pyhdfd78af_0

            Command Line:

            • die() { echo "$@" 1>&2 ; exit 1; } &&  mkdir multiqc_WDir &&   mkdir multiqc_WDir/gtdbtk_0 &&         grep -Pq 'user_genome	classification	closest_genome_reference	closest_genome_reference_radius	closest_genome_taxonomy	closest_genome_ani' /tmp/tmpy3d6hil1/files/3/8/6/dataset_386e43eb-f286-4ec3-b8cc-136ac511f80a.dat || die "Module 'gtdbtk: 'user_genome	classification	closest_genome_reference	closest_genome_reference_radius	closest_genome_taxonomy	closest_genome_ani' not found in the file 'gtdbtk_bac120_summary'" && ln -s '/tmp/tmpy3d6hil1/files/3/8/6/dataset_386e43eb-f286-4ec3-b8cc-136ac511f80a.dat' 'multiqc_WDir/gtdbtk_0/gtdbtk_bac120_summary'  &&    mkdir multiqc_WDir/custom_content_1 && ln -s '/tmp/tmpy3d6hil1/files/3/d/d/dataset_3dda887c-a740-4f1a-a45d-22f5dcee73ef.dat' 'multiqc_WDir/custom_content_1/file_1_0' && more /tmp/tmpy3d6hil1/files/3/d/d/dataset_3dda887c-a740-4f1a-a45d-22f5dcee73ef.dat &&   multiqc multiqc_WDir --filename 'report'      --config '/tmp/tmpy3d6hil1/job_working_directory/000/26/configs/tmpnqnbn5gy'  && mkdir -p ./plots && ls -l ./report_data/ && cp ./report_data/*plot*.txt ./plots/ | true

            Exit Code:

            • 0

            Standard Error:

            • /// MultiQC 🔍 v1.33
              
                          config | Loading config settings from: /tmp/tmpy3d6hil1/job_working_directory/000/26/configs/tmpnqnbn5gy
                     file_search | Search path: /tmp/tmpy3d6hil1/job_working_directory/000/26/working/multiqc_WDir
              
                  custom_content | section_1: Found 2 samples (PlotType.TABLE)
                          gtdbtk | Found 4 reports
              
                   write_results | Data        : report_data
                   write_results | Report      : report.html
                         multiqc | MultiQC complete
              cp: cannot stat './report_data/*plot*.txt': No such file or directory
              

            Standard Output:

            • ::::::::::::::
              /tmp/tmpy3d6hil1/files/3/d/d/dataset_3dda887c-a740-4f1a-a45d-22f5dcee73ef.dat
              ::::::::::::::
              BinID	Rank	GTDB_name	NCBI_name	NCBI_TaxID
              Unclassified	unclassified	50contig_reads_binette_bin2.fasta	unclassified	12908
              Unclassified	unclassified	50contig_reads_binette_bin3.fasta	unclassified	12908
              Unclassified	unclassified	50contig_reads_binette_bin4.fasta	unclassified	12908
              s__Erysipelothrix rhusiopathiae	species	50contig_reads_binette_bin1.fasta	Erysipelothrix rhusiopathiae	1648
              total 1072
              -rw-r--r-- 1 1001 1001    789 Feb 13 21:04 gtdbtk-first-table.txt
              -rw-r--r-- 1 1001 1001   5156 Feb 13 21:04 llms-full.txt
              -rw-r--r-- 1 1001 1001  10089 Feb 13 21:04 multiqc.log
              -rw-r--r-- 1 1001 1001 913007 Feb 13 21:04 multiqc.parquet
              -rw-r--r-- 1 1001 1001    122 Feb 13 21:04 multiqc_citations.txt
              -rw-r--r-- 1 1001 1001 137823 Feb 13 21:04 multiqc_data.json
              -rw-r--r-- 1 1001 1001    340 Feb 13 21:04 multiqc_general_stats.txt
              -rw-r--r-- 1 1001 1001   1774 Feb 13 21:04 multiqc_gtdbtk.txt
              -rw-r--r-- 1 1001 1001    230 Feb 13 21:04 multiqc_section_1_table.txt
              -rw-r--r-- 1 1001 1001    167 Feb 13 21:04 multiqc_sources.txt
              

            Traceback:

            Job Parameters:

            • Job parameter Parameter value
              __input_ext "input"
              __workflow_invocation_uuid__ "3f427874091d11f1aadf7c1e52cc38fd"
              chromInfo "/tmp/tmpy3d6hil1/galaxy-dev/tool-data/shared/ucsc/chrom/?.len"
              comment ""
              dbkey "?"
              export false
              flat false
              image_content_input None
              png_plots false
              results [{"__index__": 0, "software_cond": {"__current_case__": 46, "input": {"values": [{"id": 5, "src": "hdca"}]}, "software": "gtdbtk"}}, {"__index__": 1, "software_cond": {"__current_case__": 48, "description": null, "input": {"values": [{"id": 22, "src": "hdca"}]}, "plot_type": "table", "section_name": "MGnify magtch table", "software": "custom_content", "title": null, "xlab": null, "ylab": null}}]
              title ""
    • Other invocation details
      • history_id

        • 71d9b1051d7c0983
      • history_state

        • ok
      • invocation_id

        • 71d9b1051d7c0983
      • invocation_state

        • scheduled
      • workflow_id

        • 71d9b1051d7c0983

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new Galaxy workflow under workflows/microbiome/ to classify MAGs with GTDB-Tk and map the resulting taxonomy to NCBI names/taxIDs, along with IWC packaging files so it can be tested and published via Dockstore.

Changes:

  • Introduces the MAGs taxonomy annotation Galaxy workflow (.ga) including MultiQC reporting.
  • Adds a Dockstore descriptor (.dockstore.yml) and a wftest file (*-tests.yml) using Zenodo-hosted inputs.
  • Adds workflow documentation (README.md) and a versioned CHANGELOG.md.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 12 comments.

Show a summary per file
File Description
workflows/microbiome/mags-taxonomy-annotation/MAGs-taxonomy-annotation.ga New Galaxy workflow implementing GTDB-Tk classification, GTDB→NCBI mapping, and MultiQC reporting.
workflows/microbiome/mags-taxonomy-annotation/MAGs-taxonomy-annotation-tests.yml New workflow test definition using Zenodo inputs and output assertions.
workflows/microbiome/mags-taxonomy-annotation/.dockstore.yml Dockstore entry to publish the workflow and run tests.
workflows/microbiome/mags-taxonomy-annotation/README.md User-facing workflow documentation (inputs/outputs/logic).
workflows/microbiome/mags-taxonomy-annotation/CHANGELOG.md Initial release entry for the new workflow.

@@ -0,0 +1,1002 @@
{
"a_galaxy_workflow": "true",
"annotation": "This workflow uses GTDB-Tk to classify an input sequence collection (e.g., bins from SemiBin2 or MetaBat) and maps the resulting taxonomy to NCBI taxIDs and names to reconcile differences between classification systems.\n",
Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Workflow-level annotation does not follow the repository guidance to start with "This workflow does/runs/performs …". Please rephrase the annotation to match that required format (and keep it a short description of action + output).

Suggested change
"annotation": "This workflow uses GTDB-Tk to classify an input sequence collection (e.g., bins from SemiBin2 or MetaBat) and maps the resulting taxonomy to NCBI taxIDs and names to reconcile differences between classification systems.\n",
"annotation": "This workflow performs taxonomic classification of an input sequence collection (e.g., bins from SemiBin2 or MetaBat) using GTDB-Tk and maps the resulting taxonomy to NCBI taxIDs and names to reconcile differences between classification systems.\n",

Copilot uses AI. Check for mistakes.
- /MAGs-taxonomy-annotation-tests.yml
authors:
- name: Santino Faack
orcid: 0000-0003-2982-388X
Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Author ORCID here (0000-0003-2982-388X) must match the creator metadata in the .ga workflow file. Currently the workflow creator.identifier is different, which breaks the required alignment between .dockstore.yml and workflow metadata.

Suggested change
orcid: 0000-0003-2982-388X

Copilot uses AI. Check for mistakes.
"owner": "bgruening",
"tool_shed": "toolshed.g2.bx.psu.edu"
},
"tool_state": "{\"code\": \"NR == 0 || !seen[$0]++\", \"infile\": {\"__class__\": \"ConnectedValue\"}, \"variables\": [], \"__page__\": 0, \"__rerun_remap_job_id__\": null}",
Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The AWK condition NR == 0 is never true (AWK record numbers start at 1). As written, this step is equivalent to !seen[$0]++ and will not preserve a header line if that was intended. If you meant to keep the first line, use NR == 1 (or handle headers explicitly).

Suggested change
"tool_state": "{\"code\": \"NR == 0 || !seen[$0]++\", \"infile\": {\"__class__\": \"ConnectedValue\"}, \"variables\": [], \"__page__\": 0, \"__rerun_remap_job_id__\": null}",
"tool_state": "{\"code\": \"NR == 1 || !seen[$0]++\", \"infile\": {\"__class__\": \"ConnectedValue\"}, \"variables\": [], \"__page__\": 0, \"__rerun_remap_job_id__\": null}",

Copilot uses AI. Check for mistakes.
"owner": "bgruening",
"tool_shed": "toolshed.g2.bx.psu.edu"
},
"tool_state": "{\"infile\": {\"__class__\": \"ConnectedValue\"}, \"replacements\": [{\"__index__\": 0, \"column\": \"1\", \"find_pattern\": \"gtdb_taxonomy\", \"replace_pattern\": \"classification\"}, {\"__index__\": 1, \"column\": \"1\", \"find_pattern\": null, \"replace_pattern\": null}], \"__page__\": 0, \"__rerun_remap_job_id__\": null}",
Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This Replace Text step includes an extra replacement entry where both find_pattern and replace_pattern are null. This is atypical in this repo and may cause parameter validation/runtime failures depending on the tool’s expectations. Please remove the empty replacement row so only intentional replacements are configured.

Copilot uses AI. Check for mistakes.
Comment on lines 3 to 4
This workflow create a taxonomy annotation for MAGs with GTDB-Tk. With the help of some other tools MAGs also get the classification from NCBI.

Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

README has multiple grammar issues in the intro sentence (e.g., subject/verb agreement: "This workflow create" → "This workflow creates"). Please proofread and correct for clarity.

Copilot uses AI. Check for mistakes.
Comment on lines 3 to 4
This workflow create a taxonomy annotation for MAGs with GTDB-Tk. With the help of some other tools MAGs also get the classification from NCBI.

Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is an existing microbiome workflow that already performs GTDB-Tk-based MAG taxonomy as part of a broader MAG pipeline (workflows/microbiome/mags-building). Per workflow README guidelines, consider adding a short note explaining how this workflow differs and when to prefer it (e.g., lightweight taxonomy mapping vs end-to-end MAG generation).

Suggested change
This workflow create a taxonomy annotation for MAGs with GTDB-Tk. With the help of some other tools MAGs also get the classification from NCBI.
This workflow creates a taxonomy annotation for MAGs with GTDB-Tk. With the help of some other tools, MAGs also get the classification from NCBI.
A related workflow, `workflows/microbiome/mags-building`, already performs GTDB-Tk-based taxonomy as part of a broader end-to-end MAG generation pipeline (including assembly, binning, and quality control). In contrast, this MAGs taxonomy annotation workflow is a lightweight option that only takes precomputed MAGs as input and adds consistent GTDB and NCBI taxonomic labels. Prefer this workflow when you already have MAG bins and only need standardized taxonomy annotation; prefer the MAGs building workflow when you need to generate MAGs and assign taxonomy in a single comprehensive pipeline.

Copilot uses AI. Check for mistakes.
"creator": [
{
"class": "Person",
"identifier": "https://orcid.org/0009-0004-0382-2023",
Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The workflow creator ORCID (https://orcid.org/0009-0004-0382-2023) does not match the ORCID listed for the author in this workflow’s .dockstore.yml (0000-0003-2982-388X). Please align these so .dockstore.yml authors match the workflow creator metadata.

Suggested change
"identifier": "https://orcid.org/0009-0004-0382-2023",
"identifier": "https://orcid.org/0000-0003-2982-388X",

Copilot uses AI. Check for mistakes.
"owner": "iuc",
"tool_shed": "toolshed.g2.bx.psu.edu"
},
"tool_state": "{\"advanced\": {\"min_perc_aa\": \"10\", \"force\": false, \"min_af\": \"0.65\", \"full_tree\": false, \"output_process_log\": false}, \"gtdbtk_db\": \"full_database_release_220_downloaded_2024-10-28\", \"input\": {\"__class__\": \"ConnectedValue\"}, \"__page__\": 0, \"__rerun_remap_job_id__\": null}",
Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

gtdbtk_db is hardcoded to a specific instance/data-manager entry ("full_database_release_220_downloaded_2024-10-28"). This makes the workflow less portable across Galaxy servers. Consider exposing the GTDB-Tk database as a workflow input (as done in workflows/microbiome/mags-building/MAGs-generation.ga, where gtdbtk_db is a ConnectedValue) and wiring it into this step.

Suggested change
"tool_state": "{\"advanced\": {\"min_perc_aa\": \"10\", \"force\": false, \"min_af\": \"0.65\", \"full_tree\": false, \"output_process_log\": false}, \"gtdbtk_db\": \"full_database_release_220_downloaded_2024-10-28\", \"input\": {\"__class__\": \"ConnectedValue\"}, \"__page__\": 0, \"__rerun_remap_job_id__\": null}",
"tool_state": "{\"advanced\": {\"min_perc_aa\": \"10\", \"force\": false, \"min_af\": \"0.65\", \"full_tree\": false, \"output_process_log\": false}, \"gtdbtk_db\": {\"__class__\": \"ConnectedValue\"}, \"input\": {\"__class__\": \"ConnectedValue\"}, \"__page__\": 0, \"__rerun_remap_job_id__\": null}",

Copilot uses AI. Check for mistakes.
"owner": "iuc",
"tool_shed": "toolshed.g2.bx.psu.edu"
},
"tool_state": "{\"comment\": \"\", \"export\": false, \"flat\": false, \"image_content_input\": {\"__class__\": \"RuntimeValue\"}, \"png_plots\": false, \"results\": [{\"__index__\": 0, \"software_cond\": {\"software\": \"gtdbtk\", \"__current_case__\": 46, \"input\": {\"__class__\": \"RuntimeValue\"}}}, {\"__index__\": 1, \"software_cond\": {\"software\": \"custom_content\", \"__current_case__\": 48, \"plot_type\": \"table\", \"section_name\": \"MGnify magtch table\", \"title\": null, \"description\": null, \"xlab\": null, \"ylab\": null, \"input\": {\"__class__\": \"RuntimeValue\"}}}], \"title\": \"\", \"__page__\": 0, \"__rerun_remap_job_id__\": null}",
Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

MultiQC custom section name contains a typo: MGnify magtch tableMGnify match table (or another accurate section title).

Suggested change
"tool_state": "{\"comment\": \"\", \"export\": false, \"flat\": false, \"image_content_input\": {\"__class__\": \"RuntimeValue\"}, \"png_plots\": false, \"results\": [{\"__index__\": 0, \"software_cond\": {\"software\": \"gtdbtk\", \"__current_case__\": 46, \"input\": {\"__class__\": \"RuntimeValue\"}}}, {\"__index__\": 1, \"software_cond\": {\"software\": \"custom_content\", \"__current_case__\": 48, \"plot_type\": \"table\", \"section_name\": \"MGnify magtch table\", \"title\": null, \"description\": null, \"xlab\": null, \"ylab\": null, \"input\": {\"__class__\": \"RuntimeValue\"}}}], \"title\": \"\", \"__page__\": 0, \"__rerun_remap_job_id__\": null}",
"tool_state": "{\"comment\": \"\", \"export\": false, \"flat\": false, \"image_content_input\": {\"__class__\": \"RuntimeValue\"}, \"png_plots\": false, \"results\": [{\"__index__\": 0, \"software_cond\": {\"software\": \"gtdbtk\", \"__current_case__\": 46, \"input\": {\"__class__\": \"RuntimeValue\"}}}, {\"__index__\": 1, \"software_cond\": {\"software\": \"custom_content\", \"__current_case__\": 48, \"plot_type\": \"table\", \"section_name\": \"MGnify match table\", \"title\": null, \"description\": null, \"xlab\": null, \"ylab\": null, \"input\": {\"__class__\": \"RuntimeValue\"}}}], \"title\": \"\", \"__page__\": 0, \"__rerun_remap_job_id__\": null}",

Copilot uses AI. Check for mistakes.
Comment on lines 20 to 24
- GTDB-Tk summary files(s)
- GTDB-NCBI mapping file(s)
- NCBI name to taxID mapping file(s)
- a full table of all mappings joint together
- MultiQC HTML report with GTBD-Tk and the full mapping table as input No newline at end of file
Copy link

Copilot AI Feb 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Output section contains several typos (e.g., "files(s)", "joint together", and "GTBD-Tk" should be "GTDB-Tk"). Please fix these to avoid confusing users.

Suggested change
- GTDB-Tk summary files(s)
- GTDB-NCBI mapping file(s)
- NCBI name to taxID mapping file(s)
- a full table of all mappings joint together
- MultiQC HTML report with GTBD-Tk and the full mapping table as input
- GTDB-Tk summary file(s)
- GTDB-NCBI mapping file(s)
- NCBI name to taxID mapping file(s)
- a full table of all mappings joined together
- MultiQC HTML report with GTDB-Tk and the full mapping table as input

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants