Dear Prof. Ruan,
I assembled a plant genome (~600 Mb, ~1.94% heterozygosity) from ~400G of PacBio Sequel II CCS data with the following command:
wtdbg2 -t 0 -x ccs -g 600m -i ccs23.fastq.gz -o beichai34 -e 2
The k-mer distribution reported in the log was:
|
|
|
|
|
|
|
|
|
||
||
||
||
||
|||
|||
||||
|||||
|||||||
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
********************** 1 - 201 **********************
Quatiles:
10% 20% 30% 40% 50% 60% 70% 80% 90% 95%
1 2 4 6 11 20 55 269 1779 9742
** PROC_STAT(0) **: real 2439.237 sec, user 6326.840 sec, sys 773.720 sec, maxrss 95177552.0 kB, maxvsize 130789572.0 kB
[Wed Apr 19 12:10:56 2023] - high frequency kmer depth is set to 13776
[Wed Apr 19 12:10:56 2023] - Total kmers = 728629161
[Wed Apr 19 12:10:56 2023] - average kmer depth = 7
[Wed Apr 19 12:10:56 2023] - 368836231 low frequency kmers (<2)
[Wed Apr 19 12:10:56 2023] - 4011 high frequency kmers (>13776)
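To put the counts in the log lines above in proportion, here is a minimal Python sketch that extracts them and computes the share of k-mers in the low-frequency and high-frequency bins. It is not part of wtdbg2; the helper name `kmer_summary` is hypothetical, and the regular expressions assume the log wording stays exactly as pasted here.

```python
import re

# Log lines copied verbatim from the wtdbg2 output above.
LOG = """\
[Wed Apr 19 12:10:56 2023] - Total kmers = 728629161
[Wed Apr 19 12:10:56 2023] - average kmer depth = 7
[Wed Apr 19 12:10:56 2023] - 368836231 low frequency kmers (<2)
[Wed Apr 19 12:10:56 2023] - 4011 high frequency kmers (>13776)
"""


def kmer_summary(log_text):
    """Hypothetical helper: pull k-mer counts out of the log text.

    Assumes the three lines matched below are present in this wording.
    """
    total = int(re.search(r"Total kmers = (\d+)", log_text).group(1))
    low = int(re.search(r"(\d+) low frequency kmers", log_text).group(1))
    high = int(re.search(r"(\d+) high frequency kmers", log_text).group(1))
    return {
        "total": total,
        "low": low,
        "high": high,
        "low_fraction": low / total,
        "high_fraction": high / total,
    }


summary = kmer_summary(LOG)
print(f"low-frequency share:  {summary['low_fraction']:.1%}")
print(f"high-frequency share: {summary['high_fraction']:.4%}")
```

With the numbers from this run, roughly half of the k-mers (368836231 of 728629161) fall into the low-frequency (<2) bin.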
In the end I obtained only 4 contigs (TOT 54784).
How should I adjust the parameters to get a reliable assembly in this case? Thanks a lot.