Hello Megan/Malt devs and community!
I’m trying to build a custom malt database to use for assigning taxonomies in a metagenomic fasta with malt version 0.6.1 in a remote linux server. My problem is that malt-run performs the alignment but can’t seem to make any taxonomic assignments.
My reference fasta is pretty small, it includes 12,420 mtDNA sequences (16,043,541 bp) with NCBI accession numbers as the headers. I ran malt-build multiple ways, using the -a2t flag with both the megan-nucl-Feb2022.db and NCBI’s nucl_gb.accession2taxid. Malt-build runs without errors but I get the same issue with malt-run whether I use the megan or ncbi file.
malt-build code with the megan db file:
malt-build --input PacificNorthWest_PalaeoFaunalReferences.fasta.gz -a2t megan-nucl-Feb2022.db --index malt_index_megandb --sequenceType DNA --verbose
I want to run a weighted LCA on my input fasta.gz file, but I have also tried the naive LCA option, with similarly un-assigned results. Here is the weighted LCA code:
malt-run --inFile SC-10c-TCMP_S207_L001.unmapped.fasta.gz --index malt_index_megandb --mode BlastN --output test_malt_megandb --alignments test_malt_megandb --maxExpected 0.00001 --weightedLCA TRUE --minSupport 3 --minPercentIdentityLCA 95 --lcaCoveragePercent 80 --numThreads 12 --verbose
I’ve run this sample with Kraken and with Bowtie2 to the refseq mitochondrion database and there is a high diversity of taxa so it’s not an issue with the sample.
The job runs without errors. The initial alignment steps seem to be working well, with 106,255 aligned queries and 2,230,669 alignments, so I suspect there isn’t an issue with my input fasta file that I’m trying to assign taxonomies to. I ran the LCA with a variety of more permissive arguments so I don’t think it’s that I’ve been too strict. However, no matter how I set it the LCA computation step assigns 0 total matches, 0 total references, and 0 total weights. When I malt-run with the naive LCA I get “Assig. Taxonomy” of 0. The rma files also show many alignments and 0 assignments. I’ve attached a text file with the run output.
I’m not seeing a similar question in the megan community forum and it’s coming up with different LCA options, so I’m guessing this must be an issue with the way I’m building the database. Can you provide any guidance on this issue?
Thanks very much for any help!
Libby
malt-run_output.txt (3.8 KB)