I am an old hand at MEGAN vers 5.9 but I am having a bit of trouble using DIAMOND 0.9.9 and MEGAN 6.8.9.
I have been testing using diamond to do blast against NCBI nr downloaded and formated for diamond. I get nice blast outputs in the default tab format with fasta or fastq, and have got the .daa output for the same files.
I use the GUI for importing blast or meganising the .daa file. The problem is I keep getting no taxonomic assignments and very few functional ones even though I am specifying the mappng files from the download page for the community version. I also made sure I used the correct kegg map file (old version). This happens when I try to use either the .daa file or a blast file… I am blasting 23 million RNA sequences with diamond. The blast will return 72670123 hits. When I import I get even less hits at 5,449,250.
Any idea why this is happening and how I can fix it to get proper assignment like I used to in MEGAN5?
here is the info from the MEGAN log:
Executing: import blastFile=’/group/soil/helen/diamond/AV160testfastq.m8’ fastaFile=’/group/soil/helen/diamond/AV160_L007_MGFiltered_2P.fastq’ meganFile=’/group/soil/helen/diamond/AV160testfastq.rma6’ useCompression=true format=BlastTab mode=BlastX maxMatches=5 minScore=50.0 maxExpected=0.01 minPercentIdentity=0.0 topPercent=10.0 minSupportPercent=0.01 minSupport=1 lcaAlgorithm=naive minPercentReadToCover=0.0 minComplexity=0.0 useIdentityFilter=false readAssignmentMode=readCount fNames=EGGNOG INTERPRO2GO KEGG SEED;
Executing: ‘import’‘blastFile’’=’’/group/soil/helen/diamond/AV160testfastq.m8’‘fastaFile’’=’’/group/soil/helen/diamond/AV160_L007_MGFiltered_2P.fastq’‘meganFile’’=’’/group/soil/helen/diamond/AV160testfastq.rma6’‘useCompression’’=’‘true’‘format’’=’‘BlastTab’‘mode’’=’‘BlastX’‘maxMatches’’=’‘5’‘minScore’’=’‘50.0’‘maxExpected’’=’‘0.01’‘minPercentIdentity’’=’‘0.0’‘topPercent’’=’‘10.0’‘minSupportPercent’’=’‘0.01’‘minSupport’’=’‘1’‘lcaAlgorithm’’=’‘naive’‘minPercentReadToCover’’=’‘0.0’‘minComplexity’’=’‘0.0’‘useIdentityFilter’’=’‘false’‘readAssignmentMode’’=’‘readCount’‘fNames’’=’‘EGGNOG’‘INTERPRO2GO’‘KEGG’‘SEED’;
Classifications: Taxonomy,SEED,EGGNOG,KEGG,INTERPRO2GO
Parsing file: /group/soil/helen/diamond/AV160testfastq.m8
Total reads: 5,449,250
Alignments: 21,222,213
Binning reads…
Using Naive LCA algorithm for binning: Taxonomy
Using Best-Hit algorithm for binning: SEED
Using Best-Hit algorithm for binning: EGGNOG
Using Best-Hit algorithm for binning: KEGG
Using Best-Hit algorithm for binning: INTERPRO2GO
Total reads: 5,449,250
With hits: 5,449,250
Alignments: 21,222,213
Assig. Taxonomy: 0
Assig. SEED: 0
Assig. EGGNOG: 0
Assig. KEGG: 0
Assig. INTERPRO2GO: 0
MinSupport set to: 544
Min-supp. changes: 0
Numb. Tax. classes: 1
Numb. SEED classes: 1
Numb. EGG. classes: 1
Numb. KEGG classes: 1
Numb. INT. classes: 1
Class. Taxonomy: 1
Class. SEED: 1
Class. EGGNOG: 1
Class. KEGG: 1
Class. INTERPRO2GO: 1
Info: Command completed (1427s): ‘import’‘blastFile’’=’’/group/soil/helen/diamond/AV160testfastq.m8’‘fastaFile’’=’’/group/soil/helen/diamond/AV160_L007_MGFi…
Induced tree has 2 of 1,601,131 nodes
Induced tree has 2 of 1,601,131 nodes