I’m currently doing an internship in bioinformatics(Metagenome analysis) and I am trying to make MEGAN6 run on the files which our Metagenome sequencing alignment pipeline https://github.com/MHH-RCUG/Wochenende puts out. It is a BAM file, which I converted to SAM.
I have downloaded the nucl_acc2tax and nucl_gi2tax files and created a synonym mapping file according to the instructions in the manual, to map the reference sequences to the corresponding NCBI TaxID.
For testing purposes, I created an artificial metagenome, containing 4 microorganisms and the human genome, with a read length of 75.
When I try to import the SAM File (Import from BLAST) I get no proper results and MEGAN gives me the following messages:
"WARNING: Failed to find read ‘…’ in file ‘…’ "
MEGAN can not find the reads from the pipeline output in the pipeline input .fastq containing the reads. I have checked and they are definetly present in the file. What can be the issue here?
Secondly, whether I have added the (optional) READS fastq, I am getting the following Error message:
“Initializing binning…
Using ‘Naive LCA’ algorithm (80.0 %) for binning: Taxonomy
Binning reads…
StringIndexOutOfBoundsException: String index out of range: 75”
What can be the reason for this exception? Is the read length the problem here? I have searched the forum for a similar problem, but I did not get that lucky.
Thanks in advance!