I’m new to MEGAN and I’m interested in the taxonomic analysis at the moment. I used DIAMOND to align my reads but I have a problem with daa-meganizer. Here is the error message I get:
java.lang.NullPointerException
at java.lang.System.arraycopy(Native Method)
at java.io.BufferedOutputStream.write(BufferedOutputStream.java:128)
at megan.daa.io.DAAModifier.appendBlocks(DAAModifier.java:137)
at megan.daa.io.DAAModifier.appendBlocks(DAAModifier.java:155)
at megan.daa.DAAReferencesAnnotator.apply(DAAReferencesAnnotator.java:150)
at megan.daa.Meganize.apply(Meganize.java:66)
at megan.tools.DAAMeganizer.run(DAAMeganizer.java:216)
at megan.tools.DAAMeganizer.main(DAAMeganizer.java:54)
Before using daa-meganizer, I used DIAMOND to align my reads against the NR database. Here is the command I used:
I’ve taken a look at the part of the code where meganizer throws the exception, but it is not clear to me what the problem is. Could you tell me the size of the DAA file? (Is it very big, i.e. more than 50 GB? That shouldn’t be a problem, though.) Is it possible that you ran out of disk space? That could cause the null pointer exception.
When did you download the file prot_acc2tax-Nov2016.abin? I recently uploaded a new version of the file because it was causing problems for some users. Unless you downloaded it within the last few days, please try re-downloading the file and then rerunning the program.
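A quick way to answer both questions (file size and free disk space) is a small Python sketch like the one below. It is only an illustration, not part of MEGAN: the function name `space_report` is made up for this example, and the path to the DAA file is supplied on the command line. Free space is measured on the volume that holds the DAA file, on the assumption that this is where daa-meganizer writes its data.

```python
import os
import shutil
import sys


def space_report(daa_path):
    """Return (size of the file, free space on its volume), both in bytes.

    Hypothetical helper for this thread; not part of MEGAN.
    """
    daa_path = os.path.abspath(daa_path)
    size = os.path.getsize(daa_path)
    # Measure free space on the volume that contains the DAA file.
    free = shutil.disk_usage(os.path.dirname(daa_path)).free
    return size, free


if __name__ == "__main__" and len(sys.argv) > 1:
    size, free = space_report(sys.argv[1])
    gib = 1024 ** 3
    print(f"DAA file size: {size / gib:.2f} GiB")
    print(f"Free space on the same volume: {free / gib:.2f} GiB")
```

Saved as, say, `check_space.py`, it would be run as `python check_space.py <your DAA file>`. If the free space is close to zero, running out of disk space becomes a plausible explanation for the failure.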
If all that fails, then please give me access to a file that exhibits the problem and I will run the program in my debugger.
D
I don’t think that you need to download the NR database and DIAMOND again. My guess is that it might be the mapping file prot_acc2tax-Nov2016.abin (because the error occurs in a part of the code that deals with the mapping file).
In theory MEGAN should be able to process SAM as generated by Bowtie, but this hasn’t been tested much (because it usually isn’t a good idea to run Bowtie on microbiome data unless the organisms in the sample are very closely related to sequenced reference organisms). If you run into any problems, please let me know.
Please note that daa-meganizer does not produce a new file, so -o is not a valid option for the program.
Unfortunately, I can’t reproduce the error that you reported; for me, daa-meganizer runs without problems on your dataset. The log file that you sent me indicates that the problem lies with parsing of the mapping file prot_acc2tax-Nov2016.abin.
Can you please double-check that your downloaded version of prot_acc2tax-Nov2016.abin is not corrupted?
To do so, please run
md5sum prot_acc2tax-Nov2016.abin
and verify that you get this checksum: af338c5056e1a450ebc9d089f330ca40
If not, please re-download the file and check again. Let me know if you still get a discrepancy.