In the paper we did suggest 80% as the default, but we have since seen that 51% seems to work better. However, there is a tradeoff between false positives (too specific assignment) and false negatives (too unspecific assignment)…
Re 2: did you see the following line in the message window:
Thanks for the reply! Is that recommended lcp for contigs, long reads or both? Does it matter if this is nucleotide-nucleotide alignments or nucleotide-protein? Do you have a new set of recommended parameters?
Re #2: I I just sent you an RMA file and details to your email to debug.
I took a look at your file. The RMA was generated without supplying the associated read sequences. Unfortunately, MEGAN uses those sequences to determine the length of reads and uses a length of 0, if the sequence is present. That is why all your reads pass the min coverage filter. Can you recreate the RMA files and supply the corresponding fasta or fastq files while doing so… Then the filter should work.
Re to your question along nucleotide-nucleotide alignments: we have not looked into those… DNA-to-DNA only makes sense (in my view) if you know that the organisms in your sample have already been sequenced and their DNA is available… That is usually not the case.