Extract reads from DAA files

Hi, I need to extract read counts for specific taxa from DAA files after running a DIAMOND analysis. I want to do this from the command line to avoid loading the files into MEGAN 6. What is the command-line instruction to extract reads for particular taxa?

Hi @vinoth

You can use tools/daa2info to extract classification counts. The general command is:

tools/daa2info -i file.daa -o output.txt -c2c Taxonomy -n true -p true
  • -n specifies that you need names instead of taxa IDs.
  • -p indicates you need the whole path.

You can also use -mro to include major ranks only, and --sum to get the summary count instead of the assigned count, depending on your requirements.

Regards,
Anupam

Hi Anupam,
Thanks for sharing the information

1 Like