You can import data in CSV format.
For this, you prepare a file that either contains:
accession-string <tab> peptide-count
or
peptide-count <tab> accession-string <tab> 50
In the first case, you list accessions and how many peptides have been assigned to each accession.
In the second case, you list a peptide name (it could be the peptide sequence itself) and then the accession that the peptide
maps to. This needs to be followed by a score, such as 50, say.
Then, when importing you have to select “Taxonomy” and then click the “Parse accession ids”. The latter will drop you into a dialog that will allow you to specify a “mapping” file.
This could be one downloaded from the MEGAN webpage, or one that you setup yourself.
Here is a fake example:
GGLEQHLIAL,WP_057281773.1,50
EAILLKESGI,WP_035448405.1,50
(using commas rather than tabs…)
If you set the taxonomy accession mapping file to
prot_acc2tax-Nov2018X1.abin then MEGAN will happily import these two fake peptides and will assign one to
Bacteroides and the other to Bacteroides uniformis.
If a peptide has multiple assignments then list it multiple times consecutively with the different accessions.
Let me know if you have problems getting this to work.