Skip to content

Commit 91a355e

Browse files
committed
adding mmseqs2
1 parent 6400ca0 commit 91a355e

File tree

1 file changed

+9
-2
lines changed

1 file changed

+9
-2
lines changed

MMSeqs2/README.md

Lines changed: 9 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,6 @@ mmseqs easy-taxonomy sequence.fasta UniRef50/UniRef50 easy-taxonomy/sequence_tax
3838

3939
The results will be in a series of files in the `easy-taxonomy` directory, whose names start with `sequence_taxonomy`:
4040

41-
SAGCFN_22_00809_S34_lca.tsv.gz SAGCFN_22_00809_S34_report.gz SAGCFN_22_00809_S34_tophit_aln.gz SAGCFN_22_00809_S34_tophit_report.gz
4241

4342
- `sequence_taxonomy_lca.tsv.gz`: The lowest common ancestor of the sequences in tab separated text.
4443

@@ -53,7 +52,7 @@ Columns are:
5352
2. the taxonomy ID from [NCBI taxonomy](https://www.ncbi.nlm.nih.gov/datasets/taxonomy/tree). For example, this is [310915](https://www.ncbi.nlm.nih.gov/datasets/taxonomy/310915/)
5453
3. the taxonomic clade. `Species` in this example
5554
4. The organism name. `Pangasianodon hypophthalmus`
56-
5.
55+
5756

5857

5958
- `sequence_taxonomy_report.gz` a Kraken2 style output report
@@ -64,6 +63,14 @@ Example output:
6463
0.8561 9653 9653 species 310915 Pangasianodon hypophthalmus
6564
```
6665

66+
The columns are
67+
1. The fraction of reads that map at this clade
68+
2. The number of reads that map at the clade, or lower
69+
3. The number of reads that map at exactly this clade
70+
4. The taxonomic level
71+
5. The taxonomy ID. For example, this is [310915](https://www.ncbi.nlm.nih.gov/datasets/taxonomy/310915/)
72+
6. The taxonomy name
73+
6774
- `sequence_taxonomy_tophit_aln.gz` the `blast m8` format
6875

6976
Example output:

0 commit comments

Comments
 (0)