.. _cgmlst_add:
.. toctree::
:glob:
===========================
Add strains to the database
===========================
Next, you need to iteratively add your strains to the database. You
can use a draft genome (we recommend using `Spades
`_ for assembly).
You can also add a reference genome for comparison.
.. note::
You need to add each strain one by one to the database. You can
specify strain name using **-s** option.
Genome data
^^^^^^^^^^^
You can add strains using GENOME fasta(.gz) sequence file.
.. code-block:: bash
wgMLST add --help
Usage: wgMLST add [OPTIONS] DATABASE GENOME
Adds a strain GENOME to the wgMLST DATABASE.
Options:
-s, --strain TEXT Name of the strain (default:genome name)
-i, --identity FLOAT Minimum identity to search gene (default=0.95)
-c, --coverage FLOAT Minimum coverage to search gene (default=0.9)
Reads data
^^^^^^^^^^
Alternatively, you can also add strains from raw reads direcly with
single or paired FASTQS(.gz) files.
.. code-block:: bash
wgMLST add2 --help
Usage: wgMLST add2 [OPTIONS] DATABASE [FASTQS]...
Adds a strain from FASTQS(.gz) reads to the wgMLST DATABASE.
Options:
-s, --strain TEXT Name of the strain (default:genome name).
-i, --identity FLOAT Minimum identity to search gene (default=0.95).
-c, --coverage FLOAT Minimum coverage to search gene (default=0.9).
-r, --reads INTEGER Minimum reads coverage to search a gene
(default=10).
.. note::
The defaut identity and coverage treshold are set to 0.9 and can be
modulated with **-i** and **-c** options.
.. warning::
Carefully check that the allele calling has been performed
correctly for each genome. Check the number of genes found for each
strain using the :ref:`strain command `.