> For the complete documentation index, see [llms.txt](https://help.idm.illumina.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://help.idm.illumina.com/dragen-targeted-microbial/dragen-targeted-microbial.md).

# DRAGEN Targeted Microbial App Documentation

![Picture1](https://github.com/illumina-swi/indigo-docs/assets/146757649/19fbcf42-e0aa-4601-a96c-e91a25f4a482)

<mark style="color:red;">**NOTE: This application will be obsoleted on May 31, 2025. If using an Illumina Infectious Disease/Micro Enrichment panel workflow, use DRAGEN Microbial Enrichment Plus. If using IMAP, IMAP-FLU or COVID-seq kit please start using the DRAGEN Microbial Amplicon app instead.**</mark>

## ▶️ DRAGEN Targeted Microbial App Documentation

### <img src="/files/0wE64kKXLIrgcGI2kZlQ" alt="" data-size="original"> Overview

#### Summary <a href="#summary" id="summary"></a>

DRAGEN Targeted Microbial is a software application designed to analyze sequencing data from enrichment and amplicon library preps (both DNA and RNA) on microbiological samples, with an emphasis on viruses. Illumina sequencing reads are processed to remove human-origin sequence, then assembled into consensus sequences that represent a best estimate of the population of viral sequences in each sample. Where appropriate, these consensus sequences are further analyzed by the phylogenetic analysis tools NextClade and/or Pangolin to provide an identification of the clade or lineage of each sequence.

#### Inputs <a href="#inputs" id="inputs"></a>

* Samples / biosamples with FASTQ datasets (see details in library preparation documents)
* A project containing one or more samples / biosamples with FASTQ datasets
  * all samples / biosamples in the selected project will be analyzed

**Supported hybrid-capture enrichment panels**

* [Viral Surveillance Panel (VSP)](https://www.illumina.com/products/by-type/sequencing-kits/library-prep-kits/viral-surveillance-panel.html)
* [Pan-Coronavirus Panel (Pan-Cov)](https://www.illumina.com/products/by-type/sequencing-kits/library-prep-kits/pan-coronavirus-panel.html)
* [Respiratory Virus Oligo Panel (RVOP)](https://www.illumina.com/products/by-type/sequencing-kits/library-prep-kits/respiratory-virus-oligo-panel.html)

**Supported amplicon primer schemes**

* Chikungunya ([Grubaugh lab](https://doi.org/10.1038/nprot.2017.066); [Illumina](https://www.illumina.com/content/dam/illumina/gcs/assembled-assets/marketing-literature/illumina-microbial-amplicon-prep-app-note-m-gl-02215/ilmn-microbial-amplicon-prep-app-note-m-gl-02215.pdf))
* Dengue Serotype 1 ([Grubaugh Lab](https://github.com/grubaughlab/DENV-genomics); [Illumina](https://www.illumina.com/content/dam/illumina/gcs/assembled-assets/marketing-literature/illumina-microbial-amplicon-prep-app-note-m-gl-02215/ilmn-microbial-amplicon-prep-app-note-m-gl-02215.pdf))
* Influenza [A](https://doi.org/10.1007/978-1-61779-621-0_11)/[B](https://doi.org/10.1128/jcm.03265-13) (Zhou et al)
* MPXV ([Grubaugh Lab](https://dx.doi.org/10.17504/protocols.io.5qpvob1nbl4o/v1))
* Respiratory Syncytial Virus (RSV) ([WCCRRI](https://doi.org/10.1016/j.jcv.2023.105423); [CDC](https://doi.org/10.1016%2Fj.jviromet.2019.113676))
* SARS-CoV-2 ([ARTIC v3, v4, v4.1, v5.3.2](https://github.com/artic-network/artic-ncov2019/tree/master/primer_schemes/nCoV-2019))
* Zika ([Grubaugh lab](https://doi.org/10.1038/nprot.2017.066))

**Custom genomes and panels**

Supports uploading FASTA files to use as reference genomes for both enrichment and amplicon panels, as well as custom primer definitions for amplicon panels. Multiplexed amplicon panels targeting multiple organisms in the same reaction are supported.

#### Analysis Pipeline <a href="#analysis-pipeline" id="analysis-pipeline"></a>

1. Reads are trimmed and filtered using [Trimmomatic](http://www.usadellab.org/cms/?page=trimmomatic) with the following parameters: `LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36`.
2. Human reads are removed with a modified version of the [SRA Human Read Scrubber tool](https://github.com/ncbi/sra-human-scrubber).
3. [MEGAHIT](https://github.com/voutcn/megahit) is used to perform de novo assembly on the scrubbed reads.
4. [CD-HIT-EST](http://weizhong-cluster.ucsd.edu/cd-hit/) is used to cluster similar contigs to reduce redundancy.
5. The resulting contigs are mapped to a set of reference genomes using [minimap2](https://github.com/lh3/minimap2).
6. The best matching reference for each contig is selected for short read mapping.
7. The scrubbed reads from step 2 are aligned to the selected reference genomes using DRAGEN v4.2.4
8. Sequence variants are called from the alignments using DRAGEN Somatic Small Variant Caller v4.2.4 and applied to the corresponding reference sequences to create consensus sequences.
9. If applicable, Pangolin and/or Nextclade are run on the consensus sequences.

#### Outputs <a href="#outputs" id="outputs"></a>

The software generates consensus sequences representing a best estimate of the population of targeted sequences in the sample. NextClade and Pangolin analysis are run on select organisms. See this page for details:

{% content-ref url="/pages/e5FXLYRolP5ftmugbgq1" %}
[Output files](/dragen-targeted-microbial/dragen-targeted-microbial/output-files.md)
{% endcontent-ref %}

### Important Notes <a href="#important-notes" id="important-notes"></a>

* The sequences are labeled according to the best match in the panel references. These references are not exhaustive and the labels should not be taken as definitive for strain-typing. If strain typing is needed, the built-in NextClade and/or Pangolin tools can be used for supported organisms. Alternatively, a BLAST or similar search of nucleotide databases may provide a more detailed match.
* Because of sequence homology, it is possible that organisms with very few reads will result in the generation of a sequence not present (false positive). Although the de novo assembly step of this software largely mitigates such instances, sequences with very low horizontal coverage (< 5%) should be treated with caution and are highlighted as "low confidence" in the reports.

#### Currently supported platforms

* BaseSpace Sequence Hub (native BaseSpace app)


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://help.idm.illumina.com/dragen-targeted-microbial/dragen-targeted-microbial.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
