Supported Applications

Search software:
Filter by keywords:
- available keywords
  - All
  - High-Throughput Sequencing
  - Genomics
  - Proteomics
  - Visualization
  - Other
  - Alternative Splicing
  - Association Mapping
  - ATAC-Seq
  - Bioconductor Packages
  - Bioimaging
  - Bioinformatics
  - Bioinformatics Infrastructure
  - bisulfite-Seq
  - Cell Tracking
  - ChIP-Sequencing
  - CLIP-Seq Analysis
  - Comparative Genomics
  - Complex Trait Prediction
  - Computational Chemistry
  - CRISPR/Cas9 Screen Analysis
  - De Novo Sequencing Analysis
  - De Novo Transcriptome Assembly
  - DNA Sequence Data Compression
  - DNA-Sequencing
  - Electron Microscopy
  - Epigenomics
  - Figure Creation
  - Genome Annotation
  - Genome Assembly
  - Genome Visualization
  - Genomics
  - Genotype-Phenotype Analysis
  - Germline SNP Detection
  - GWAS Analysis
  - Hi-C
  - HiChIP
  - High Performance Computing
  - High-throughput sequencing
  - Peak Calling
  - Homology-Based Taxonomic Classification
  - Image-Analysis Libraries
  - Machine Learning
  - Metabolic Network Analysis
  - Metagenomic Sequencing Analysis
  - Motif Comparison
  - Motif Discovery
  - MRI Analysis
  - Multiple Nucleotide Sequence Alignment
  - Multiple Structure Alignment
  - Nanopore
  - Neuroimaging
  - Normalization/Differential Expression
  - Nucleic Acids
  - Nucleotide Sequence Homology Search
  - Other
  - PacBio Sequencing
  - PCR
  - Phylogenetic Inference
  - Phylogenomics
  - Pipelines
  - PLAC-seq
  - Programming Tools
  - Protein Database Search
  - Protein-Ligand Docking
  - Protein-Protein Interaction Prediction
  - Protein-protein sequence alignment
  - Protein Structure Analysis
  - Proteomics
  - Python Module
  - quantitative trait loci (QTLs) mapping/discovery
  - RADSeq
  - Read Alignment
  - Read Quality Control
  - RNA-Seq Analysis
  - RNA-Sequencing
  - scDNA-Seq Analysis
  - scRNA-Seq Analysis
  - Sequence Alignment Analysis
  - Sequence Alignment Visualization
  - Sequence Logo Generation
  - Single-Cell Assemblers
  - Spliced Read Alignment
  - Statistical Analysis
  - Structural Biology
  - Structural Variant Analysis
  - Structure Visualization & Analysis
  - Target Gene Detection
  - taxonomy
  - Tertiary Structure Prediction
  - Transcriptomics
  - Transcript Quantification
  - Variant Aggregation/Summarization
  - Variant Analysis
  - Virus Sequence Detection
  - Visualization
  - WGS Analysis
  - Workflow Management System
Filter by OS:
- available OS
  - macOS
  - Linux
Filter by member or license type:
- available types
  - Member type
    - Academic
    - Beamline
    - Government
    - Industry
    - Non-profit
    - All
  - License type
    - Commercial
    - Open
    - Registration required
    - All

AppCiter will help you create a bibliography of the programs you wish to cite.

Results:

Name	Description	Links
10xbamtofastq	tool for converting 10x BAMs produced by Cell Ranger, Space Ranger, Cell Ranger ATAC, Cell Ranger DNA, and Long Ranger back to FASTQ files that can be used as inputs to re-run analysis. Keywords: High-Throughput Sequencing	Visit Website»
A5 David Coil, Aaron E Darling, Guillaume Jospin	A5-miseq is a pipeline for assembling DNA sequence data generated on the Illumina sequencing platform. A5-miseq can produce high-quality microbial genome assemblies on a laptop computer without any parameter tuning by automating the process of adapter trimming, quality filtering, error correction, contig and scaffold generation and detection of misassemblies. Keywords: Genome Assembly High-throughput sequencing Metagenomic Sequencing Analysis Genomics	Visit Website» Web Forum»
abeona	a simple transcriptome assembler based on kallisto and Cortex graphs. Keywords: Transcriptomics High-Throughput Sequencing	Visit Website»
abismal	abismal is a fast and memory-efficient mapper for short bisulfite sequencing reads Keywords: bisulfite-Seq High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
abPOA	an extended version of Partial Order Alignment (POA) that performs adaptive banded dynamic programming (DP) with an SIMD implementation. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
ABRicate	mass screening of contigs for antibiotic resistance genes. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
AbundanceBin Yuzhen Ye, Yu-Wei Wu	an abundance-based tool for binning metagenomic sequences. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
AFNI	(Analysis of Functional NeuroImages) is a set of C programs for processing, analyzing, and displaying functional MRI (FMRI) data - a technique for mapping human brain activity. Keywords: Image-Analysis Libraries Neuroimaging Visualization	Visit Website» Documentation» Web Forum»
AGAT	(Another Gff Analysis Toolkit) a suite of tools to handle gene annotations in any GTF/GFF format. Keywords: High-throughput sequencing	Visit Website»
Assembled Genomes Compressor	Assembled Genomes Compressor (AGC) is a tool designed to compress collections of de-novo assembled genomes. It can be used for various types of datasets: short genomes (viruses) as well as long (humans). Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
AGFusion	a python package for annotating gene fusions from the human or mouse genomes. Keywords: Genome Annotation High-throughput sequencing	Visit Website»
AKT Rudy Arthur, Jared O’Connell	(Ancestry and Kinship Toolkit) a statistical genetics tool for analysing large cohorts of whole-genome sequenced samples. It provides a handful of useful statistical genetics routines using the htslib API for input/output. This means it can seamlessly read BCF/VCF files and play nicely with bcftools. Keywords: Genomics Statistical Analysis Genomics High-Throughput Sequencing	Visit Website» Documentation»
alevin‑fry	is a tool for the efficient processing of single-cell data based on RAD files produced by alevin. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Alfred	an efficient and versatile command-line application that computes multi-sample quality control metrics in a read-group aware manner. Keywords: High-throughput sequencing Read Quality Control	Visit Website» Documentation»
AlignStats	AlignStats produces various alignment, whole genome coverage, and capture coverage metrics for sequence alignment files in SAM, BAM, and CRAM format. Keywords: High-throughput sequencing	Visit Website»
allo	Multi-mapped read rescue strategy for gene regulatory analyses Keywords: High-throughput sequencing	Visit Website»
AlphaFold Tom Ward, Augustin Zidek, Saran Tunyasuvunakool, John Jumper, Demis Hassabis	an implementation of the inference pipeline of AlphaFold using a completely new model that was entered in CASP14. Keywords: Machine Learning Protein-Protein Interaction Prediction Protein Structure Analysis Visualization	Visit Website» Documentation» Web Forum» Webinars
AlphaPept	a modern and open framework for MS-based proteomics. Keywords: Proteomics Proteomics	Visit Website»
AmberTools David Case, Thomas E Cheatham III, Kenneth M Merz Jr	a suite of programs that allows users to carry out molecular dynamics simulations, particularly on biomolecules. The suite can be used to carry out complete (non-periodic) molecular dynamics simulations (using NAB) with either explicit water or generalized Born solvent models. The independently developed packages work well by themselves, and with Amber itself. Keywords: Computational Chemistry Other	Visit Website» Documentation» Web Forum» Mailing List»
AMPS Geoff Barton	(Alignment of Multiple Protein Sequences) a suite of programs for protein multiple sequence alignment, pairwise alignment, statistical analysis and flexible pattern matching. Keywords: Comparative Genomics Genomics Sequence Alignment Visualization Proteomics	Visit Website» Documentation»
AMPtk	AMPtk: Amplicon tool kit for processing high throughput amplicon sequencing data. Keywords: High-throughput sequencing	Visit Website»
AnchorWave	AnchorWave (Anchored Wavefront Alignment) identifies collinear regions via conserved anchors (full-length CDS and full-length exon have been implemented currently) and breaks collinear regions into shorter fragments, i.e., anchor and inter-anchor intervals. Keywords: Genome Annotation Genomics High-throughput sequencing	Visit Website»
andi	estimates the evolutionary distance between closely related genomes. These distances can be used to rapidly infer phylogenies for big sets of genomes. Because andi does not compute full alignments, it is so efficient that it scales even up to thousands of bacterial genomes. Keywords: Genomics Genomics	Visit Website» Documentation»
antiSMASH	antiSMASH (antibiotics and Secondary Metabolite Analysis SHell) allows the rapid genome-wide identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genomes. It integrates and cross-links with a large number of in silico secondary metabolite analysis tools that have been published earlier. Keywords: Metabolic Network Analysis Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation» Mailing List»
ANTs	(Advanced Normalization Tools) extracts information from complex datasets that include imaging (Word Cloud). Paired with ANTsR (answer), ANTs is useful for managing, interpreting and visualizing multidimensional data. ANTs is popularly considered a state-of-the-art medical image registration and segmentation toolkit. ANTsR is an emerging tool supporting standardized multimodality image analysis. ANTs depends on the Insight ToolKit (ITK), a widely used medical image processing library to … Keywords: Image-Analysis Libraries Other	Visit Website» Documentation» Web Forum»
anvi'o A. Murat Eren	an open-source, community-driven analysis and visualization platform for ‘omics data. Its interactive interface facilitates the management of metagenomic contigs and associated data for automatic or human-guided identification of genome bins and their curation. Keywords: High-throughput sequencing Metagenomic Sequencing Analysis	Visit Website» Documentation» Web Forum»
ARAGORN	ARAGORN identifies tRNA and tmRNA genes. The program employs heuristic algorithms to predict tRNA secondary structure, based on homology with recognized tRNA consensus sequences and ability to form a base‐paired cloverleaf. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
arcasHLA	high-resolution HLA typing from RNA seq. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
arcs	Scaffolding genome sequence assemblies using linked or long reads. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
ARIBA	(Antibiotic Resistance Identification By Assembly) a tool that identifies antibiotic resistance genes by running local assemblies. It can also be used for MLST calling. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
ASCIIGenome	a command-line genome browser running from a terminal window and solely based on ASCII characters. Keywords: Genome Visualization Genomics	Visit Website» Documentation» Web Forum»
assembly‑stats	Get assembly statistics from FASTA and FASTQ files. Keywords: Genome Assembly High-throughput sequencing	Visit Website» Documentation»
atropos	trim adapters from high-throughput sequencing reads. Keywords: High-throughput sequencing	Visit Website» Documentation»
AUGUSTUS	a gene prediction program for eukaryotes that can be used as an ab initio program, which means it bases its prediction purely on the sequence. Keywords: Genome Annotation Genomics	Visit Website»
AWS CLI	(Amazon Web Services Command Line Interface) a command line interface tool to manage multiple Amazon Web Services and automate them through scripts. Keywords: Other High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Bakta	rapid and standardized annotation of bacterial genomes & plasmids. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
Balrog	A universal protein model for prokaryotic gene prediction Keywords: Genome Annotation Genomics High-Throughput Sequencing	Visit Website»
bam2fastx	bam2fastx provides conversion of PacBio BAM files into gzipped fasta and fastq files, including splitting of barcoded data. Keywords: High-throughput sequencing Other	Visit Website»
bam‑readcount	bam-readcount generates metrics at single nucleotide positions. Keywords: High-throughput sequencing	Visit Website»
BAMscale	BAMscale is a one-step tool for either 1) quantifying and normalizing the coverage of peaks or 2) generating scaled BigWig files for easy visualization of commonly used DNA-seq capture based methods. Keywords: High-throughput sequencing	Visit Website» Documentation»
BamToCov	Extract coverage information from BAM files, supporting stranded and physical coverage and streams. Keywords: High-throughput sequencing	Visit Website»
bamtofastq	Tool for converting 10x BAMs produced by Cell Ranger Keywords: High-throughput sequencing	Visit Website»
bamtools Derek Barnett, Erik Garrison, Gabor T Marth	a fast, flexible C++ API & toolkit for reading, writing, and manipulating BAM files. Keywords: ChIP-Sequencing High-throughput sequencing WGS Analysis	Visit Website» Documentation» Web Forum» Mailing List»
bamUtil Mary Kate Wing	a repository that contains several programs that perform operations on SAM/BAM files. All of these programs are built into a single executable, bam. Keywords: High-throughput sequencing	Visit Website» Web Forum»
Barrnap	(BAsic Rapid Ribosomal RNA Predictor) predicts the location of ribosomal RNA genes in genomes (bacteria, archaea, metazoan mitochondria and eukaryotes). Keywords: Genomics Genomics	Visit Website»
bazam	is a tool to extract paired reads in FASTQ format from coordinate sorted BAM files. Bazam is a smarter way to realign reads from one genome to another. If you've tried to use Picard SAMtoFASTQ or samtools bam2fq before and ended up unsatisfied with complicated, long running inefficient pipelines, bazam might be what you wanted. Bazam will output FASTQ in a form that can … Keywords: High-throughput sequencing	Visit Website»
BBTools Brian Bushnell, JGI BBTools Team	a suite of fast, multithreaded bioinformatics tools designed for analysis of DNA and RNA sequence data. BBTools can handle common sequencing file formats such as fastq, fasta, sam, scarf, fasta+qual, compressed or raw, with autodetection of quality encoding and interleaving. Keywords: Genomics High-throughput sequencing	Visit Website» Documentation» Web Forum»
bcalm	is a bioinformatics tool for constructing the compacted de Bruijn graph from sequencing data. Keywords: High-throughput sequencing	Visit Website»
bcbio‑nextgen Brad Chapman	provides best-practice pipelines for automated analysis of high throughput sequencing data with the goal of being quantifiable, analyzable, scalable and reproducible. The development process is fully open and sustained by contributors from multiple institutions. Bioinformaticians, biologists and the general public should be able to run these tools on inputs ranging from research materials to clinical samples to personal genomes. Keywords: Genomics High-throughput sequencing RNA-Sequencing	Visit Website» Documentation» Web Forum»
bcbio‑prioritize	Prioritize small variants, structural variants and coverage based on biological inputs. The goal is to use pre-existing knowledge of relevant genes, domains and pathways involved with a disease to extract the most interesting signal from a set of high quality small or structural variant calls. Given information on coverage, it will be able to identify poorly covered regions in potential genes of interest. Keywords: Genomics Variant Analysis Genomics	Visit Website»
bcbio‑variation	bcbio-variation is a toolkit to analyze genome variation data, built on top of the Genome Analysis Toolkit (GATK) with Clojure. It supports scoring for the Archon Genomics X PRIZE competition and is also a general framework for variant file comparison. It enables validation of variants and exploration of algorithm differences between calling methods by automating the process involved with comparing two sets of variants. … Keywords: Genomics Variant Analysis Genomics	Visit Website»
bcbio‑variation‑recall	Parallel merging, squaring off and ensemble calling for genomic variants. Provide a general framework meant to combine multiple variant calls, either from single individuals, batched family calls, or multiple approaches on the same sample. Splits inputs based on shared genomic regions without variants, allowing independent processing of smaller regions with variant calls. Keywords: Variant Analysis Genomics	Visit Website»
BCFtools Heng Li, John Marshall, Petr Danecek, Shane McCarthy	a set of utilities that manipulate variant calls in the Variant Call Format (VCF) and its binary counterpart BCF. All commands work transparently with both VCFs and BCFs, both uncompressed and BGZF-compressed. Keywords: High-throughput sequencing WGS Analysis	Visit Website» Documentation» Web Forum» Mailing List»
BEAGLE	is a software package for phasing genotypes and imputing ungenotyped markers. Keywords: Genotype-Phenotype Analysis Genomics	Visit Website»
BEAST	is a cross-platform program for Bayesian analysis of molecular sequences using MCMC. Keywords: Sequence Alignment Analysis Genomics	Visit Website»
BEDOPS	BEDOPS is an open-source command-line toolkit that performs highly efficient and scalable Boolean and other set operations, statistical calculations, archiving, conversion and other management of genomic data of arbitrary scale. Tasks can be easily split by chromosome for distributing whole-genome analyses across a computational cluster. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum» Mailing List»
bedtools Aaron R Quinlan	a swiss-army knife of tools for a wide-range of genomics analysis tasks. The most widely-used tools enable genome arithmetic. Bedtools allows one to intersect, merge, count, complement, and shuffle genomic intervals from multiple files in widely-used genomic file formats such as BAM, BED, GFF, VCF. While each individual tool is designed to do a relatively simple task (e.g., intersect two interval files), sophisticated analyses … Keywords: High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum» Mailing List»
BETA Clifford A Meyer, X Shirley Liu, Yong Zhang	(Binding and Expression Target Analysis) a software package that integrates ChIP-seq of transcription factors or chromatin regulators with differential gene expression data to infer direct target genes. Keywords: ChIP-Sequencing High-throughput sequencing Target Gene Detection	Visit Website» Documentation» Web Forum» Mailing List»
bfc	a standalone high-performance tool for correcting sequencing errors from Illumina sequencing data. Keywords: High-throughput sequencing	Visit Website»
BGT	is a compact file format for efficiently storing and querying whole-genome genotypes of tens to hundreds of thousands of samples. It can be considered as an alternative to genotype-only BCFv2. BGT is more compact in size, more efficient to process, and more flexible on query. Keywords: Genotype-Phenotype Analysis Variant Analysis High-Throughput Sequencing	Visit Website»
bids‑validator	Brain Imaging Data Structure (BIDS) validator. Keywords: MRI Analysis Visualization	Visit Website»
BIGpre Tongwu Zhang	A quality assessment package for next-generation sequencing data. BIGpre contains all the functions of other quality assessment software, such as the correlation between forward and reverse reads, read GC-content distribution, and base Ns quality. More importantly, BIGpre incorporates associated programs to detect and remove duplicate reads after taking sequencing errors into account and trimming low quality reads from raw data as well. Keywords: Read Quality Control High-Throughput Sequencing	Visit Website»
bioawk Aaron R Quinlan, Heng Li	an extension to Brian Kernighan's awk, with added support for several common biological data formats, including optionally gzip'ed BED, GFF, SAM, VCF, FASTA/Q, and TAB-delimited formats with column names along with new built-in functions and a command line option to use TAB as the input/output delimiter. When the new functionality is not used, bioawk should behave exactly like the original BWK awk. Keywords: High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum»
biobambam2 Andrew Whitwham, David K Jackson, German Tischler	tools for early stage NGS alignment file processing including fast sorting and duplicate marking. Keywords: High-throughput sequencing	Visit Website»
Bioconductor	tools to analyze and comprehend high-throughput genomic data. Keywords: GWAS Analysis High-throughput sequencing Other	Visit Website» Documentation» Web Forum»
BioGrids Installer	Installation Client for the BioGrids software collection. Keywords: Other	Visit Website»
BioHansel	subtype microbial whole-genome sequencing (WGS) data using SNV targeting k-mer subtyping schemes. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
bioinfokit	The bioinfokit toolkit aims to provide various easy-to-use functionalities to analyze, visualize, and interpret the biological data generated from genome-scale omics experiments. Keywords: High-Throughput Sequencing	Visit Website» Documentation»
BioPhi	an open-source antibody design platform. It features methods for automated antibody humanization (Sapiens), humanness evaluation (OASis) and an interface for computer-assisted antibody sequence design.	Visit Website»
BISCUIT	a utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data. It was written to perform alignment, DNA methylation and mutation calling, and allele specific methylation from bisulfite sequencing data. Keywords: DNA-Sequencing High-throughput sequencing	Visit Website» Documentation» Web Forum»
Bismark Felix Krueger	a set of tools for the time-efficient analysis of Bisulfite-Seq (BS-Seq) data. Bismark performs alignments of bisulfite-treated reads to a reference genome and cytosine methylation calls at the same time. Keywords: bisulfite-Seq Genomics	Visit Website» Documentation» Web Forum»
BLASR	(Basic Local Alignment with Successive Refinement) maps Single Molecule Sequencing (SMS) reads that are thousands of bases long, with divergence between the read and genome dominated by insertion and deletion error. Keywords: High-throughput sequencing PacBio Sequencing	Visit Website» Documentation»
BLAST	(Basic Local Alignment Search Tool) finds regions of similarity between biological sequences. Keywords: ChIP-Sequencing Comparative Genomics Genomics Nucleotide Sequence Homology Search RNA-Sequencing High-Throughput Sequencing Other	Visit Website» Documentation»
BLAST+ Christiam Camacho, Tao Tao, Tom Madden	a suite of BLAST (Basic Local Alignment Search Tool) tools that utilizes the NCBI C++ Toolkit with a number of performance and feature improvements over the legacy BLAST applications. Keywords: ChIP-Sequencing High-throughput sequencing Homology-Based Taxonomic Classification RNA-Sequencing Other	Visit Website» Documentation» Web Forum» Mailing List»
Blender	a 3D creation suite that supports the entirety of the 3D pipeline—modeling, rigging, animation, simulation, rendering, compositing and motion tracking.	Visit Website» Documentation» Web Forum»
Bloocoo	is a k-mer spectrum-based read error corrector, designed to correct large datasets with a very low memory footprint. It uses the disk streaming k-mer counting algorithm contained in the GATB library, and inserts solid k-mers in a bloom-filter. The correction procedure is similar to the Musket multistage approach. Bloocoo yields similar results while requiring far less memory: as an example, it can correct whole … Keywords: High-throughput sequencing Read Quality Control	Visit Website» Documentation»
bmtagger	aka Best Match Tagger is for removing human reads from metagenomics datasets Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
bmtool	bmtool is part of BMTagger aka Best Match Tagger, for removing human reads from metagenomics datasets. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
Boltz‑1 Itamar Chinn, Mateo Reveiz, Jacob Silterra, Tally Portnoi, Regina Barzilay, Gabriele Corso, Tommi Jaakkola, Saro Passaro, Jeremy Wohlwend	an open-source model which predicts the 3D structure of proteins, RNA, DNA and small molecules; it handles modified residues, covalent ligands and glycans, and can condition the generation on pocket residues. Keywords: Computational Chemistry Protein Structure Analysis Structure Visualization & Analysis	Visit Website» Documentation» Web Forum»
Boto 3	the Amazon Web Services (AWS) SDK for Python, which allows Python developers to write software that makes use of Amazon services like S3 and EC2. Boto provides an easy to use, object-oriented API as well as low-level direct service access. Keywords: High Performance Computing Other	Visit Website» Documentation» Web Forum»
Bowtie Ben Langmead, Cole Trapnell	an ultrafast, memory-efficient short read aligner for short DNA sequences (reads) from next-gen sequencers. Keywords: ChIP-Sequencing DNA-Sequencing High-throughput sequencing Read Alignment RNA-Sequencing WGS Analysis	Visit Website» Documentation» Web Forum» Mailing List»
Bowtie 2 Ben Langmead, Steven Salzberg	an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences. Keywords: ChIP-Sequencing DNA-Sequencing High-throughput sequencing Read Alignment RNA-Sequencing WGS Analysis	Visit Website» Documentation» Web Forum» Mailing List»
bpp	implements a versatile high-performance version of the BPP software Keywords: Phylogenetic Inference High-Throughput Sequencing	Visit Website» Documentation»
Bracken	(Bayesian Reestimation of Abundance with KrakEN) is a highly accurate statistical method that computes the abundance of species in DNA sequences from a metagenomics sample. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
BreakDancer Ken Chen	a Perl/Cpp package that provides genome-wide detection of structural variants from next generation paired-end sequencing reads. It includes two complementary programs. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Web Forum»
breseq Daniel Deatherage, Dave Knoester, Jeffrey Barrick	a computational pipeline for finding mutations relative to a reference sequence in short-read DNA re-sequencing data for microbial sized genomes. It reports single-nucleotide mutations, point insertions and deletions, large deletions, and new junctions supported by mosaic reads. Keywords: High-throughput sequencing WGS Analysis Other	Visit Website» Documentation» Web Forum»
BUStools	bustools is a program for manipulating BUS files for single cell RNA-Seq datasets. It can be used to error correct barcodes, collapse UMIs, produce gene count or transcript compatibility count matrices, and is useful for many other tasks. See the kallisto | bustools website for examples and instructions on how to use bustools as part of a single-cell RNA-seq workflow. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
BVATools Louis Letourneau, Mathieu Bourgey	Bam and Variant Analysis Tools Keywords: Genomics High-throughput sequencing Variant Aggregation/Summarization Genomics	Visit Website»
BWA Heng Li, Richard Durbin	(Burrows-Wheeler Aligner) a software package for mapping low-divergent sequences against a large reference genome, such as the human genome. It consists of three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. Keywords: ChIP-Sequencing DNA-Sequencing High-throughput sequencing Read Alignment WGS Analysis	Visit Website» Documentation» Mailing List»
Convert3D	is a command-line tool for converting 3D images between common file formats. Keywords: Image-Analysis Libraries Visualization	Visit Website» Documentation»
C3POa	(Concatemeric Consensus Caller with Partial Order alignments) is a computational pipeline for calling consensi on R2C2 nanopore data. Keywords: Nanopore High-Throughput Sequencing	Visit Website»
Cactus	a reference-free whole-genome multiple alignment program based upon the notion of Cactus graphs. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
calib	clusters paired-end reads using their barcodes and sequences. Keywords: High-throughput sequencing	Visit Website»
Canu	Canu is a fork of the Celera Assembler designed for high-noise single-molecule sequencing. Canu specializes in assembling PacBio or Oxford Nanopore sequences. Canu operates in three phases: correction, trimming and assembly. The correction phase will improve the accuracy of bases in reads. Keywords: Genome Assembly PacBio Sequencing High-Throughput Sequencing	Visit Website»
Canvas	a tool for calling copy number variants (CNVs) from human DNA sequencing data. Keywords: Variant Analysis Genomics	Visit Website» Documentation»
CapCruncher	is designed to process Capture-C, Tri-C and Tiled-C data. Unlike other pipelines that are designed to process Hi-C or Capture-HiC data, the filtering steps in CapCruncher are specifically optimized for these datasets. Keywords: Hi-C High-throughput sequencing	Visit Website» Documentation» Web Forum»
Captus	Assembly of Phylogenomic Datasets from High-Throughput Sequencing data Keywords: Phylogenomics High-Throughput Sequencing	Visit Website» Documentation»
cartopy	is a Python package designed to make drawing maps for data analysis and visualisation easy. Keywords: Other	Visit Website»
cas‑offinder	Cas-OFFinder is an OpenCL-based, ultrafast and versatile program that searches for potential off-target sites of CRISPR/Cas-derived RNA-guided endonucleases (RGEN). Keywords: CRISPR/Cas9 Screen Analysis High-Throughput Sequencing	Visit Website»
CAVIAR	CAVIAR (CAusal Variants Identification in Associated Regions): a statistical framework that quantifies the probability of each variant to be causal while allowing an arbitrary number of causal variants. Keywords: Statistical Analysis Variant Analysis Genomics High-Throughput Sequencing	Visit Website» Documentation» Mailing List»
cd‑hit	clusters and compares protein or nucleotide sequences. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
CEFCIG	CEFCIG (Computational Epigenetic Framework for Cell Identity Gene Discovery) Keywords: Epigenomics Genomics	Visit Website»
cell2location	Comprehensive mapping of tissue cell architecture via integrated single cell and spatial transcriptomics (cell2location model) Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
CellBender	a software package for eliminating technical artifacts from high-throughput single-cell RNA sequencing (scRNA-seq) data. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
CellPhoneDB	is a publicly available repository of curated receptors, ligands and their interactions. Keywords: Other	Visit Website» Documentation»
Cellpose	Cellpose-SAM: cell and nucleus segmentation with superhuman generalization. It can be optimized for your own data, applied in 3D, works on images with shot noise, (an)isotropic blur, undersampling, contrast inversions, regardless of channel order and object sizes. Keywords: Visualization Visualization	Visit Website» Documentation» Web Forum»
CellProfiler Anne E Carpenter, Lee Kamentsky, Mark-Anthony Bray, Thouis Raymond Jones	a cell image analysis software designed to enable biologists without training in computer vision or programming to quantitatively measure phenotypes from thousands of images automatically. Keywords: Bioimaging Cell Tracking Other	Visit Website» Documentation» Web Forum»
Cell Ranger	a set of analysis pipelines that process Chromium single-cell RNA-seq output to align reads, generate feature-barcode matrices and perform clustering and gene expression analysis. Keywords: scRNA-Seq Analysis Genomics	Visit Website» Documentation»
cellranger‑arc	The set of analysis pipelines in this suite perform sample demultiplexing, barcode processing, identification of open chromatin regions, and simultaneous counting of transcripts and peak accessibility in single cells. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Cell Ranger ATAC	a set of analysis pipelines that perform identification of open chromatin regions, motif annotation, and differential accessibility analysis for Single Cell ATAC data. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
CellRank	CellRank is a modular framework to study cellular dynamics based on Markov state modeling of multi-view single-cell data. CellRank scales to large cell numbers, is fully compatible with the scverse ecosystem, and easy to use. In the backend, it is powered by pyGPCCA (Reuter et al. (2018)). Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation» Mailing List»
cellsnp‑lite	Efficient genotyping of bi-allelic SNPs in single cells. Keywords: scRNA-Seq Analysis Genomics	Visit Website»
cellxgene	an interactive explorer for single-cell transcriptomics data Keywords: scDNA-Seq Analysis scRNA-Seq Analysis Genomics	Visit Website»
Centrifuge	is a very rapid and memory-efficient system for the classification of DNA sequences from microbial samples, with better sensitivity than and comparable accuracy to other leading systems. The system uses a novel indexing scheme based on the Burrows-Wheeler transform (BWT) and the Ferragina-Manzini (FM) index, optimized specifically for the metagenomic classification problem. Centrifuge requires a relatively small index (e.g., 4.3 GB for ~4,100 bacterial … Keywords: Metagenomic Sequencing Analysis Genomics High-Throughput Sequencing	Visit Website» Documentation»
chewBBACA	A complete suite for gene-by-gene schema creation and strain identification. Keywords: Genome Annotation Genomics	Visit Website»
ChIPs	ChIPs is a tool for simulating ChIP-sequencing experiments. Keywords: ChIP-Sequencing High-Throughput Sequencing	Visit Website»
CHISEL	Copy-number Haplotype Inference in Single-cell by Evolutionary Links. CHISEL is an algorithm to infer allele- and haplotype-specific copy numbers in individual cells from low-coverage single-cell DNA sequencing data (e.g., those generated by Direct Library Preparation+ (DLP+), 10x Genomics CNV Solution, DOP-PCR, etc.). Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Mailing List»
chopper	Rust implementation of NanoFilt+NanoLyse, both originally written in Python. This tool, intended for long read sequencing such as PacBio or ONT, filters and trims a fastq file. Keywords: Read Quality Control High-Throughput Sequencing	Visit Website»
Chromap	is an ultrafast method for aligning and preprocessing high throughput chromatin profiles. Keywords: High-throughput sequencing Genomics	Visit Website»
CIRCexplorer2	a comprehensive and integrative circular RNA analysis toolset. Keywords: Structure Visualization & Analysis Visualization	Visit Website»
Circlator	Circlator is a tool to circularize genome assemblies. The input is a genome assembly in FASTA format and corrected PacBio or nanopore reads in FASTA or FASTQ format. Circlator will attempt to identify each circular sequence and output a linearised version of it. It does this by assembling all reads that map to contig ends and comparing the resulting contigs with the input assembly. Keywords: Genome Assembly PacBio Sequencing High-Throughput Sequencing	Visit Website» Documentation»
Circos	a software package for visualizing data and information. It visualizes data in a circular layout. Keywords: Visualization	Visit Website»
CITE‑seq‑Count	count antibody TAGS from a CITE-seq and/or cell hashing experiment. Keywords: High-throughput sequencing	Visit Website» Documentation»
Clair3	a tool for symphonizing pileup and full-alignment for high-performance long-read variant calling Keywords: Structural Variant Analysis Variant Analysis High-Throughput Sequencing	Visit Website»
CLARK	fast, accurate and versatile k-mer based classification system. Keywords: High-throughput sequencing	Visit Website»
Clustal Andreas Wilm, David Dineen, Des Higgins, Fabian Sievers	a general purpose multiple sequence alignment program for DNA or proteins. Keywords:	Visit Website» Documentation»
clustalo	is the latest version of Clustal: a multiple sequence alignment program for DNA or proteins. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
Clustal Omega Andreas Wilm, David Dineen, Des Higgins, Fabian Sievers	a multiple sequence alignment program that uses seeded guide trees and HMM profile-profile techniques to generate alignments between three or more sequences. Keywords: Other	Visit Website» Documentation»
CNVkit	a command-line toolkit and Python library for detecting copy number variants and alterations genome-wide from high-throughput sequencing. Keywords: Genomics Genomics	Visit Website» Documentation» Web Forum»
code‑server Ammar Bandukwala, Kyle Carberry	a tool that lets you run VS Code on any machine anywhere and access it in the browser. Keywords: Programming Tools Other	Visit Website» Documentation» Web Forum»
ColabFold Sergey Ovchinnikov, Milot Mirdita, Martin Steinegger	an easy-to-use Notebook based environment for fast and convenient protein structure predictions. Keywords: Protein Structure Analysis	Visit Website» Documentation» Web Forum»
Comet	Comet MS/MS searches uninterpreted tandem mass spectra of peptides against sequence databases. Keywords: Proteomics Proteomics	Visit Website» Web Forum»
CoNIFER	uses exome sequencing data to find copy number variants (CNVs) and genotype the copy-number of duplicated genes. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation»
ConSurf Guy Yachdav	is a bioinformatics tool designed for estimating the evolutionary conservation of amino and nucleic acid positions in protein, DNA, and RNA molecules. It leverages phylogenetic relationships among homologous sequences to assess conservation, providing insights into structural and functional importance. ConSurf employs advanced computational methods, including empirical Bayesian and maximum likelihood approaches, to deliver accurate evolutionary rate estimations. Keywords: Computational Chemistry	Visit Website» Documentation»
Control‑FREEC	Copy number and genotype annotation from whole genome and whole exome sequencing data. Keywords: Genomics Genomics	Visit Website» Documentation»
cooler	is a support library for a sparse, compressed, binary persistent storage format, also called cooler, used to store genomic interaction data, such as Hi-C contact matrices. Keywords: Hi-C High-Throughput Sequencing	Visit Website» Documentation»
corset	Software for clustering de novo assembled transcripts and counting overlapping reads. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website»
covtobed	a tool to generate BED coverage tracks from BAM files. It reads one (or more) alignment files (sorted BAM) and prints a BED with the coverage. It will join consecutive bases with the same coverage, and can be used to only print a BED file with the regions having a specific coverage range. Keywords: High-throughput sequencing	Visit Website» Documentation»
crass	Crass is designed to identify and reconstruct CRISPR loci from raw metagenomic data without the need for assembly or prior knowledge of CRISPR in the data set. Keywords: CRISPR/Cas9 Screen Analysis High-Throughput Sequencing	Visit Website»
crimson	Converts the outputs of bioinformatics tools to JSON or YAML. Keywords: Bioinformatics Infrastructure High-Throughput Sequencing	Visit Website»
CRISPRCasFinder Christine Pourcel, David Couvin	a tool that enables the easy detection of CRISPRs and cas genes in user-submitted sequence data (sequences up to 50 MB are accepted; for larger inputs, download the standalone program). This is an update of the CRISPRFinder program with improved specificity and an indication of the CRISPR orientation. MacSyFinder is used to identify cas genes and the CRISPR-Cas type and subtype. Keywords: CRISPR/Cas9 Screen Analysis Genomics Genomics Other	Visit Website» Documentation» Web Forum»
Cromwell Jeff Gentry	a Workflow Management System geared towards scientific workflows. Keywords: Workflow Management System Other	Visit Website» Documentation» Web Forum»
CrossMap	a program for genome coordinates conversion between different genome assemblies. Keywords: Genome Assembly Genomics	Visit Website»
Crumble	controllable lossy compression of BAM/CRAM files. Keywords: High-throughput sequencing	Visit Website»
csvtk	a set of tools for manipulation of CSV/TSV files. It is convenient for rapid data investigation and integration into analysis pipelines. Keywords: Other	Visit Website» Documentation»
CUDA	a set of redistributable software libraries that support CUDA applications on Linux. Keywords: Other	Visit Website» Documentation» Web Forum»
Cufflinks Cole Trapnell, Geo Pertea	a reference-guided assembler that assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples. Keywords: High-throughput sequencing RNA-Sequencing Transcript Quantification	Visit Website» Documentation» Web Forum» Mailing List»
Cutadapt Marcel Martin	finds and removes adapter sequences, primers, poly-A tails and other types of unwanted sequence from your high-throughput sequencing reads. Keywords: Read Quality Control High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Cuttlefish	a fast, parallel, and very memory-efficient tool to construct the compacted de Bruijn graph from genome reference(s). Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
Cyberduck	a libre server and cloud storage browser for Mac and Windows with support for FTP, SFTP, WebDAV, Amazon S3, OpenStack Swift, Backblaze B2, Microsoft Azure & OneDrive, Google Drive and Dropbox. Keywords: Other	Visit Website» Documentation»
Cytoscape Barry Demchak, Benno Schwikowski, Keiichiro Ono, Trey Ideker	a software platform for visualizing molecular interaction networks and biological pathways and integrating these networks with annotations, gene expression profiles and other state data. Keywords: Bioinformatics Infrastructure Figure Creation Visualization Visualization	Visit Website» Documentation» Web Forum» Mailing List»
cyvcf2	a cython wrapper around htslib built for fast parsing of Variant Call Format (VCF) files. Keywords: Variant Analysis High-Throughput Sequencing	Visit Website»
daligner	finds all significant local alignments between reads. Keywords: Read Alignment High-Throughput Sequencing	Visit Website»
dammit	simple de novo transcriptome annotator Keywords: Genome Annotation High-throughput sequencing Transcriptomics	Visit Website»
DANPOS3	a toolkit for Dynamic Analysis of Nucleosome and Protein Occupancy by Sequencing. Keywords: High-throughput sequencing Nucleic Acids Genomics	Visit Website» Documentation» Web Forum»
Dask	a flexible library for parallel computing in Python. Keywords: Machine Learning Other	Visit Website» Documentation»
DataLad	provides joint management of analysis code and data. This enables you to comprehensively track the exact state of any analysis inputs that produced your results — across the entire lifetime of a project, and across multiple datasets. Keywords: Other	Visit Website» Documentation»
datamash Assaf Gordon	GNU datamash is a command-line program which performs basic numeric, textual and statistical operations on input textual data files. Keywords: Other	Visit Website»
dcm2niix	is designed to convert neuroimaging data from the DICOM format to the NIfTI format. Keywords:	Visit Website» Documentation»
DCMTK	DCMTK is a collection of libraries and applications implementing large parts of the DICOM standard. Keywords: Image-Analysis Libraries Visualization	Visit Website»
dDocent	dDocent is a simple bash wrapper to QC, assemble, map, and call SNPs from almost any kind of RAD sequencing. If you have a reference already, dDocent can be used to call SNPs from almost any type of NGS data set. Keywords: RADSeq High-Throughput Sequencing	Visit Website» Documentation»
deblur	Deblur is a greedy deconvolution algorithm for amplicon sequencing based on Illumina Miseq/Hiseq error profiles. Keywords: High-throughput sequencing	Visit Website»
DeepLC	Retention time prediction for (modified) peptides using Deep Learning. Keywords: Proteomics Proteomics	Visit Website»
deepTools Thomas Manke, Devon Ryan, Fidel Ramírez	a suite of python tools particularly developed for the efficient analysis of high-throughput sequencing data, such as ChIP-seq, RNA-seq or MNase-seq. Keywords: High-throughput sequencing Other	Visit Website» Documentation» Web Forum»
delly	an integrated structural variant (SV) prediction method that can discover, genotype and visualize deletions, tandem duplications, inversions and translocations at single-nucleotide resolution in short-read and long-read massively parallel sequencing data. Keywords: Structural Variant Analysis High-Throughput Sequencing	Visit Website» Web Forum»
demuxlet	Genetic multiplexing of barcoded single cell RNA-seq. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website»
dEploid	deconvolutes mixed genomes with unknown proportions. Keywords: Genomics Genomics	Visit Website»
DERNA	RNA sequence design for a target protein sequence Keywords: Other	Visit Website»
deSALT	De Bruijn graph-based Spliced Aligner for Long Transcriptome reads Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
DESeq2 Simon Anders, Michael Love, Wolfgang Huber	a Bioconductor software package installed in R 3.2.2 that estimates variance-mean dependence in count data from high-throughput sequencing assays and tests for differential expression based on a model using the negative binomial distribution. Keywords: Bioconductor Packages High-throughput sequencing Normalization/Differential Expression RNA-Sequencing	Visit Website» Documentation»
Dextractor	bax file decoder and data compressor. Keywords: PacBio Sequencing High-Throughput Sequencing	Visit Website»
DFAST	is a flexible and customizable pipeline for prokaryotic genome annotation as well as data submission to the INSDC. Keywords: Genome Annotation Genomics	Visit Website»
DIAMOND Benjamin Buchfink	a high-throughput program for aligning a file of short DNA sequencing reads against a protein reference database such as NR, at 20,000 times the speed of BLASTX, with high sensitivity. Keywords: High-throughput sequencing Metagenomic Sequencing Analysis Protein Database Search	Visit Website» Documentation» Web Forum»
dicey	In-silico PCR and variant primer design Keywords: PCR Genomics High-Throughput Sequencing	Visit Website»
dnaio	is a Python 3.7+ library for very efficient parsing and writing of FASTQ and also FASTA files. Keywords: High-throughput sequencing	Visit Website» Documentation»
dnarrange	Find rearrangements in "long" DNA reads relative to a genome sequence. Keywords: Genome Assembly High-throughput sequencing	Visit Website» Documentation» Web Forum»
DNAscent	DNAscent is software designed to detect the base analogues BrdU and EdU in single molecules of DNA sequenced on the Oxford Nanopore platform Keywords: Nanopore High-Throughput Sequencing	Visit Website» Documentation»
dnmtools	a set of tools for analyzing DNA methylation data from bisulfite sequencing Keywords: DNA-Sequencing High-Throughput Sequencing	Visit Website»
downpore	a suite of tools for use in genome assembly and consensus. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
dREG	Detection of Regulatory DNA Sequences using GRO-seq Data. Keywords: High-throughput sequencing	Visit Website» Documentation» Mailing List»
dRep	a python program for rapidly comparing large numbers of genomes, dRep can also "de-replicate" a genome set by identifying groups of highly similar genomes and choosing the best representative genome for each genome set. Keywords: High-throughput sequencing Metagenomic Sequencing Analysis	Visit Website» Documentation»
DROP	(Detection of RNA Outlier Pipeline) a pipeline to find aberrant gene expression events in RNA sequencing data. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
dsh‑bio	Tools for BED, FASTA, FASTQ, GAF, GFA1/2, GFF3, PAF, SAM, and VCF files Keywords: High-throughput sequencing	Visit Website»
DWGSIM Nils Homer	a whole genome simulator for next-generation sequencing based off of wgsim found in SAMtools, which was written by Heng Li, and forked from DNAA. It was modified to handle ABI SOLiD and Ion Torrent data, as well as various assumptions about aligners and positions of indels. Many new features have been subsequently added. Keywords: Genomics High-throughput sequencing Genomics	Visit Website»
dysgu	dysgu (pronounced duss-key) is a set of command-line tools and a Python API for calling structural variants using paired-end or long-read sequencing data. Keywords: Structural Variant Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Eagle	estimates haplotype phase either within a genotyped cohort or using a phased reference panel. Eagle2 is now the default phasing method used by the Sanger and Michigan imputation servers and uses a new, very fast HMM-based algorithm that improves speed and accuracy over existing methods via two key ideas: a new data structure based on the positional Burrows-Wheeler transform and a rapid search algorithm … Keywords: Genomics Genomics	Visit Website» Documentation»
edgeR Aaron Lun, Davis McCarthy, Yunshun Chen	a Bioconductor software package installed in R 3.2.2 for examining differential expression of replicated count data. Keywords: Bioconductor Packages High-throughput sequencing Normalization/Differential Expression RNA-Sequencing	Visit Website» Documentation»
Entrez Edirect Utilities	provides access to the NCBI's suite of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window. Functions take search terms from command-line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process. Keywords: Other	Visit Website» Documentation»
EggNOG‑mapper	Fast genome-wide functional annotation through orthology assignment. Keywords: Genome Annotation High-Throughput Sequencing	Visit Website»
EIGENSOFT	The EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and our EIGENSTRAT stratification correction method (Price et al. 2006). Keywords: Genomics Genomics	Visit Website» Web Forum»
elPrep	a high-performance tool for analyzing .sam/.bam files (up to and including variant calling) in sequencing pipelines. Keywords: High-throughput sequencing Variant Analysis	Visit Website»
EMA	Fast & accurate alignment of barcoded short-reads Keywords: High-throughput sequencing	Visit Website»
Emacs	an extensible, customizable, free/libre text editor. Keywords: Other	Visit Website» Documentation» Mailing List»
EMBOSS Alan Bleasby, Peter Rice	a program that integrates a range of currently available packages and tools for sequence analysis into a seamless whole. Keywords: High-throughput sequencing WGS Analysis Other	Visit Website» Documentation»
EMu	EMu is a relative abundance estimator for 16S genomic sequences Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
ENANO	a FASTQ lossless compression algorithm especially designed for nanopore sequencing FASTQ files. Keywords: High-throughput sequencing	Visit Website»
ensembl_vep	predicts the functional effects of genomic variants Keywords: Genomics Genomics	Visit Website» Documentation»
EPA‑ng	a complete rewrite of the Evolutionary Placement Algorithm (EPA), previously implemented in RAxML. It uses libpll and pll-modules to perform maximum likelihood-based phylogenetic placement of genetic sequences on a user-supplied reference tree and alignment. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
epic2	Ultraperformant ChIP-Seq broad domain finder based on SICER. Keywords: ChIP-Sequencing High-Throughput Sequencing	Visit Website»
EVcouplings Chris Sander, Debora S Marks, Thomas Hopf	a tool to predict protein structure, function, and mutations using evolutionary sequence covariation. Keywords: Protein Structure Analysis Proteomics	Visit Website» Documentation» Web Forum» Webinars
ExaBayes	is a software package for Bayesian tree inference. Keywords: Phylogenetic Inference High-Throughput Sequencing	Visit Website»
Exomiser Damian Smedley, Peter N Robinson, Sebastian Köhler	a Java program that finds potential disease-causing variants from whole-exome or whole-genome sequencing data. Starting from a VCF file and a set of phenotypes encoded using the Human Phenotype Ontology (HPO), it will annotate, filter and prioritize likely causative variants based on user-defined criteria such as a variant's predicted pathogenicity, frequency of occurrence in a population and also how closely the given phenotype matches … Keywords: Genome Annotation Genomics Genotype-Phenotype Analysis Genomics	Visit Website» Documentation» Web Forum»
Exonerate	Exonerate is a generic tool for pairwise sequence comparison. It allows you to align sequences using many alignment models, with either exhaustive dynamic programming or a variety of heuristics. Keywords: Sequence Alignment Analysis Other	Visit Website» Documentation»
eXpress Adam Roberts, Lior Pachter, Xprs Ask	a streaming tool for quantifying the abundances of a set of target sequences from sampled subsequences. Keywords: High-throughput sequencing RNA-Sequencing Transcript Quantification	Visit Website» Documentation» Web Forum»
falco	is a drop-in C++ implementation of FastQC to assess the quality of sequence reads. Keywords: Read Quality Control High-Throughput Sequencing	Visit Website»
FAMSA	(Fast and Accurate Multiple Sequence Aligner) implements an algorithm for large-scale multiple sequence alignments (400k proteins in 2 hours and 8 GB of RAM). Keywords: Protein-protein sequence alignment Proteomics	Visit Website»
FASTA William Pearson	a DNA and protein sequence alignment software package that searches for matching sequence patterns or words, called k-tuples. Keywords: Comparative Genomics Genomics Nucleotide Sequence Homology Search High-Throughput Sequencing Other	Visit Website»
FastANI	developed for fast alignment-free computation of whole-genome Average Nucleotide Identity (ANI). Keywords: Genomics Genomics	Visit Website»
Fasten	Perform random operations on fastq files, using unix streaming. Secure your analysis with Fasten! Keywords: High-throughput sequencing	Visit Website» Documentation»
FastK	FastK is a k‑mer counter that is optimized for processing high quality DNA assembly data sets such as those produced with an Illumina instrument or a PacBio run in HiFi mode. Keywords: High-throughput sequencing	Visit Website» Web Forum»
FastME	FastME provides distance algorithms to infer phylogenies. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
Fastool	Fastool is a simple and quick tool to read huge FastQ and FastA files (both normal and gzipped) and manipulate them. It makes use of the KSeq library (http://lh3lh3.users.sourceforge.net/kseq.shtml) for fast access to FastQ/A files. Keywords: High-throughput sequencing	Visit Website»
fastp	is a tool designed to provide fast all-in-one preprocessing for FastQ files. This tool is developed in C++ with multithreading supported to afford high performance. Keywords: High-throughput sequencing	Visit Website» Web Forum»
FastQC Simon Andrews	a quality control tool for high throughput sequence data. Keywords: ChIP-Sequencing DNA-Sequencing High-throughput sequencing Read Quality Control RNA-Sequencing WGS Analysis	Visit Website» Documentation» Webinars
fastq‑dl	A tool to download FASTQs associated with Study, Experiment, or Run accessions. Keywords: High-throughput sequencing	Visit Website»
fastq‑scan	fastq-scan reads a FASTQ from STDIN and outputs summary statistics (read lengths, per-read qualities, per-base qualities) in JSON format. Keywords: High-Throughput Sequencing	Visit Website»
FastQ Screen Simon Andrews	allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect. Keywords: High-throughput sequencing Read Quality Control WGS Analysis	Visit Website»
FastQTL	a fast, flexible, user-friendly, cluster-friendly QTL mapper. Keywords: Genomics quantitative trait loci (QTLs) mapping/discovery High-Throughput Sequencing	Visit Website» Documentation»
FastTree Morgan N Price	infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences. FastTree can handle alignments with up to a million sequences in a reasonable amount of time and memory. Keywords: Metagenomic Sequencing Analysis Genomics High-Throughput Sequencing	Visit Website» Documentation»
fastv	an ultra-fast tool for identification of SARS-CoV-2 and other microbes from sequencing data. Keywords: Virus Sequence Detection High-Throughput Sequencing	Visit Website»
FASTX_Toolkit Assaf Gordon	a collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
Fast Data Transfer ‑ FDT	an application for efficient data transfers that is capable of reading and writing at disk speed over wide area networks (with standard TCP). It is written in Java, runs on all major platforms and is easy to use. Keywords: Other	Visit Website» Documentation»
fgbio	a set of tools to analyze genomic data with a focus on Next Generation Sequencing. Keywords: Genomics High-Throughput Sequencing	Visit Website»
fibertools‑rs	a CLI tool for interacting with fiberseq bam files. Keywords: High-throughput sequencing	Visit Website»
Fiji Mark Hiner, Curtis Rueden, Kevin Eliceiri, Pavel Tomancak	an image processing package. It can be described as a distribution of ImageJ (and ImageJ2) together with Java, Java 3D and a lot of plugins organized into a coherent menu structure. Fiji compares to ImageJ as Ubuntu compares to Linux. Keywords: Other	Visit Website» Documentation» Web Forum» Mailing List»
Filtlong	a tool for filtering long reads by quality. Keywords: Read Quality Control High-Throughput Sequencing	Visit Website»
FLASH	(Fast Length Adjustment of SHort reads) is a very fast and accurate software tool to merge paired-end reads from next-generation sequencing experiments. FLASH is designed to merge pairs of reads when the original DNA fragments are shorter than twice the length of reads. The resulting longer reads can significantly improve genome assemblies. They can also improve transcriptome assembly when FLASH is used to merge … Keywords: High-throughput sequencing	Visit Website»
FlashPCA	performs fast principal component analysis (PCA) of single nucleotide polymorphism (SNP) data, similar to smartpca from EIGENSOFT (http://www.hsph.harvard.edu/alkes-price/software/) and shellfish (https://github.com/dandavison/shellfish). FlashPCA is based on the https://github.com/yixuan/spectra/ library. Keywords: Variant Analysis Genomics	Visit Website»
flexbar	preprocesses high-throughput sequencing data efficiently Keywords: High-throughput sequencing	Visit Website»
Flye	a fast and accurate de novo assembler for single molecule sequencing reads. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
Foldseek Martin Steinegger, Johannes Soeding, Charlotte Tumescheit, Milot Mirdita, Stephanie Kim, Michel van Kempen	a program that enables fast and sensitive comparisons of large structure sets. Keywords: Structural Biology Structure Visualization & Analysis Visualization Visualization	Visit Website» Documentation» Web Forum» Webinars
fpa	Filter Pairwise Alignment: filters long-read mapping information to save disk space. Keywords: High-throughput sequencing	Visit Website»
fqgrep	is an approximate sequence pattern matcher for FASTQ/FASTA files. Keywords: High-Throughput Sequencing	Visit Website»
fqtools	an efficient FASTQ manipulation suite. Keywords: High-Throughput Sequencing	Visit Website»
freebayes	Bayesian haplotype-based polymorphism discovery and genotyping. Keywords: Genotype-Phenotype Analysis High-Throughput Sequencing	Visit Website»
FreeSurfer	a software package for the analysis and visualization of structural and functional neuroimaging data from cross-sectional or longitudinal studies. Keywords: Image-Analysis Libraries Other	Visit Website» Documentation» Web Forum»
FsnViz	Tool for plotting gene fusion events detected by various tools using Circos. Keywords: Visualization High-Throughput Sequencing	Visit Website»
FusionCatcher	finds somatic fusion-genes in RNA-seq data. Keywords: Genomics Genomics	Visit Website»
GangSTR Nima Mousavi	a tool for genome-wide profiling tandem repeats from short reads. A key advantage of GangSTR over existing tools (e.g. lobSTR or hipSTR) is that it can handle repeats that are longer than the read length. GangSTR takes aligned reads (BAM) and a set of repeats in the reference genome as input and outputs a VCF file containing genotypes for each locus. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
gappa	Genesis Applications for Phylogenetic Placement Analysis Keywords: Phylogenomics Genomics	Visit Website»
GATE	GATE is a Monte Carlo simulation toolkit for medical physics applications. Keywords: Visualization Other	Visit Website»
GATK Eric Banks	(Genome Analysis Toolkit) a software package developed to analyze high-throughput sequencing data capable of taking on projects of any size with a primary focus on variant discovery, genotyping, and data quality assurance. Keywords: DNA-Sequencing Germline SNP Detection High-throughput sequencing RNA-Sequencing WGS Analysis	Visit Website» Documentation» Web Forum»
GCEN	a command-line toolkit that allows biologists to easily build gene co-expression networks and predict gene function, especially in RNA-Seq research or lncRNA annotation. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Google Cloud SDK	a set of tools and libraries for interacting with Google Cloud products and services. Keywords: Other	Visit Website» Documentation» Web Forum» Mailing List»
GCTA Jian Yang, Peter Visscher, Mike Goddard, Andrew Bakshi	(Genome-wide Complex Trait Analysis) a tool for genome-wide complex trait analysis with five main functions: data management, estimation of the genetic relationships from SNPs, mixed linear model analysis of variance explained by the SNPs, estimation of the linkage disequilibrium structure, and GWAS simulation. GCTA estimates the variance explained by all the SNPs on a chromosome or on the whole genome for a complex trait … Keywords: Complex Trait Prediction Genotype-Phenotype Analysis GWAS Analysis Genomics	Visit Website» Documentation» Web Forum»
gdcm	Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. Keywords: Visualization Visualization	Visit Website» Documentation» Mailing List»
GEDI	a software platform for working with genomic data such as sequencing reads, sequences, per-base numeric values or annotations written in Java. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
Genepop François Rousset	a population genetics package that computes exact tests for Hardy-Weinberg equilibrium, for population differentiation and for genotypic disequilibrium among pairs of loci; computes estimates of F-statistics, null allele frequencies, allele size-based statistics for microsatellites, etc.; and performs analyses of isolation by distance from pairwise comparisons of individuals or population samples, including confidence intervals for “neighborhood size”. Keywords: Genomics Statistical Analysis High-Throughput Sequencing	Visit Website» Documentation»
Genion	Characterizing gene fusions using long transcriptomics reads Keywords: High-throughput sequencing	Visit Website»
GenomeBrowse	a free tool offered by Golden Helix that delivers stunning visualizations of your genomic data, enabling you to see what is occurring at each base pair in your samples. Keywords: Genome Visualization Genomics Visualization	Visit Website» Documentation» Web Forum»
Genrich	a peak-caller for genomic enrichment assays (e.g. ChIP-seq, ATAC-seq). Keywords: ATAC-Seq ChIP-Sequencing Genomics	Visit Website» Documentation»
geofetch	Downloads data and metadata from GEO and SRA and creates standard PEPs. Keywords: High-Throughput Sequencing	Visit Website» Documentation»
gfastats	gfastats is a single fast and exhaustive tool for summary statistics and simultaneous fa (fasta, fastq, gfa [.gz]) genome assembly file manipulation. gfastats also allows seamless fasta<>fastq<>gfa[.gz] conversion. It has been tested in genomes even >100Gbp. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
gffcompare Geo Pertea	compares and evaluates the accuracy of RNA-Seq transcript assemblers (Cufflinks, Stringtie), collapses (merges) duplicate transcripts from multiple GTF/GFF3 files (e.g. resulting from assembly of different samples), and classifies transcripts from one or multiple GTF/GFF3 files as they relate to reference transcripts provided in an annotation file (also in GTF/GFF3 format). Keywords: Genomics High-throughput sequencing RNA-Seq Analysis	Visit Website» Documentation»
gffread Geo Pertea	validates, filters, converts and performs various other operations on GFF files (use gffread -h to see the various usage options). Because the program shares the same GFF parser code with Cufflinks, Stringtie, and gffcompare, it could be used to verify that a GFF file from a certain annotation source is correctly "understood" by these programs. Thus the gffread utility can be used to simply … Keywords: Genomics High-throughput sequencing RNA-Seq Analysis	Visit Website» Documentation»
Ghostscript	an interpreter for the PostScript (TM) language. It can display and convert PostScript files. The software can be invoked with the gs command. Keywords: Other	Visit Website» Documentation» Web Forum» Mailing List»
ghostz	is a highly efficient remote homologue detection tool. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
GimmeMotifs	a suite of motif tools, including a motif prediction pipeline for ChIP-seq experiments. Keywords: ChIP-Sequencing High-Throughput Sequencing	Visit Website»
glimpse‑bio	GLIMPSE is a phasing and imputation method for large-scale low-coverage sequencing studies. Keywords: High-throughput sequencing	Visit Website»
Globus CLI Stephen Rosen	(Globus Command Line Interface) a command line wrapper over the Globus SDK for Python. It is a standalone application that can be installed on the user’s machine and used to access the Globus service. Keywords: Other	Visit Website» Documentation» Web Forum» Mailing List»
GMAP	Genomic mapping and alignment program for mRNA and EST sequences. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
GNUVID	(GNU-based Virus IDentification) a Python3 program for Gene Novelty Unit-based Virus Identification for SARS-CoV-2. It ranks CDS nucleotide sequences in a genome fna file based on the number of observed exact CDS nucleotide matches in a public or private database. It was created to type SARS-CoV-2 genomes using a whole genome multilocus sequence typing (wgMLST) approach. Keywords: DNA-Sequencing High-Throughput Sequencing	Visit Website»
Goalign Frédéric Lemoine	a set of command line tools to manipulate multiple alignments. Implemented in Go language, Goalign aims to handle multiple alignments in Phylip, Fasta, Nexus, and Clustal formats, through several basic commands. Each command may print result (an alignment, for example) in the standard output, and thus can be piped to the standard input of the next goalign command. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
gofasta	provides functions for working on alignments in fasta format. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
GoldRush	memory-efficient de novo assembly of long reads Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
goleft	goleft is a collection of bioinformatics tools written in go distributed together as a single binary. Keywords: High-throughput sequencing	Visit Website»
GOR	a tool based on a genomic ordered relational architecture that allows analysis of large sets of genomic and phenotypic tabular data using a declarative query language in a parallel execution engine. It is very efficient in a wide range of use cases, including genome-wide batch analysis, range queries, genomic table joins of variants and segments, filtering, aggregation, etc. Keywords: Genomics Genomics	Visit Website» Documentation»
Gotree Frédéric Lemoine	a set of command line tools to manipulate phylogenetic trees. It is implemented in Go language. The goal is to handle phylogenetic trees in Newick, Nexus and PhyloXML formats, through several basic commands. Each command may print result (a tree for example) in the standard output, and thus can be piped to the standard input of the next gotree command. Keywords: Metagenomic Sequencing Analysis Phylogenetic Inference Other	Visit Website» Documentation» Web Forum»
grabix	grabix leverages the fantastic BGZF library in samtools to provide random access into text files that have been compressed with bgzip. grabix creates its own index (.gbi) of the bgzipped file. Once indexed, one can extract arbitrary lines from the file with the grab command. Or choose random lines with the, well, random command. Keywords: Other	Visit Website»
GraphAligner	Sequence to graph aligner for long reads Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
GraphMap	GraphMap is a novel mapper targeted at aligning long, error-prone third-generation sequencing data. It is designed to handle Oxford Nanopore MinION 1d and 2d reads with very high sensitivity and accuracy, and also presents a significant improvement over the state-of-the-art for PacBio read mappers. Keywords: DNA-Sequencing High-throughput sequencing	Visit Website»
GraphMap2	The GraphMap2 update contains tuning of alignments specific to long RNA reads. GraphMap2 is a novel mapper targeted at aligning long, error-prone third-generation sequencing data. It is designed to handle Oxford Nanopore MinION 1d and 2d reads with very high sensitivity and accuracy, and also presents a significant improvement over the state-of-the-art for PacBio read mappers. Keywords: DNA-Sequencing High-throughput sequencing	Visit Website» Documentation»
GRIDSS	a modular software suite containing tools useful for the detection of genomic rearrangements. GRIDSS includes a genome-wide break-end assembler, as well as a structural variation caller for Illumina sequencing data. GRIDSS calls variants based on alignment-guided positional de Bruijn graph genome-wide break-end assembly, split read, and read pair evidence. Keywords: Genomics Sequence Alignment Analysis Structural Variant Analysis Genomics	Visit Website» Documentation»
GROOT	GROOT is a tool to type Antibiotic Resistance Genes (ARGs) in metagenomic samples (a.k.a. Resistome Profiling). It combines variation graph representation of gene sets with an LSH indexing scheme to allow for fast classification of metagenomic reads. Subsequent hierarchical local alignment of classified reads against graph traversals facilitates accurate reconstruction of full-length gene sequences using a simple scoring scheme. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
GSAlign	an ultra-fast sequence alignment algorithm for intra-species genome comparison. Keywords: Genomics Sequence Alignment Analysis Genomics	Visit Website»
GSEApy	a Python/Rust implementation for GSEA and wrapper for Enrichr. GSEApy can be used for RNA-seq, ChIP-seq, Microarray data. It can be used for convenient GO enrichment and to produce publication quality figures in python. Keywords: Genotype-Phenotype Analysis GWAS Analysis Genomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
GSearch	an ultra-fast and scalable microbial genome search program based on MinHash-like metric and graph-based approximate nearest neighbor search Keywords: Genomics Genomics	Visit Website»
gsMap	gsMap (genetically informed spatial mapping of cells for complex traits) integrates spatial transcriptomics (ST) data with genome-wide association study (GWAS) summary statistics to map cells to human complex traits, including diseases, in a spatially resolved manner. Keywords: GWAS Analysis High-throughput sequencing Transcriptomics	Visit Website» Documentation»
gsort	a tool to sort genomic files according to a genomefile. Keywords: High-throughput sequencing Genomics	Visit Website»
GTDBTk	a software toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes based on the Genome Taxonomy Database (GTDB). Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum» Mailing List»
GToTree	is a user-friendly workflow for phylogenomics intended to give more researchers the capability to create phylogenomic trees. Keywords: Phylogenomics Genomics	Visit Website» Documentation»
gw	a fast browser for genomic sequencing data (.bam/.cram format) used directly from the terminal. GW also allows you to view and annotate variants from vcf/bcf files. Keywords: High-throughput sequencing	Visit Website» Documentation»
GWAMA	(Genome-Wide Association Meta Analysis) software performs meta-analysis of the results of GWA studies of binary or quantitative phenotypes. Fixed- and random-effect meta-analyses are performed for both directly genotyped and imputed SNPs. Keywords: Genomics GWAS Analysis High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
H2O	a scalable machine learning and predictive analytics platform. Keywords: Machine Learning Other	Visit Website» Documentation»
Hail	an open-source, general-purpose, Python-based data analysis library with additional data types and methods for working with genomic data. Keywords: Comparative Genomics GWAS Analysis High-throughput sequencing Genomics	Visit Website» Documentation»
hapLOHseq Paul Scheet, Anthony San Lucas	Developed for the detection of subtle allelic imbalance events from next-generation sequencing data, hapLOHseq is a sequencing-based extension of hapLOH, which is a method for the detection of subtle allelic imbalance events from SNP array data. It is capable of identifying events of 10 mega-bases or greater occurring in as little as 16% of the sample using exome sequencing data (at 80x) and 4% … Keywords: Genomics High-throughput sequencing Variant Analysis Genomics	Visit Website» Documentation»
hatchet	(Holistic Allele-specific Tumor Copy-number Heterogeneity) is an algorithm that infers allele and clone-specific CNAs and WGDs jointly across multiple tumor samples from the same patient, and that leverages the relationships between clones in these samples. Keywords: Genomics Genomics	Visit Website»
HD‑BET	Automated brain extraction of multi-sequence MRI using artificial neural networks. Keywords: MRI Analysis Visualization	Visit Website»
hdbscan	Hierarchical Density-Based Spatial Clustering of Applications with Noise. Performs DBSCAN over varying epsilon values and integrates the result to find a clustering that gives the best stability over epsilon. Keywords:	Visit Website»
HDF5	is a data model, library, and file format for storing and managing data. Keywords: Other	Visit Website» Documentation» Web Forum»
hera	a bioinformatics tool that helps analyze RNA-seq data, providing base-to-base alignment BAM files, transcript abundance estimation, and fusion gene detection. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
HHsuite Johannes Soeding, Martin Steinegger, Milot Mirdita	an open-source software package for sensitive protein sequence searching based on the pairwise alignment of hidden Markov models (HMMs). Keywords: High-throughput sequencing Protein-protein sequence alignment Proteomics Proteomics	Visit Website» Documentation» Web Forum»
hic_breakfinder	a framework that integrates optical mapping, high-throughput chromosome conformation capture (Hi-C), and whole genome sequencing to systematically detect SVs in a variety of normal or cancer samples and cell lines. Keywords: Structural Variant Analysis Genomics	Visit Website» Documentation»
HiCExplorer	is a set of programs to process, normalize, analyze and visualize Hi-C and cHi-C data. Keywords: Hi-C High-Throughput Sequencing	Visit Website» Documentation» Mailing List»
hichip‑peaks	A package that can be used to find enriched peak regions from HiChIP datasets that can then be used as an input to available loop calling tools or to do differential peak analysis. Keywords: ChIP-Sequencing High-Throughput Sequencing	Visit Website»
HiCPro	An optimized and flexible pipeline for Hi-C data processing Keywords: Hi-C High-Throughput Sequencing	Visit Website»
hictk	Blazing fast toolkit to work with .hic and .cool files Keywords: Hi-C Genomics	Visit Website» Documentation»
HiCUP	A tool for mapping and performing quality control on Hi-C data Keywords: Hi-C High-Throughput Sequencing	Visit Website»
Hifiasm	Haplotype-resolved assembler for accurate Hifi reads Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
hifiasm_meta	Metagenome assembler for Hifi reads, based on hifiasm. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
HiLine	HiC alignment and classification pipeline. Keywords: Hi-C High-Throughput Sequencing	Visit Website»
HipSTR Thomas Willems	(Haplotype inference and phasing for Short Tandem Repeats) a novel haplotype-based method for robustly genotyping and phasing STRs from Illumina sequencing data. HipSTR was specifically developed to deal with short tandem repeats (STRs) in genomic sequences in the hopes of obtaining more robust STR genotypes. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
HISAT2 Daehwan Kim, Steven Salzberg	(Hierarchical Indexing for Spliced Alignment of Transcripts) a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) against the general human population (as well as against a single reference genome). HISAT2 is a successor to both HISAT and TopHat2. Keywords: High-throughput sequencing RNA-Sequencing Spliced Read Alignment	Visit Website» Documentation» Mailing List»
HMMER Sean R Eddy	is used for searching sequence databases for sequence homologs, and for making sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Keywords: Sequence Alignment Analysis Proteomics	Visit Website» Documentation»
HOMER Chris Benner	(Hypergeometric Optimization of Motif EnRichment) a suite of sequencing analysis and sequence motif discovery tools. Keywords: ChIP-Sequencing High-throughput sequencing Motif Discovery	Visit Website» Documentation»
Hopla	Hopla enables classic genomic single, duo, trio, etc., analysis, by studying a single (multisample) vcf-file, eventually generating interactive visualizations. Keywords: Genomics Variant Analysis Genomics	Visit Website»
Horovod	Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. Keywords: Machine Learning Other	Visit Website»
htop	is an interactive process viewer. Keywords: Other	Visit Website»
HTSeq Simon Anders	a Python package that provides infrastructure to process data from high-throughput sequencing assays. Keywords: High-throughput sequencing RNA-Sequencing WGS Analysis	Visit Website» Documentation»
HTSlib	a C library for reading/writing high-throughput sequencing data. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum» Mailing List»
htstream	is a quality control and processing pipeline for High Throughput Sequencing data. Keywords: Read Quality Control High-Throughput Sequencing	Visit Website»
HULK	(Histosketching Using Little Kmers) a tool that creates small, fixed-size sketches from streaming microbiome sequencing data, enabling rapid metagenomic dissimilarity analysis. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
HUMAnN2	is a pipeline for efficiently and accurately profiling the presence/absence and abundance of microbial pathways in a community from metagenomic or metatranscriptomic sequencing data (typically millions of short DNA/RNA reads). Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
humann3	is a pipeline for efficiently and accurately profiling the presence/absence and abundance of microbial pathways in a community from metagenomic or metatranscriptomic sequencing data (typically millions of short DNA/RNA reads). Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
HyPhy	(Hypothesis Testing using Phylogenies) an open-source software package for comparative sequence analysis using stochastic evolutionary models. Keywords: Genomics Genomics	Visit Website»
IDR	The IDR (Irreproducible Discovery Rate) framework is a unified approach to measure the reproducibility of findings identified from replicate experiments and provide highly stable thresholds based on reproducibility. Keywords: Statistical Analysis Genomics	Visit Website» Documentation»
IGV Helga Thorvaldsdottir, Jacob Silterra, Jill Mesirov, Jim Robinson	(Integrative Genomics Viewer) a high-performance visualization tool for interactive exploration of large, integrated genomic datasets. It supports a wide variety of data types, including array-based and next-generation sequence data, and genomic annotations. Keywords: ChIP-Sequencing DNA-Sequencing Genome Annotation Genome Visualization Genomics Visualization Visualization	Visit Website» Documentation» Web Forum»
IGV Reports	Creates self-contained html pages for visual variant review with IGV (igv.js). Keywords: Visualization Visualization	Visit Website»
igvtools	command line tools for IGV Keywords: High-throughput sequencing	Visit Website»
ImageJ Wayne Rasband	a Java image processing program inspired by NIH Image that can display, edit, analyze, process, save and print 8-bit, 16-bit, and 32-bit images. It can read many image formats including TIFF, GIF, JPEG, BMP, DICOM, FITS and "raw" and supports "stacks", a series of images that share a single window. It is multithreaded, so time-consuming operations can be performed in parallel with other operations. Keywords: Electron Microscopy	Visit Website» Documentation» Web Forum» Mailing List»
ImageMagick Dirk Lemstra, Glenn Randers-Pehrson, John Cristy	a software suite to create, edit, compose, or convert bitmap images. Keywords: Other	Visit Website» Documentation» Web Forum»
ImmuneBuilder Brennan Abanades Kenyon, Charlotte Deane	is a high-performance deep learning framework developed by the Oxford Protein Informatics Group (OPIG) specifically for predicting the 3D structures of immune receptor proteins. By specializing in antibodies, nanobodies, and T-cell receptors (TCRs), it achieves state-of-the-art accuracy while delivering results over 100 times faster than general-purpose models like AlphaFold2. Keywords: Protein-Protein Interaction Prediction Protein Structure Analysis Structure Visualization & Analysis	Visit Website» Documentation»
IMSEQ	(IMmunogenetic SEQuence Analysis) is a fast, PCR and sequencing error aware tool to analyze high throughput data from recombined T-cell receptor or immunoglobulin gene sequencing experiments. It derives immune repertoires from sequencing data in FASTA / FASTQ format. Keywords: Genomics High-throughput sequencing	Visit Website» Documentation»
Infant FreeSurfer	An open source neuroimaging toolkit for processing, analyzing, and visualizing human brain MR images Keywords: Neuroimaging Visualization	Visit Website»
Infernal Sean R Eddy, Eric P Nawrocki	(INFERence of RNA ALignment) a program that searches DNA sequence databases for RNA structure and sequence similarities and uses a special case of profile stochastic context-free grammars called covariance models (CMs). In many cases, it is more capable of identifying RNA homologs that conserve their secondary structure more than their primary sequence. Keywords: Comparative Genomics Genomics Multiple Nucleotide Sequence Alignment Genomics	Visit Website» Documentation» Web Forum»
InSilicoSeq	A sequencing simulator. Keywords: High-throughput sequencing	Visit Website»
IntaRNA	efficient RNA-RNA interaction prediction incorporating seeding and accessibility of interacting sites. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website»
intervene	is a tool for intersection and visualization of multiple genomic region and gene sets (or lists of items). Keywords: Genomics Genomics	Visit Website» Documentation»
IQ‑TREE	efficient and versatile phylogenomic software by maximum likelihood. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
isoseq3	Scalable De Novo Isoform Discovery Keywords: PacBio Sequencing High-Throughput Sequencing	Visit Website»
IsoTree Jin Zhao	an efficient de novo transcriptome assembler for RNA-Seq data. It can assemble transcripts from RNA-Seq reads (in fasta format). Unlike most de novo assembly methods, which build a de Bruijn graph or splicing graph by connecting k-mers (sets of overlapping substrings generated from reads), IsoTree constructs the splicing graph by connecting reads directly. For each splicing graph, IsoTree applies an iterative scheme of … Keywords: High-throughput sequencing RNA-Sequencing Transcriptomics	Visit Website» Documentation»
ITK	(Insight Toolkit) is an open-source, cross-platform library that provides developers with an extensive suite of software tools for image analysis Keywords: Image-Analysis Libraries Visualization	Visit Website» Documentation» Web Forum»
ITK‑SNAP	is a software application used to segment structures in 3D medical images. Keywords: Image-Analysis Libraries MRI Analysis Neuroimaging Other	Visit Website» Documentation» Web Forum» Mailing List»
ivar	is a computational package that contains functions broadly useful for viral amplicon-based sequencing. Keywords: Virus Sequence Detection High-Throughput Sequencing	Visit Website»
JBrowse2	a new kind of genome browser that runs on your desktop. Keywords: Visualization Genomics High-Throughput Sequencing Visualization	Visit Website» Documentation» Web Forum» Mailing List»
Jellyfish Guillaume Marçais, Carl Kingsford	a tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers quickly by using an efficient encoding of a hash table and by exploiting the "compare-and-swap" CPU instruction to increase parallelism. Keywords: High-throughput sequencing Genomics	Visit Website» Documentation»
jo	a small utility to create JSON objects. Keywords: Other	Visit Website» Documentation»
jq Stephen Dolan, William Langford, Nicolas Williams	a lightweight and flexible command-line JSON processor. Keywords: Other	Visit Website» Documentation» Web Forum»
Juicer	a one-click pipeline for processing terabase scale Hi-C datasets. Using Juicer, you can go from raw fastq files to Hi-C maps binned at many resolutions; automatically annotate loops and contact domains with the Juicer tools; and run the pipeline in the cloud, on LSF, Univa, or SLURM, or on a single CPU. Juicer creates hic files from raw (unaligned) reads derived from a Hi-C experiment. Keywords: Hi-C High-throughput sequencing	Visit Website» Documentation» Web Forum»
Julia Alan Edelman, Valentin Churavy, Jeff Bezanson, Viral B Shah, Stefan Karpinski	a flexible dynamic language appropriate for scientific and numerical computing with performance comparable to traditional statically-typed languages. Keywords: Programming Tools Other	Visit Website» Documentation» Web Forum»
Jupyter Thomas Kluyver, Matthias Bussonnier, Benjamin Ragan-Kelley	a language-agnostic HTML notebook application for Project Jupyter. Keywords: Pipelines Programming Tools Other	Visit Website» Documentation» Web Forum»
JupyterLab Steven Silvester, Afshin Darian, Jason Grout	a program for the next-generation web-based user interface for Project Jupyter. JupyterLab enables you to work with documents and activities such as Jupyter notebooks, text editors, terminals, and custom components in a flexible, integrated, and extensible manner. Keywords: Programming Tools Other	Visit Website» Documentation» Web Forum»
Kaiju	fast and sensitive taxonomic classification for metagenomics. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
kalign2	a fast and accurate multiple sequence alignment algorithm designed to align large numbers of protein sequences. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website» Documentation»
kallisto Lior Pachter, Nicolas L Bray, Páll Melsted	a program for quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads. It is based on the novel idea of pseudoalignment for rapidly determining the compatibility of reads with targets, without the need for alignment. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation» Web Forum» Mailing List» Webinars
kb‑python	kb-python is a Python package for processing single-cell RNA-sequencing. It wraps the kallisto | bustools single-cell RNA-seq command line tools in order to unify multiple processing workflows. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation» Mailing List»
Keras François Chollet, Frédéric Branchaud-Charron, Taehoon Lee	a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. Developed with a focus on enabling fast experimentation, Keras is a deep learning library that allows for easy and fast prototyping (through user friendliness, modularity, and extensibility); supports both convolutional networks and recurrent networks, as well as combinations of the two; and runs seamlessly on … Keywords: Other	Visit Website» Documentation» Web Forum»
Kleborate	Kleborate: a tool for typing and screening pathogen genome assemblies Keywords: Genomics High-throughput sequencing Metagenomic Sequencing Analysis Genomics	Visit Website»
km	software for RNA-seq investigation using k-mer decomposition Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
kma	a method designed to map raw reads directly against redundant databases, in an ultra-fast manner using seed and extend. KMA is particularly good at aligning high quality reads against highly redundant databases, where unique matches often do not exist. It works for long low quality reads as well, such as those from Nanopore. Non-unique matches are resolved using the "ConClave" sorting scheme, and a … Keywords: High-throughput sequencing Read Alignment	Visit Website»
KMC	KMC (K-mer Counter) is a utility designed for counting k-mers (sequences of consecutive k symbols) in a set of reads from genome sequencing projects. K-mer counting is important for many bioinformatics applications, e.g., developing de Bruijn graph assemblers. Building de Bruijn graphs is a commonly used approach for genome assembly with data from second-generation sequencers. Unfortunately, sequencing errors (frequent in practice) result in huge memory … Keywords: High-throughput sequencing	Visit Website»
KMCP	accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
KneadData	is a tool designed to perform quality control on metagenomic sequencing data, especially data from microbiome experiments. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Kraken 2	a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
krakenuniq	Metagenomics classifier with unique k-mer counting for more specific results Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
Krona Tools	is a set of scripts to create Krona charts from several Bioinformatics tools as well as from text and XML files. Keywords: Visualization Visualization	Visit Website»
LAST	finds & aligns related regions of sequences. LAST is designed for moderately large data (e.g. genomes, DNA reads, proteomes). Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
lastz	LASTZ is a program for aligning DNA sequences, a pairwise aligner. Keywords: DNA-Sequencing High-Throughput Sequencing	Visit Website»
LazyPredict	Lazy Predict helps build a lot of basic models without much code and helps understand which models work better without any parameter tuning. Keywords: Machine Learning Other	Visit Website» Documentation»
LCA	Lowest Common Ancestor calculation tool Keywords: Phylogenomics High-Throughput Sequencing	Visit Website»
LDAK	a powerful and computationally efficient method for mixed-model association analysis in genome-wide association studies (GWAS). It is part of the LDAK software, which is written in C. Keywords: GWAS Analysis Genomics	Visit Website» Documentation» Mailing List»
LDSC	a command line tool for estimating heritability and genetic correlation from GWAS summary statistics. ldsc also computes LD Scores. Keywords: Genomics GWAS Analysis Genomics	Visit Website» Documentation»
leafcutter	Leafcutter quantifies RNA splicing variation using short-read RNA-seq data. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
LevioSAM2	Fast and accurate coordinate conversion between assemblies Keywords: High-throughput sequencing	Visit Website»
LFTP	is a file transfer program that allows sophisticated FTP, HTTP and other connections to other hosts. If a site is specified, LFTP will connect to that site; otherwise a connection has to be established with the open command. Keywords: Other	Visit Website» Documentation» Mailing List»
LiftoffTools	is a toolkit to compare genes lifted between genome assemblies. Keywords: Genome Annotation Genomics	Visit Website»
Lighter	a kmer-based error correction method for whole genome sequencing data. Keywords: Read Quality Control High-Throughput Sequencing	Visit Website»
lima	is the standard tool to identify barcode and primer sequences in PacBio single-molecule sequencing data. Keywords: PacBio Sequencing High-Throughput Sequencing	Visit Website»
locarna	Tools for the structural analysis of RNA Keywords: Structure Visualization & Analysis High-Throughput Sequencing	Visit Website»
LoFreq	is a fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data. It makes full use of base-call qualities and other sources of errors inherent in sequencing (e.g. mapping or base/indel alignment uncertainty), which are usually ignored by other methods or only used for filtering. Keywords: Genomics High-throughput sequencing Variant Analysis Genomics	Visit Website» Documentation»
LongGF	a computational algorithm and software tool for fast and accurate detection of gene fusion by long-read transcriptome sequencing Keywords: Genome Annotation High-Throughput Sequencing	Visit Website» Web Forum»
LongReadSum	LongReadSum supports FASTA, FASTQ, BAM, FAST5, and sequencing_summary.txt file formats for quick generation of QC data in HTML and text format. Keywords: Read Quality Control High-Throughput Sequencing	Visit Website»
Longshot	a variant calling tool for diploid genomes using long, error-prone reads such as those from Pacific Biosciences (PacBio) SMRT and Oxford Nanopore Technologies (ONT). Keywords: Variant Analysis High-Throughput Sequencing	Visit Website»
lorax	A long-read analysis toolbox for cancer genomics. Keywords: Genomics High-throughput sequencing Genomics	Visit Website»
lordec	A hybrid error correction program for long, PacBio reads Keywords: PacBio Sequencing High-Throughput Sequencing	Visit Website»
lorikeet	is a tool for digital spoligotyping of MTB strains from Illumina read data. Keywords: High-throughput sequencing	Visit Website»
LRez	Standalone tool and library for working with barcoded linked-reads. Keywords: High-throughput sequencing	Visit Website»
LUMPY Ryan Layer, Ira M Hall, Colby Chiang	a probabilistic framework for structural variant discovery. Keywords: Genomics Genomics	Visit Website» Documentation» Web Forum»
Luna	is an open-source C/C++ software package for manipulating and analyzing polysomnographic recordings, with a focus on the sleep EEG. Keywords: Image-Analysis Libraries	Visit Website» Documentation»
Macrel	(Meta)genomic AMP Classification and Retrieval Pipeline to mine antimicrobial peptides (AMPs) from (meta)genomes. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum»
MACS2 Tao Liu	(Model Based Analysis of ChIP-Seq data) a novel algorithm for identifying transcription factor binding sites. Keywords: ChIP-Sequencing High-throughput sequencing High-Tpeak Calling	Visit Website» Documentation» Web Forum»
MACS3	Model Based Analysis for ChIP-Seq data. Keywords: ChIP-Sequencing High-Throughput Sequencing	Visit Website»
MacSyFinder	a program to model and detect macromolecular systems and genetic pathways in protein datasets. In prokaryotes, these systems often have evolutionarily conserved properties: they are made of conserved components and are encoded in compact loci (conserved genetic architecture). The user models these systems with MacSyFinder to reflect these conserved features and to allow their efficient detection. Keywords: CRISPR/Cas9 Screen Analysis Genomics Genomics	Visit Website» Documentation» Web Forum»
MAFFT Katoh Kazutaka	a multiple sequence alignment program for Unix-like operating systems. It offers a range of multiple alignment methods, including L-INS-i (accurate; for alignment of <200 sequences) and FFT-NS-2 (fast; for alignment of <30,000 sequences). Keywords: Multiple Structure Alignment Protein Structure Analysis Proteomics Proteomics	Visit Website» Documentation» Web Forum» Webinars
MAGeCK	(Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout) a computational tool to identify important genes from the recent genome-scale CRISPR-Cas9 knockout screens (or GeCKO) technology. Keywords: CRISPR/Cas9 Screen Analysis Genomics	Visit Website» Documentation» Web Forum»
MAGeCK‑VISPR	a comprehensive quality control, analysis and visualization workflow for CRISPR/Cas9 screens. Keywords: CRISPR/Cas9 Screen Analysis Genomics	Visit Website» Web Forum»
MAMA	multi-ancestry meta-analysis (MAMA) is a Python-based command line tool that meta-analyzes GWAS summary statistics generated from distinct ancestry groups. Keywords: GWAS Analysis Genomics	Visit Website» Documentation» Web Forum» Mailing List»
Manta Christopher T Saunders	calls structural variants (SVs) and indels from mapped paired-end sequencing reads. Manta is optimized for analysis of germline variation in small sets of individuals and somatic variation in tumor/normal sample pairs. It discovers, assembles, and scores large-scale SVs, medium-sized indels and large insertions within a single efficient workflow. The method is designed for rapid analysis on standard compute hardware: NA12878 at 50x genomic coverage … Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
MapCaller	An efficient and versatile approach for short-read alignment and variant detection in high-throughput sequenced genomes. Keywords: High-throughput sequencing	Visit Website»
MAPS	The MAPS (Model-based Analysis of PLAC-Seq data) pipeline is a set of scripts used to analyze PLAC-Seq and HiChIP data. Keywords: HiChIP PLAC-seq High-Throughput Sequencing	Visit Website»
MAPseq	a set of fast and accurate sequence read classification tools designed to assign taxonomy and OTU classifications to ribosomal RNA sequences. This is done by using a reference set of full-length ribosomal RNA sequences for which taxonomies are known, and for which a set of high-quality OTU clusters has been previously generated. For each read, the best guess and corresponding confidence in … Keywords: High-throughput sequencing Metagenomic Sequencing Analysis	Visit Website» Documentation» Web Forum»
maq Richard Durbin, Heng Li	(Mapping and Assembly with Qualities) builds mapping assemblies from short reads generated by the next-generation sequencing machines. Keywords: ChIP-Sequencing High-throughput sequencing Read Alignment WGS Analysis	Visit Website» Documentation» Web Forum» Mailing List»
MarViN Rudy Arthur	a method for rapid genotype refinement for whole-genome sequencing data using multi-variate normal distribution. Whole-genome low-coverage sequencing has been combined with linkage-disequilibrium (LD) based genotype refinement to accurately and cost-effectively infer genotypes in large cohorts of individuals. Keywords: Genomics Genotype-Phenotype Analysis High-throughput sequencing Genomics	Visit Website»
Mash	is a fast sequence distance estimator that uses the MinHash algorithm and is designed to work with genomes and metagenomes in the form of assemblies or reads. Keywords: Genomics Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
MashMap	A fast approximate aligner for long DNA sequences. Keywords: High-throughput sequencing	Visit Website»
mbg	Minimizer based sparse de Bruijn graph constructor. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
mbgc	(Multiple Bacteria Genome Compressor) is a tool for compressing genomes in FASTA (or gzipped FASTA) input format. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
MDAnalysis Beckstein Oliver, Michaud-Agrawal Naveen, Denning J Elizabeth, Woolf B Thomas, Reddy J. E. Tyler, Domański Jan, Linke Max, Gowers J Richard, Barnoud Jonathan, Melo N Manuel, Seyler L Sean, Dotson L David, Kenney M Ian, Buchoux Sébastien	an object-oriented Python library to analyze trajectories from molecular dynamics (MD) simulations in many popular formats. It can write most of these formats, too, together with atom selections suitable for visualization or native analysis tools. Keywords: Python Module Other	Visit Website» Documentation» Web Forum» Mailing List»
medaka	a tool to create consensus sequences and variant calls from nanopore sequencing data. Keywords: Nanopore Variant Analysis High-Throughput Sequencing	Visit Website» Documentation»
mega2	(Manipulation Environment for Genetic Analyses) - data-handling program for facilitating genetic linkage and association analyses. Keywords: GWAS Analysis High-Throughput Sequencing	Visit Website» Documentation»
megadepth	Megadepth is an efficient tool for extracting coverage related information from RNA and DNA-seq BAM and BigWig files. Keywords: High-throughput sequencing	Visit Website»
MEGAHIT	an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
mentalist	MLST (multi-locus sequence typing) is a classic technique for genotyping bacteria, widely applied for pathogen outbreak surveillance. Keywords: Virus Sequence Detection High-Throughput Sequencing	Visit Website»
merlin	uses sparse trees to represent gene flow in pedigrees and is a fast pedigree analysis package. Keywords: Genomics Genomics	Visit Website»
MetaEuk	a modular toolkit designed for large-scale gene discovery and annotation in eukaryotic metagenomic contigs. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
metagenome‑atlas	ATLAS - Three commands to start analysing your metagenome data Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
MetaGraph	The MetaGraph framework allows for indexing and analysis of very large biological sequence collections, producing compressed indexes that can represent several petabases of input data. The indexes can be efficiently queried with any query sequence of interest. Keywords: Genome Assembly High-throughput sequencing	Visit Website» Documentation»
metaMDBG	a fast, low-memory assembler designed for long and accurate metagenomic reads, such as those produced by PacBio HiFi and Nanopore sequencing. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
MetaPhlAn	Metagenomic Phylogenetic Analysis Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
MetaPhlAn2	(Metagenomic Phylogenetic Analysis) is a computational tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Metaphor	Metagenomic Pipeline for Short Reads Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
MethylDackel	MethylDackel will process a coordinate-sorted and indexed BAM or CRAM file containing some form of BS-seq alignments and extract per-base methylation metrics from them. MethylDackel requires an indexed fasta file containing the reference genome as well. Keywords: bisulfite-Seq High-throughput sequencing	Visit Website» Documentation»
mhcflurry	MHC I ligand prediction package with competitive accuracy and a fast and documented implementation. Keywords: Genomics Motif Discovery Genomics	Visit Website»
MICA Bonnie Berger, Noah M Daniels	(Metagenomic Inquiry Compressive Acceleration) a family of programs for performing compressively-accelerated metagenomic sequence searches based on BLASTX and DIAMOND. Keywords: High-throughput sequencing Metagenomic Sequencing Analysis Protein Database Search	Visit Website» Documentation»
mimeo	Scan genomes for internally repeated sequences, elements which are repetitive in another species, or high-identity HGT candidate regions between species. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
MinCED	MinCED is a program to find Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) in full genomes or environmental datasets such as assembled contigs from metagenomes. Keywords: CRISPR/Cas9 Screen Analysis Genomics	Visit Website»
minialign	Minialign is a little bit fast and moderately accurate nucleotide sequence alignment tool designed for PacBio and Nanopore long reads. It is built on three key algorithms, minimizer-based index of the minimap overlapper, array-based seed chaining, and SIMD-parallel Smith-Waterman-Gotoh extension. Keywords: High-throughput sequencing	Visit Website»
miniasm	Miniasm is a very fast OLC-based de novo assembler for noisy long reads. It takes all-vs-all read self-mappings (typically by minimap) as input and outputs an assembly graph in the GFA format. Different from mainstream assemblers, miniasm does not have a consensus step. It simply concatenates pieces of read sequences to generate the final unitig sequences. Thus the per-base error rate is similar to … Keywords: Genome Assembly High-throughput sequencing	Visit Website»
Minimac4	a lower memory and more computationally efficient implementation of the genotype imputation algorithms in minimac/minimac2/minimac3. Keywords: Genomics Genomics	Visit Website» Documentation» Mailing List»
Minimap2	is a general-purpose alignment program to map DNA or long mRNA sequences against a large reference database. It works with accurate short reads of ≥100 bp in length, ≥1 kb genomic reads at error rate ∼15%, full-length noisy Direct RNA or cDNA reads and assembly contigs or closely related full chromosomes of hundreds of megabases in length. Minimap2 does split-read alignment, employs concave gap … Keywords: High-throughput sequencing Read Alignment	Visit Website»
minorseq	PacBio Minor Variant Calling and Phasing Tools Keywords: PacBio Sequencing High-Throughput Sequencing	Visit Website»
MIRA Bastien Chevreux	whole genome shotgun and EST sequence assembler for Sanger, 454, Solexa (Illumina), IonTorrent and PacBio data (the latter at the moment only CCS and error-corrected CLR reads). Keywords: Genome Assembly High-throughput sequencing	Visit Website» Documentation» Web Forum» Mailing List»
MISO Yarden Katz	a probabilistic framework that quantitates the expression level of alternatively spliced genes from RNA-Seq data, and identifies differentially regulated isoforms or exons across samples. MISO is installed as a standalone program and as a module within python. Keywords: Alternative Splicing High-throughput sequencing Python Module RNA-Seq Analysis Other	Visit Website» Documentation» Web Forum» Mailing List»
mlst	scan contig files against PubMLST typing schemes. Keywords: High-throughput sequencing	Visit Website»
mmquant	RNA-Seq quantification tool, with special handling on multi-mapping reads. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
MMseqs2 Johannes Soeding, Martin Steinegger	an ultra fast and sensitive sequence search and clustering suite Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
MMTF‑python	(Macromolecular Transmission Format python) is the macromolecular transmission format (MMTF) binary encoding of biological structures. Keywords: Structure Visualization & Analysis	Visit Website» Documentation»
MOB‑suite	MOB-suite: software tools for clustering, reconstruction and typing of plasmids from draft assemblies. The MOB-suite is designed to be a modular set of tools for the typing and reconstruction of plasmid sequences from WGS assemblies. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
mokapot	fast and flexible semi-supervised learning for peptide detection. Keywords: Proteomics Proteomics	Visit Website» Documentation»
Molecular Nodes Brady A Johnston	a tool that enables quick import and visualisation of structural biology data inside of Blender. Keywords: Structure Visualization & Analysis Visualization Visualization	Visit Website» Documentation» Web Forum»
Monocle3	An analysis toolkit for single-cell RNA-seq. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
MOODS	MOODS is a collection of algorithms used to match position weight matrices (PWM) with DNA sequences. Keywords: DNA-Sequencing High-Throughput Sequencing	Visit Website»
mosdepth	fast BAM/CRAM depth calculation for WGS, exome, or targeted sequencing. Keywords: High-throughput sequencing	Visit Website»
mothur Pat Schloss	a project to develop a single piece of open-source, expandable software to fill the bioinformatics needs of the microbial ecology community. Includes accelerated versions of DOTUR and SONS and the functionality of a number of other popular tools. Keywords: High-throughput sequencing Metagenomic Sequencing Analysis Pipelines	Visit Website» Documentation» Web Forum»
mOTUs	marker gene-based OTU (mOTU) profiling. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
MPICH Gropp D William, Lusk Ewing	a high performance and widely portable implementation of the Message Passing Interface (MPI) standard. Keywords: Other	Visit Website» Documentation» Web Forum» Mailing List»
MrBayes Huelsenbeck John, Larget Bret, van der Mark Paul, Ronquist Fredrik, Simon Donald	a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. MrBayes uses Markov chain Monte Carlo (MCMC) methods to estimate the posterior distribution of model parameters. Keywords: Genomics Phylogenetic Inference Phylogenomics Genomics	Visit Website» Documentation» Web Forum»
MRIcron	is a cross-platform NIfTI format image viewer. It can load multiple layers of images, generate volume renderings and draw volumes of interest. It also provides dcm2nii for converting DICOM images to NIfTI format and NPM for statistics. MRIcron is a mature and useful tool; however, you may want to consider the more recent MRIcroGL as an alternative. Keywords: Image-Analysis Libraries Visualization	Visit Website» Documentation» Web Forum»
MRIQC	extracts no-reference IQMs (image quality metrics) from structural (T1w and T2w) and functional MRI (magnetic resonance imaging) data. Keywords: MRI Analysis Neuroimaging Other	Visit Website» Documentation» Web Forum»
MR‑MEGA	(Meta-Regression of Multi-AncEstry Genetic Association) is a specialized statistical tool designed for the meta-analysis of multi-ethnic genome-wide association studies (GWAS). Developed by the Reed Group at the University of Tartu, it utilizes a trans-ethnic approach to identify shared genetic effects across diverse populations while accounting for heterogeneity in allelic effects and ancestry. By incorporating principal component analysis to model ancestry-specific effect sizes, … Keywords: GWAS Analysis High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
MRtrix3	provides a set of tools to perform various types of diffusion MRI analyses, from various forms of tractography through to next-generation group-level analyses. Keywords: MRI Analysis Visualization	Visit Website» Documentation»
msamtools	microbiome-related extension to samtools Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
msstitch	a tool to integrate a number of shotgun proteomics tools, generating ready-to-use result files. Keywords: Proteomics Proteomics	Visit Website»
mudskipper	is a tool for converting genomic BAM/SAM files to transcriptomic BAM/RAD files. Keywords: High-Throughput Sequencing	Visit Website»
MultiQC	aggregates results from bioinformatics analyses across many samples into a single report. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
MUMmer	a versatile alignment tool for DNA and protein sequences. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
MUSCLE Robert Edgar	(multiple sequence comparison by log-expectation) a public domain multiple alignment software for protein and nucleotide sequences. Keywords: Other	Visit Website» Documentation» Web Forum»
mwga‑utils	collection of utilities for processing Multispecies Whole Genome Alignments Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
Mykrobe	antibiotic resistance prediction in minutes. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
NanoComp	Comparing runs of Oxford Nanopore sequencing data and alignments Keywords: Nanopore High-Throughput Sequencing	Visit Website»
nanoDoc	RNA modification detection using Nanopore raw reads with Deep One Class classification. Keywords: Nanopore High-Throughput Sequencing	Visit Website»
NanoFilt	Filtering and trimming of long read sequencing data. Keywords: High-throughput sequencing	Visit Website»
NanoPack	a set of tools developed for visualization and processing of long-read sequencing data from Oxford Nanopore Technologies and Pacific Biosciences. Keywords: High-throughput sequencing PacBio Sequencing	Visit Website» Documentation»
nanoplexer	a standard tool to demultiplex Nanopore long read sequencing data. Keywords: High-throughput sequencing Nanopore	Visit Website»
NanoPlot	Plotting tool for long read sequencing data and alignments. Keywords: Visualization High-Throughput Sequencing	Visit Website»
Nanopolish	software package for signal-level analysis of Oxford Nanopore sequencing data. Keywords: Nanopore High-Throughput Sequencing	Visit Website»
nanoq	Ultra-fast quality control and summary reports for nanopore reads Keywords: Nanopore High-Throughput Sequencing	Visit Website»
NanoQC	Create fastQC-like plots for Oxford Nanopore sequencing data Keywords: Nanopore High-Throughput Sequencing	Visit Website»
NanoSim	NanoSim is a fast and scalable read simulator for Nanopore sequencing data. Keywords: Nanopore High-Throughput Sequencing	Visit Website»
NanoStat	calculates various statistics from a long read sequencing dataset in fastq, bam or albacore sequencing summary format. Keywords: High-throughput sequencing Nanopore	Visit Website»
NanoVar	a genomic structural variant (SV) caller that utilizes low-depth long-read sequencing such as Oxford Nanopore Technologies (ONT). Keywords: Nanopore Structural Variant Analysis High-Throughput Sequencing	Visit Website»
Nextalign	Viral genome sequence alignment tool Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
nextclade	SARS-CoV-2 genome clade assignment, mutation calling, and sequence quality checks Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
Nextflow Paolo Di Tommaso	a reactive workflow framework and programming DSL that eases the writing of computational pipelines with complex data. It is designed around the idea that the Linux platform is the lingua franca of data science. Linux provides many simple but powerful command-line and scripting tools that, when chained together, facilitate complex data manipulations. Nextflow extends this approach, adding the ability to define complex program interactions and a … Keywords: Genomics High-throughput sequencing Workflow Management System Other	Visit Website» Documentation» Web Forum»
Nextstrain	real-time tracking of pathogen evolution. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
NGLess	(NGS Processing with Less Work) enables creation of a pipeline covering the entire first phase of NGS analysis, up to and including annotation. Keywords: High-throughput sequencing Metagenomic Sequencing Analysis	Visit Website» Web Forum»
NGMLR	(coNvex Gap-cost alignMents for Long Reads) a long-read mapper designed to sensitively align PacBio or Oxford Nanopore reads to (large) reference genomes. Keywords: Nanopore High-Throughput Sequencing	Visit Website»
ngsplot	Quick mining and visualization of NGS data by integrating genomic databases Keywords: Visualization High-Throughput Sequencing	Visit Website»
NiftySeg	contains programs to perform EM based segmentation of images in nifti or analyse format. Keywords: Image-Analysis Libraries Visualization	Visit Website» Documentation»
nilearn	a Python module for fast and easy statistical learning on NeuroImaging data. It leverages the scikit-learn Python toolbox for multivariate statistics with applications such as predictive modelling, classification, decoding, or connectivity analysis. Keywords: Machine Learning Neuroimaging Other	Visit Website» Documentation» Web Forum»
ninja‑nj	Nearly Infinite Neighbor Joining Application Keywords: Metagenomic Sequencing Analysis Motif Comparison Phylogenomics High-Throughput Sequencing	Visit Website»
oarfish	is a program for quantifying transcript-level expression from long-read (i.e. Oxford Nanopore cDNA and direct RNA, and PacBio) sequencing technologies. Keywords: Nanopore PacBio Sequencing RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
OCOCO	the first program capable of inferring variants in real time, as read alignments are fed in. Ococo inputs unsorted alignments from a stream and infers single-nucleotide variants, together with a genomic consensus, using statistics stored in compact several-bit counters. Keywords: Variant Analysis High-Throughput Sequencing	Visit Website»
Octopus	Octopus is a mapping-based variant caller that implements several calling models within a unified haplotype-aware framework. Octopus takes inspiration from particle filtering by constructing a tree of haplotypes and dynamically pruning and extending the tree based on haplotype posterior probabilities in a sequential manner. This allows Octopus to implicitly consider all possible haplotypes at a given locus in reasonable time. Keywords: Variant Analysis Genomics	Visit Website» Documentation»
OLego	OLego is a program specifically designed for de novo spliced mapping of mRNA-seq reads. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation»
Oncofuse	Oncofuse is a framework designed to estimate the oncogenic potential of de novo discovered gene fusions. It uses several hallmark features and employs a Bayesian classifier to provide the probability of a given gene fusion being a driver mutation. Keywords: Genomics High-throughput sequencing	Visit Website»
ont_fast5_api	is a simple interface to HDF5 files of the Oxford Nanopore .fast5 file format. Keywords: High-Throughput Sequencing	Visit Website»
OpenCV	(Open Source Computer Vision Library) an open source computer vision and machine learning software library. Keywords: Other	Visit Website» Documentation» Web Forum»
OpenFold Mohammed AlQuraishi, Gustaf Ahdritz, Qinghui Xia, Sachin Kadyan	a trainable, memory-efficient, and GPU-friendly PyTorch reproduction of DeepMind's AlphaFold 2. Keywords: Structure Visualization & Analysis Tertiary Structure Prediction Visualization	Visit Website» Documentation» Web Forum» Webinars
OpenJDK	(Open Java Development Kit) a free and open source implementation of the Java Platform, Standard Edition (Java SE). Keywords: Other	Visit Website»
OpenMPI Gilles Gouaillardet, Nathan Hjelm, Jeff Squyres	an open source Message Passing Interface implementation that is developed and maintained by a consortium of academic, research, and industry partners. Open MPI is therefore able to combine the expertise, technologies, and resources from all across the High Performance Computing community in order to build the best MPI library available. Keywords: Programming Tools Other	Visit Website» Documentation» Web Forum» Mailing List»
OpenMS	OpenMS is an open-source software C++ library for LC-MS data management and analyses. It offers an infrastructure for rapid development of mass spectrometry related software. Keywords: Proteomics Proteomics	Visit Website» Documentation» Web Forum» Mailing List»
OpenNucleome Zhongling Jiang, Bin Zhang, Zhuohan Lao, Kartik Kamat	an open-source software designed for conducting molecular dynamics (MD) simulations of the human nucleus. This software streamlines the process of setting up whole nucleus simulations through just a few lines of Python scripting. Keywords: Computational Chemistry	Visit Website» Documentation» Web Forum»
ORF‑RATER	(Open Reading Frame - Regression Algorithm for Translational Evaluation of Ribosome-protected footprints) comprises a series of scripts for coding sequence annotation based on ribosome profiling data. Keywords: High-throughput sequencing	Visit Website»
origami	a pipeline for processing and calling high-confidence chromatin loops associated with the ChIPped factor. Keywords: ChIP-Sequencing High-Throughput Sequencing	Visit Website»
OrthoFinder	a fast, accurate and comprehensive platform for comparative genomics that infers orthogroups, orthologues, gene trees and a rooted species tree. Keywords: High-throughput sequencing	Visit Website»
OSGenome	an Open Source Web Application for Genetic Data (SNPs) using 23AndMe and Data Crawling Technologies. Keywords: Genotype-Phenotype Analysis Genomics	Visit Website»
p7zip	p7zip is a quick port of 7z.exe and 7za.exe (command line version of 7zip, see www.7-zip.org ) for Unix. Keywords: Other	Visit Website»
pairix	2D indexing on bgzipped text files of paired genomic coordinates Keywords: High-throughput sequencing	Visit Website»
pairtools	CLI tools to process mapped Hi-C data Keywords: Hi-C High-Throughput Sequencing	Visit Website»
PAML	A package of programs for phylogenetic analyses of DNA or protein sequences using maximum likelihood. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
panacus	a counting tool for pangenome graphs. It supports GFA files with P and W lines, but requires that the graph is blunt, i.e., nodes do not overlap and consequently, each link (L) points from the end of one segment (S) to the start of another. Keywords: Phylogenetic Inference Phylogenomics High-Throughput Sequencing	Visit Website»
pandas Wes McKinney	a library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language. pandas is installed as a module within python. Keywords: High-throughput sequencing Python Module WGS Analysis Other	Visit Website» Documentation» Web Forum» Mailing List»
pangolin	(Phylogenetic Assignment of Named Global Outbreak LINeages) software package for assigning SARS-CoV-2 genome sequences to global lineages. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website» Documentation»
Pangolin‑DL	a deep-learning based method for predicting splice site strengths. Keywords: High-throughput sequencing	Visit Website»
Parafly	Given a file containing a list of unix commands, multithreading is used to process the commands in parallel on a single server. Success/failure is captured, and failed commands are retained and reported. Keywords: Other	Visit Website»
ParaView	is the world’s leading open source post-processing visualization engine. Keywords: Visualization Visualization	Visit Website» Documentation» Web Forum»
PASTA	is an implementation of the PASTA (Practical Alignment using Saté and TrAnsitivity) algorithm. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
pbalign	pbalign aligns PacBio reads to reference sequences, filters aligned reads according to user-specific filtering criteria, and converts the output to either the SAM format or PacBio Compare HDF5 (e.g., .cmp.h5) format. The output Compare HDF5 file will be compatible with Quiver if --forQuiver option is specified. Keywords: High-throughput sequencing PacBio Sequencing	Visit Website» Documentation»
pbbam	a package that provides components to create, query, & edit PacBio BAM files and associated indices. These components include a core C++ library, bindings for additional languages, and command-line utilities. Keywords: High-throughput sequencing PacBio Sequencing	Visit Website» Documentation»
pbipa	IPA HiFi Genome Assembler Keywords: PacBio Sequencing High-Throughput Sequencing	Visit Website»
pbmm2	pbmm2 is a SMRT C++ wrapper for minimap2's C API. Its purpose is to support native PacBio in- and output, provide sets of recommended parameters, generate sorted output on-the-fly, and postprocess alignments. Sorted output can be used directly for polishing using GenomicConsensus, if BAM has been used as input to pbmm2. Benchmarks show that pbmm2 outperforms BLASR in mapped concordance, number of mapped bases, … Keywords: Genome Assembly PacBio Sequencing High-Throughput Sequencing	Visit Website»
PBSIM2	PBSIM2: a simulator for long read sequencers with a novel generative model of quality scores Keywords: High-throughput sequencing	Visit Website»
pbsv	PacBio structural variant (SV) calling and analysis tools Keywords: Structural Variant Analysis High-Throughput Sequencing	Visit Website»
PCAone	Principal Component Analysis All in One Keywords: Statistical Analysis Other	Visit Website» Documentation»
PDBImages David Sehnal, Sreenath Nair, Adam Midlik, Sameer Velankar, Stephen Anyango, Mihaly Varadi, Mandar Deshpande	a command-line tool from PDBe EMBL-EBI for generating images of macromolecular structures from mmCIF or binary CIF structure files based on Mol*. Keywords: Structure Visualization & Analysis	Visit Website» Documentation» Web Forum»
Peakachu	an acronym that stands for Unveil Hi-C Anchors and Peaks, Peakachu takes genome-wide contact data as input and returns coordinates of likely interactions such as chromatin loops. Keywords: Hi-C Genomics	Visit Website» Documentation»
Peakhood	a tool that takes a set of CLIP-seq peak regions and for each region, individually extracts the most likely site context (transcript or genomic). Keywords: CLIP-Seq Analysis High-Throughput Sequencing	Visit Website»
peddy	compares familial relationships and sexes as reported in a PED/FAM file with those inferred from a VCF. Keywords: Variant Analysis Genomics	Visit Website»
PEER	a collection of Bayesian approaches to infer hidden determinants and their effects from gene expression profiles using factor analysis methods. Keywords: High-throughput sequencing Statistical Analysis	Visit Website» Documentation»
perbase	Per-base metrics on BAM/CRAM files. Keywords: High-throughput sequencing	Visit Website»
Percolator	semi-supervised learning for peptide identification from shotgun proteomics datasets. Keywords: Proteomics Proteomics	Visit Website» Documentation» Web Forum»
PFP	Tool to build the parse and the dictionary for VCF files using the approach described in Prefix-Free Parsing for Building Big BWTs Keywords: High-throughput sequencing	Visit Website»
phantompeakqualtools	Phantompeakqualtools computes informative enrichment and quality measures for ChIP-seq/DNase-seq/FAIRE-seq/MNase-seq data. It can also be used to obtain robust estimates of the predominant fragment length or characteristic tag shift values in these assays. Keywords: ChIP-Sequencing High-throughput sequencing	Visit Website»
PhaseDel	a Java-based variant caller designed for detecting somatic deletions from high-coverage (~30x) single-cell whole-genome sequencing (scWGS) data. Keywords: Variant Analysis Genomics	Visit Website»
phASER	(phasing and Allele Specific Expression from RNA-seq) performs haplotype phasing using read alignments in BAM format from both DNA and RNA based assays, and provides measures of haplotypic expression for RNA based assays. Keywords: RNA-Seq Analysis High-Throughput Sequencing Genomics	Visit Website» Documentation»
PHAST	(Phylogenetic Analysis with Space/Time models) a software package for comparative and evolutionary genomics. Keywords: Comparative Genomics Genomics	Visit Website»
PhenoGPT2	an advanced phenotype recognition model, leveraging the robust capabilities of large language models. It is an improved version of PhenoGPT (Jingye et al. 2023). It employs a fine-tuned implementation on the synthetic medical data generated by Llama 3.1 70B, MIMIC-IV deidentified clinical notes, and Human Phenotype Ontology Database, to enhance prediction accuracy and alignments. Keywords: Genomics Genotype-Phenotype Analysis Genomics	Visit Website» Documentation» Mailing List»
PHESANT	(PHEnome Scan ANalysis Tool) runs a phenome scan (pheWAS, Mendelian randomisation (MR)-pheWAS etc.) in UK Biobank. There are three components in this project: running a phenome scan in UK Biobank; post-processing of results; and PHESANT-viz, for visualising the results. Keywords: Genomics Genomics	Visit Website»
PhiSpy	Prophage finder using multiple metrics Keywords: High-throughput sequencing	Visit Website»
PhyloPhlAn	PhyloPhlAn is an integrated pipeline for large-scale phylogenetic profiling of genomes and metagenomes. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
phyluce	(phy-loo-chee) is a software package that is useful for analyzing both data collected from UCE loci and data collected from other types of loci for phylogenomic studies at the species, population, and individual levels. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
Picard Alec Wysoker, Jay Carey, Jeff Gentry, Nils Homer, Tim Fennell, Yossi Farjoun	a set of Java command line tools for manipulating high-throughput sequencing (HTS) data and formats. Keywords: ChIP-Sequencing DNA-Sequencing High-throughput sequencing RNA-Sequencing WGS Analysis	Visit Website» Documentation» Web Forum» Mailing List»
picard‑slim	A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF. Keywords: High-throughput sequencing	Visit Website»
pigz	stands for parallel implementation of gzip, and is a fully functional replacement for gzip that exploits multiple processors and multiple cores to the hilt when compressing data. Keywords: Other	Visit Website» Documentation» Mailing List»
Pillow	is the friendly PIL (Python Imaging Library) fork by Alex Clark and Contributors. The Python Imaging Library adds image processing capabilities to your Python interpreter. This library provides extensive file format support, an efficient internal representation, and fairly powerful image processing capabilities. The core image library is designed for fast access to data stored in a few basic pixel formats. It should provide a … Keywords: Other	Visit Website» Documentation»
Pilon	Pilon is a software tool which can be used to automatically improve draft assemblies and find variation among strains, including large event detection. Keywords: Genome Assembly High-throughput sequencing Variant Analysis	Visit Website»
Pindel	can detect breakpoints of large deletions, medium sized insertions, inversions, tandem duplications and other structural variants at single-base resolution from next-gen sequence data. It uses a pattern growth approach to identify the breakpoints of these variants from paired-end short reads. Keywords: DNA-Sequencing Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
Piranha	is a peak-caller for CLIP- and RIP-Seq data. It takes input in BED or BAM format and identifies regions of statistically significant read enrichment. Keywords: CLIP-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
pixelator	A command-line tool and library to process and analyze sequencing data from Molecular Pixelation (MPX) assays. Keywords: High-throughput sequencing	Visit Website»
pLannotate	is a web server for automatically annotating engineered plasmids. Keywords: Genome Annotation High-Throughput Sequencing	Visit Website»
Plant‑Seg	a tool for 3D and 2D segmentation. Keywords: Visualization Visualization	Visit Website»
PLASS	(Protein-Level ASSembler) a software to assemble short read sequencing data on a protein level. Keywords: Genome Assembly Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
plassembler	Quickly and accurately assemble plasmids in hybrid sequenced bacterial isolates Keywords: High-throughput sequencing	Visit Website»
plastid	Plastid is a Python library designed specifically for nucleotide-resolution analysis of genomics and NGS data. Keywords: High-throughput sequencing	Visit Website» Documentation»
platon	Plasmid contig classification and characterization for short read draft assemblies. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
Platypus	Platypus is a tool designed for efficient and accurate variant-detection in high-throughput sequencing data. By using local realignment of reads and local assembly it achieves both high sensitivity and high specificity. Platypus can detect SNPs, MNPs, short indels, replacements and (using the assembly option) deletions up to several kb. It has been extensively tested on whole-genome, exon-capture, and targeted capture data, it has been … Keywords: Variant Analysis Genomics	Visit Website» Documentation»
PLINK	a comprehensive update to Shaun Purcell's PLINK command-line program -- a whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses. Keywords: Association Mapping Genotype-Phenotype Analysis GWAS Analysis Genomics	Visit Website» Web Forum»
plmc Debora S Marks, John Ingraham	a tool that infers undirected graphical models to describe coevolution and covariation in families of biological sequences. With a multiple sequence alignment as an input, plmc can quantify inferred coupling strengths between all pairs of positions (couplingsfile output) or infer a generative model of the sequences for predicting the effects of mutations or designing new sequences (paramfile output). Keywords: Statistical Analysis Genomics	Visit Website» Documentation» Web Forum»
PlotHiC	PlotHiC is used to visualize whole genome-wide contact heatmaps after genome scaffolding Keywords: Genome Assembly High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
polychrom Aleksandra Galitsyna, Anton Goloborodko, Max Imakaev	a tool designed to build mechanistic models, i.e. models that simulate a biological process. Keywords: Structure Visualization & Analysis Other	Visit Website» Documentation» Web Forum»
pomoxis	Assembly, consensus, and analysis tools by ONT research Keywords: Nanopore High-Throughput Sequencing	Visit Website»
PopDel	(Population-wide Deletion Calling) fast structural deletion calling on population-scale short read paired-end germline WGS data. Keywords: Genomics Structural Variant Analysis Genomics	Visit Website» Documentation»
poppunk	(POPulation Partitioning Using Nucleotide Kmers) Calculate core and accessory distances, cluster genomes, assign new genomes to clusters, make visualisations Keywords: Genomics Metagenomic Sequencing Analysis Genomics	Visit Website» Documentation» Web Forum»
popscle	is a suite of population scale analysis tools for single-cell genomics data. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
poretools	a toolkit for working with nanopore sequencing data from Oxford Nanopore Keywords: Nanopore High-Throughput Sequencing	Visit Website»
PPanGGOLiN	a software suite used to create and manipulate prokaryotic pangenomes from a set of either genomic DNA sequences or provided genome annotations. It is designed to scale up to tens of thousands of genomes. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
PRANK	a probabilistic multiple alignment program for DNA, codon and amino-acid sequences. Keywords: Read Alignment High-Throughput Sequencing	Visit Website»
PredCRP Ming-Ju Tsai, Shinn-Ying Ho	predicts the regulatory role of CRP transcription factor in Escherichia coli. PredCRP provides an accurate method for deriving an optimised model (named PredCRP-model) and a set of four interpretable rules (named PredCRP-ruleset) for predicting and analysing the regulatory roles of CRP from sequences of CRP-binding sites. Keywords: CRISPR/Cas9 Screen Analysis Protein Structure Analysis Other	Visit Website»
Predictosaurus	a command-line tool designed for uncertainty-aware haplotype-based genomic variant effect prediction. It provides comprehensive functionality for building variant graphs, processing genomic features, and extracting peptide sequences. The tool integrates various bioinformatics processes to support efficient data analysis and visualization. Keywords: High-throughput sequencing Variant Analysis	Visit Website» Web Forum»
preseq	a tool aimed at predicting the yield of distinct reads from a genomic library from an initial sequencing experiment. The estimates can then be used to examine the utility of further sequencing, optimize the sequencing depth, or to screen multiple libraries to avoid low complexity samples. Keywords: High-throughput sequencing	Visit Website»
Presto	A bioinformatics toolkit for processing high-throughput lymphocyte receptor sequencing data. Keywords: High-throughput sequencing	Visit Website» Documentation»
Prodigal	is a fast, reliable protein-coding gene prediction tool for prokaryotic genomes. Keywords: Genome Annotation Genomics	Visit Website» Documentation»
prodigal‑gv	A fork of Prodigal meant to improve gene calling for giant viruses and viruses that use alternative genetic codes. Keywords: Genome Annotation Genomics	Visit Website»
Prokka	a software tool to annotate bacterial, archaeal and viral genomes quickly and produce standards-compliant output files. Keywords: Genome Annotation High-Throughput Sequencing	Visit Website»
ProPhyle	ProPhyle is an accurate, resource-frugal and deterministic phylogeny-based metagenomic classifier. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
ProSolo	is a variant caller for single cell data from whole genome amplification with multiple displacement amplification (MDA). It relies on a pair of samples, where one is from an MDA single cell and the other from a bulk sample of the same cell population, sequenced with any next-generation sequencing technology. Keywords: Variant Analysis Genomics	Visit Website»
Proteinortho	a tool to detect orthologous genes within different species. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
prottest3	is a bioinformatic tool for the selection of best-fit models of amino acid replacement for the data at hand. ProtTest makes this selection by finding the model in the candidate list with the smallest Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC) score or Decision Theory Criterion (DT). Keywords:	Visit Website» Web Forum»
pybedtools	pybedtools wraps and extends BEDTools and offers feature-level manipulations from within Python. Keywords: High-throughput sequencing	Visit Website» Documentation»
PyCap	an interface to the REDCap Application Programming Interface (API), PyCap is designed to be a minimal interface exposing all required and optional API parameters. Keywords: Programming Tools Other	Visit Website»
PyCharm	a dedicated Python Integrated Development Environment (IDE) providing a wide range of essential tools for Python developers, tightly integrated together to create a convenient environment for productive Python, web, and data science development. The BioGrids-supported version of PyCharm is the open source community edition. Keywords: Other	Visit Website» Documentation»
pycudadecon	provides a python wrapper and convenience functions for cudaDeconv, which is a CUDA/C++ implementation of an accelerated Richardson-Lucy deconvolution algorithm, suitable for general applications, but designed particularly for stage-scanning light sheet applications such as Lattice Light Sheet. Keywords: Image-Analysis Libraries Other	Visit Website» Documentation»
PyEnsembl	a Python interface to Ensembl reference genome metadata such as exons and transcripts. PyEnsembl downloads GTF and FASTA files from the Ensembl FTP server and loads them into a local database. PyEnsembl can also work with custom reference data specified using user-supplied GTF and FASTA files. Keywords: High-throughput sequencing	Visit Website»
pygtftk	(Python GTF toolkit) a suite providing facilities to manipulate genomic annotations in gtf format. Keywords: Genome Annotation High-Throughput Sequencing	Visit Website»
PyMC3	PyMC3 is a Python package for Bayesian statistical modeling and Probabilistic Machine Learning focusing on advanced Markov chain Monte Carlo (MCMC) and variational inference (VI) algorithms. Its flexibility and extensibility make it applicable to a large suite of problems. Keywords: Statistical Analysis Other	Visit Website» Documentation»
PyMOL Open Source Thomas Holder	open source version of the widely used molecular visualization package developed by Warren DeLano. Keywords: Visualization Visualization	Visit Website» Documentation» Web Forum» Mailing List» Webinars
pyPINTS	Explore distal transcriptional regulatory elements (TREs) identified from nascent-transcript sequencing. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
PyRQA	a tool to conduct recurrence analysis in a massively parallel manner using the OpenCL framework. Keywords: Statistical Analysis Other	Visit Website» Documentation»
Pysam Andreas Heger, Kevin Jacobs	a python module that makes it easy to read and manipulate genomic data sets. It is a lightweight wrapper of the htslib C-API; it provides facilities to read and write SAM/BAM/VCF/BCF/BED/GFF/GTF/FASTA/FASTQ files as well as access to the command line functionality of the SAMtools and BCFtools packages. Pysam is installed as a module within python. Keywords: High-throughput sequencing Python Module WGS Analysis	Visit Website» Documentation» Web Forum»
pySCENIC	is a lightning-fast python implementation of the SCENIC pipeline (Single-Cell rEgulatory Network Inference and Clustering) which enables biologists to infer transcription factors, gene regulatory networks and cell types from single-cell RNA-seq data. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Python Guido van Rossum	a general-purpose, interpreted, object oriented, high-level dynamic programming language that emphasizes code readability. Its syntax allows programmers to express concepts in fewer lines of code than in C++ or Java, thus allowing programmers to work more quickly and integrate their systems more effectively. Keywords: Other	Visit Website» Documentation» Web Forum» Mailing List»
PyTorch Adam Paszke	an open source deep learning platform that provides a seamless path from research prototyping to production deployment. Keywords: Machine Learning Other	Visit Website» Documentation» Web Forum»
pyunicorn	(Unified Complex Network and RecurreNce analysis toolbox) a fully object-oriented Python package for the advanced analysis and modeling of complex networks. Keywords: Statistical Analysis Other	Visit Website» Documentation»
QIIME 2 Greg Caporaso, Rideout Jai Ram, Rob Knight	a powerful, extensible, and decentralized microbiome analysis package with a focus on data and analysis transparency. QIIME 2 enables researchers to start an analysis with raw DNA sequence data and finish with publication-quality figures and statistical results. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
QTLtools	QTLtools is a tool set for molecular QTL discovery and analysis. Keywords: quantitative trait loci (QTLs) mapping/discovery Genomics	Visit Website»
Qualimap	a platform-independent application written in Java and R that provides both a Graphical User Interface (GUI) and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts. Keywords: High-throughput sequencing Read Alignment	Visit Website» Documentation» Web Forum»
Quartz Yun William Yu, Bonnie Berger	(QUAlity score Reduction at Terabyte scale) an efficient de novo quality score compression tool based on traversing the k-mer landscape of NGS read datasets. Keywords: DNA Sequence Data Compression High-throughput sequencing WGS Analysis	Visit Website»
QUAST Alexey Gurevich	(QUality ASsessment Tool) evaluates genome assemblies by computing various metrics, including N50, length for which the collection of all contigs of that length or longer covers at least 50% of assembly length; NG50, where length of the reference genome is being covered; NA50 and NGA50, where aligned blocks instead of contigs are taken; misassemblies, misassembled and unaligned contigs or contigs bases; and genes and … Keywords: Genome Assembly High-throughput sequencing Transcriptomics	Visit Website» Documentation» Web Forum»
QuickTree	an efficient implementation of the Neighbor-Joining algorithm. Keywords: High-throughput sequencing	Visit Website»
Quip	compresses next-generation sequencing data with extreme prejudice. Keywords: High-Throughput Sequencing	Visit Website»
QuPath	an open source software for bioimage analysis. It is often used for digital pathology applications because it offers a powerful set of tools for working with whole slide images - but it can be applied to lots of other kinds of image as well. Keywords: Image-Analysis Libraries Visualization	Visit Website» Documentation» Web Forum» Mailing List»
R Kurt Hornik, Martin Mächler, Ross Ihaka	a free software environment for statistical computing and graphics. Keywords: Other	Visit Website» Documentation» Web Forum» Mailing List»
Racon	ultrafast consensus module for raw de novo genome assembly of long uncorrected reads. Keywords: High-throughput sequencing	Visit Website»
RapMap Robert Patro	rapid, sensitive and accurate read mapping via quasi-mapping. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation»
rasusa	Randomly subsample sequencing reads to a specified coverage. Keywords: High-throughput sequencing	Visit Website» Documentation»
Raven	a de novo genome assembler for long uncorrected reads. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
RAxML	(Randomized Axelerated Maximum Likelihood) a tool for phylogenetic analysis and post-analysis of large phylogenies. Keywords: Phylogenetic Inference High-Throughput Sequencing	Visit Website»
RAxML‑NG	a phylogenetic tree inference tool which uses maximum-likelihood (ML) optimality criterion. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
razers3	faster, fully sensitive read mapping. Keywords: Read Alignment High-Throughput Sequencing	Visit Website»
RBPBench	RBPBench is a multi-function tool to evaluate CLIP-seq and other genomic region data using a comprehensive collection of known RNA-binding protein (RBP) binding motifs. Keywords: CLIP-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
rclone	a command line program to manage files on cloud storage. Keywords: Other	Visit Website» Documentation» Web Forum»
Recentrifuge	Robust comparative analysis and contamination removal for metagenomics Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
refgenie	(reference genome manager) manages storage, access, and transfer of reference genome resources. Keywords: Genomics High-Throughput Sequencing	Visit Website» Documentation»
regenie	a C++ program for whole genome regression modelling of large genome-wide association studies. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
regtools	is a set of tools that integrate DNA-seq and RNA-seq data to help interpret mutations in a regulatory and splicing context. Keywords: DNA-Sequencing High-throughput sequencing RNA-Sequencing	Visit Website» Documentation» Web Forum»
repaq	A tool to compress FASTQ files with ultra-high compression ratio and high speed. repaq supports compressing the FASTQ to .rfq or .rfq.xz formats. Compressing to .rfq is ultra fast, while compressing to .rfq.xz provides very high compression ratio. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
ReportLab	is the time-proven, ultra-robust open-source engine for creating complex, data-driven PDF documents and custom vector graphics. It's free, open-source, and written in Python. The package sees 50,000+ downloads per month, is part of standard Linux distributions, is embedded in many products, and was selected to power the print/export feature for Wikipedia. Keywords: Other	Visit Website» Documentation» Web Forum»
RFDesign Jue Wang, Doug Tischer, Sidney Lisanza, David Juergens, Joe Watson	a tool for protein hallucination and inpainting with RoseTTAFold. Keywords: Protein-Protein Interaction Prediction Protein Structure Analysis	Visit Website» Web Forum»
RGT	(Regulatory Genomics Toolbox) is an open source python library for analysis of regulatory genomics. RGT is programmed in an object-oriented fashion and its core classes provide functionality for handling regulatory genomics data. Keywords: ChIP-Sequencing DNA-Sequencing Visualization Genomics	Visit Website» Documentation» Web Forum»
RingMapper Anthony Mustoe, Nicole N. Lama, Kevin M Weeks, Patrick S. Irving, Samuel W. Olson	a code for performing RING-MaP and PAIR-MaP analysis. Keywords: RNA-Seq Analysis RNA-Sequencing Genomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
RNAblueprint	The RNAblueprint library solves the problem of stochastically sampling RNA/DNA sequences compatible to multiple structural constraints. Keywords: DNA-Sequencing RNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
rna‑map	An open-source tool for rapid analysis of RNA mutational profiling (MaP) experiments. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
RNAnorm	RNA-seq data normalization in Python. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation» Mailing List»
RNA‑SeQC	fast, efficient RNA-Seq metrics for quality control and process optimization. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
rnashapes	RNAshape abstraction maps structures to a tree-like domain of shapes. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Roary	Takes annotated assemblies in GFF3 format and calculates the pan genome. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
ROBEX	(Robust Brain Extraction) is an automatic whole-brain extraction tool for T1-weighted MRI data (commonly known as skull stripping). Keywords: Image-Analysis Libraries Visualization	Visit Website» Documentation» Mailing List»
RODEO	(Rapid ORF Description & Evaluation Online) evaluates one or many genes, characterizing a gene neighborhood based on the presence of profile hidden Markov models (pHMMs). Keywords: Genomics Genomics	Visit Website» Documentation»
RoseTTAFold David Baker, Minkyung Baek	a program that provides an accurate prediction of protein structures and interactions using a 3-track network. Keywords: Protein Structure Analysis Other	Visit Website» Documentation» Web Forum»
RoseTTAFold2 Ivan Anishchenko, Frank DiMaio, Sergey Ovchinnikov	a tool that extends the original three-track architecture of RoseTTAFold over the full network, incorporating the concepts of Frame-aligned point error, recycling during training, and the use of a distillation set from AlphaFold2. Keywords: Computational Chemistry	Visit Website» Documentation» Web Forum»
RoseTTAFold‑All‑Atom	a biomolecular structure prediction neural network that can predict a broad range of biomolecular assemblies. Keywords: Protein-Ligand Docking Protein-Protein Interaction Prediction Tertiary Structure Prediction Other	Visit Website»
RSEM Bo Li, Colin Dewey	(RNA-Seq by Expectation-Maximization) a software package for estimating gene and isoform expression levels from RNA-Seq data. Keywords: High-throughput sequencing RNA-Sequencing Transcript Quantification	Visit Website» Documentation» Web Forum» Mailing List»
RSeQC	(RNA-seq Quality Control Package) provides a number of useful modules that can comprehensively evaluate high throughput sequence data especially RNA-seq data. Keywords: Read Quality Control RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
RStudio JJ Allaire, Jonathan McPherson, Kevin Ushey	an integrated development environment (IDE) for R that includes a console, syntax-highlighting editor that supports direct code execution, as well as tools for plotting, history, debugging and workspace management. Keywords: Other	Visit Website» Documentation» Web Forum»
RTG Tools	(RealTimeGenomics Tools) utilities for accurate VCF comparison and manipulation. Keywords: Variant Analysis High-Throughput Sequencing	Visit Website»
Rust‑Bio‑Tools	a set of ultra fast and robust command line utilities for bioinformatics tasks based on Rust-Bio. Keywords: Other	Visit Website»
rustybam	is a bioinformatics toolkit written in the Rust programming language focused around manipulation of alignment (bam and PAF), annotation (bed), and sequence (fasta and fastq) files. Keywords: High-throughput sequencing	Visit Website»
Rustyread	Rustyread, a long-read simulator Keywords: High-throughput sequencing	Visit Website»
Ryuto	Network-Flow based Transcriptome Reconstruction Keywords: Transcriptomics High-Throughput Sequencing	Visit Website»
SAIGE	an R package developed with Rcpp for genome-wide association tests in large-scale data sets and biobanks. Keywords: Association Mapping GWAS Analysis High-Throughput Sequencing	Visit Website» Documentation» Mailing List»
Sailfish Carl Kingsford, Robert Patro, Stephen M Mount	enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Salmon Carl Kingsford, Robert Patro	a tool for quantifying the expression of transcripts using RNA-seq data. Salmon uses algorithms to provide very quick, accurate expression estimates using little memory and performs inference using an expressive and realistic model of RNA-seq data that takes into account experimental attributes and biases commonly observed in real RNA-seq data. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Sambamba Pjotr Prins	a high performance, highly parallel, robust and fast tool (and library), written in the D programming language, for working with SAM and BAM files. Because of its efficiency, it is an important work horse running in many sequencing centres around the world today. Keywords: High-throughput sequencing	Visit Website» Web Forum»
samblaster Greg Faust	a fast, flexible program for marking duplicates in read-id grouped paired-end SAM files. It can also optionally output discordant read pairs and/or split read mappings to separate SAM files, and/or unmapped/clipped reads to a separate FASTQ file. Keywords: High-throughput sequencing	Visit Website» Web Forum»
SAMtools Bob Handsaker, Heng Li, Petr Danecek	a suite of utilities for the SAM (Sequence Alignment/Map) format, a generic format for storing large nucleotide sequence alignments. It provides various utilities for manipulating alignments, including sorting, merging, indexing and generating alignments in a per-position format. Keywords: ChIP-Sequencing DNA-Sequencing High-throughput sequencing RNA-Sequencing WGS Analysis	Visit Website» Documentation» Web Forum» Mailing List»
sansa	Structural variant (SV) annotation. Keywords: Variant Analysis Genomics	Visit Website»
SaTScan	a free software that analyzes spatial, temporal and space-time data using the spatial, temporal, or space-time scan statistics. It is designed for any of the following interrelated purposes: Perform geographical surveillance of disease, to detect spatial or space-time disease clusters, and to see if they are statistically significant. Test whether a disease is randomly distributed over space, over time or over space and time. … Keywords: Other	Visit Website» Documentation»
scAllele	scAllele is a versatile tool to detect and analyze nucleotide variants in scRNA-seq. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Scallop	Scallop is an accurate reference-based transcript assembler. Keywords: RNA-Seq Analysis Transcriptomics High-Throughput Sequencing	Visit Website»
Scallop‑LR	reference-based transcriptome assembler for long-read RNA-seq data Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
Scalpel	a software package for detecting INDELs. Keywords: Structural Variant Analysis High-Throughput Sequencing	Visit Website»
scanpy	a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata. It includes preprocessing, visualization, clustering, trajectory inference and differential expression testing. The Python-based implementation efficiently deals with datasets of more than one million cells. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum» Mailing List»
scCODA	scCODA is a toolbox for statistical models to analyze changes in compositional data, especially from single-cell RNA-seq experiments. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
scDRS	(single-cell disease-relevance score) is a method for associating individual cells in scRNA-seq data with disease GWASs, built on top of AnnData and Scanpy. Keywords: GWAS Analysis scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
scGPT	scGPT is a foundation model for Single-Cell Multi-omics using generative AI. scGPT can be optimized to achieve superior performance across diverse downstream applications such as cell type annotation, multi-batch integration, multi-omic integration, perturbation response prediction and gene network inference Keywords: scDNA-Seq Analysis scRNA-Seq Analysis Statistical Analysis High-Throughput Sequencing	Visit Website»
scMatch	is a single-cell gene expression profile annotation tool using reference datasets. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
SCRAMBle	runs as a two-step process. First cluster_identifier is used to generate soft-clipped read cluster consensus sequences. Second, SCRAMBle-MEIs.R analyzes the cluster file for likely Mobile Element Insertions. Keywords: Genomics High-throughput sequencing Genomics	Visit Website»
scrm	a coalescent simulator for biological sequences. Keywords: High-Throughput Sequencing	Visit Website»
scVelo	is a scalable toolkit for RNA velocity analysis in single cells. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
scvi‑tools	(single-cell variational inference tools) is a package for end-to-end analysis of single-cell omics data. Keywords: Machine Learning Other	Visit Website» Documentation» Web Forum»
sdm	simple demultiplex tool for FASTQ demultiplexing and dereplication. Keywords: High-throughput sequencing	Visit Website»
SEACR	Sparse Enrichment Analysis for CUT&RUN Keywords: ChIP-Sequencing High-Throughput Sequencing	Visit Website»
seaview	a multiplatform, graphical user interface for multiple sequence alignment and molecular phylogeny. Keywords: Sequence Alignment Visualization Visualization	Visit Website»
SECAPR	Process sequence-capture FASTQ files into alignments for phylogenetic analyses. Integrates allele phasing. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website»
SECIMTools	a suite of tools for processing of metabolomics data. Keywords: Metabolic Network Analysis Proteomics	Visit Website»
segemehl	a software to map short sequencer reads to reference genomes. Keywords: Read Alignment High-Throughput Sequencing	Visit Website»
Segment Anything	(SAM) produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image. Keywords: Visualization Visualization	Visit Website»
selscan	a program to calculate EHH-based scans for positive selection in genomes. Keywords: Genome Annotation Genomics	Visit Website»
SEPP	(SATe-enabled Phylogenetic Placement) addresses the problem of phylogenetic placement of short reads into reference alignments and trees. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
seqcomplexity	calculates per-read and total sequence complexity from a FASTQ file. Keywords: Statistical Analysis High-Throughput Sequencing	Visit Website»
SeqFu	(Sequence Fastx Utilities) a general-purpose program to manipulate and parse information from FASTA/FASTQ files. Keywords: High-throughput sequencing	Visit Website» Documentation»
SeqKit	a cross-platform ultrafast comprehensive toolkit for FASTA/Q processing. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
SEQLinkage	implements a collapsed haplotype pattern (CHP) method to generate markers from sequence data for linkage analysis. Keywords: ChIP-Sequencing Genomics	Visit Website» Documentation»
seqLogo Oliver Bembom	a Bioconductor software package installed in R 3.2.2 that takes the position weight matrix of a DNA sequence motif and plots the corresponding sequence logo. Keywords: Bioconductor Packages Genome Annotation Genomics Motif Comparison Visualization Visualization	Visit Website» Documentation»
seqMINER	an integrated ChIP-seq data interpretation platform. Keywords: ChIP-Sequencing High-Throughput Sequencing	Visit Website»
SeqPrep	SeqPrep is a program to merge paired end Illumina reads that are overlapping into a single longer read. It may also just be used for its adapter trimming feature without doing any paired end overlap. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
seqtk Heng Li	a fast and lightweight tool for processing sequences in the FASTA or FASTQ format. It seamlessly parses both FASTA and FASTQ files, which can also be optionally compressed by gzip. Keywords: High-throughput sequencing	Visit Website» Web Forum»
SeqVerify	a Python-based command line tool for analysis of whole genome sequencing data for gene-editing verification. It performs insertion site detection, copy number variation (CNV) analysis through CNVPytor, bacterial contamination detection through KRAKEN2 and BRACKEN, and variant calling and filtering aided by SnpEff and SnpSift. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
Severus	Severus is a somatic structural variation (SV) caller for long reads (both PacBio and ONT). Keywords: Structural Variant Analysis High-Throughput Sequencing	Visit Website»
SHAPEIT5	a software package to estimate haplotypes in large genotype datasets (WGS and SNP array). Keywords: WGS Analysis High-Throughput Sequencing Genomics	Visit Website»
shark	Mapping-free filtering of useless RNA-Seq reads Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
shorah	Short Reads Assembly into Haplotypes (ShoRAH) program for inferring viral haplotypes from NGS data Keywords: Virus Sequence Detection High-Throughput Sequencing	Visit Website»
Sickle	a windowed adaptive trimming tool for FASTQ files using quality. Keywords: High-throughput sequencing Read Quality Control	Visit Website»
simpleaf	simpleaf is a Rust framework to make using alevin-fry even simpler. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
SimpleITK	a simplified layer built on top of ITK, intended to facilitate its use in rapid prototyping, education, and interpreted languages. Keywords: Image-Analysis Libraries Other	Visit Website» Documentation» Mailing List»
SINA	SINA aligns nucleotide sequences to match a pre-existing MSA using a graph-based alignment algorithm similar to PoA. The graph approach allows SINA to incorporate information from many reference sequences without blurring highly variable regions. While pure NAST implementations depend highly on finding a good match in the reference database, SINA is able to align sequences relatively distant to references with good quality … Keywords: High-throughput sequencing RNA-Sequencing	Visit Website» Documentation»
SKA2	SKA2 - Split k-mer analysis (version 2) uses exact matching of split k-mer sequences to align closely related sequences, typically small haploid genomes such as bacteria and viruses. Keywords: High-throughput sequencing	Visit Website» Documentation»
skDER	efficient & high-resolution dereplication of microbial genomes Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
SKESA	(Strategic Kmer Extension for Scrupulous Assemblies) a de-novo sequence read assembler for microbial genomes. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
skewer Hongshan Jiang, Shuifang Zhu	implements the bit-masked k-difference matching algorithm dedicated to the task of adapter trimming. It is specially designed for processing next-generation sequencing (NGS) paired-end sequences. Keywords: Genomics High-throughput sequencing RNA-Sequencing	Visit Website» Web Forum»
slclust	slclust is a utility that performs single-linkage clustering with the option of applying a Jaccard similarity coefficient to break weakly bound clusters into distinct clusters. Keywords: Statistical Analysis Other	Visit Website»
3D Slicer	is a free and open-source platform for analyzing and understanding medical image data. Keywords: Image-Analysis Libraries Other	Visit Website» Documentation» Web Forum»
Slingshot	an R package that provides functions for inferring continuous, branching lineage structures in low-dimensional data. Designed to model developmental trajectories in single-cell RNA sequencing data and serve as a component in an analysis pipeline after dimensionality reduction and clustering, Slingshot is flexible enough to handle arbitrarily many branching events and allows for the incorporation of prior knowledge through supervised graph construction. Keywords: scRNA-Seq Analysis Other	Visit Website» Web Forum»
slow5tools	a simple toolkit for converting (FAST5 <-> SLOW5), compressing, viewing, indexing and manipulating data in SLOW5 format. Keywords: High-throughput sequencing	Visit Website» Documentation»
smallgenomeutilities	a collection of scripts that are useful for dealing with and manipulating NGS data of small viral genomes. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
SMALT	SMALT aligns DNA sequencing reads with a reference genome. Reads from a wide range of sequencing platforms can be processed, for example Illumina, Roche-454, Ion Torrent, PacBio or ABI-Sanger. Paired reads are supported. There is no support for SOLiD reads. Keywords: Sequence Alignment Analysis Genomics	Visit Website» Documentation»
smoothxg	Local reconstruction of variation graphs using partial order alignment. Pangenome graphs built from raw sets of alignments may have complex local structures generated by common patterns of genome variation. smoothxg can be used to extract the consensus pangenome graph by applying the heaviest bundle algorithm to each chain. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
smoove	structural variant calling and genotyping with existing tools, but, smoothly. Keywords: Genotype-Phenotype Analysis Structural Variant Analysis Genomics	Visit Website»
Snakemake Johannes Köster	The Snakemake workflow management system is a tool to create reproducible and scalable data analyses. Workflows are described via a human readable, Python based language. They can be seamlessly scaled to server, cluster, grid and cloud environments, without the need to modify the workflow definition. Finally, Snakemake workflows can entail a description of required software, which will be automatically deployed to any execution environment. Keywords: Genomics High-throughput sequencing Workflow Management System Other	Visit Website» Documentation» Web Forum» Mailing List»
Sniffles	a structural variation caller using third generation sequencing. Keywords: Structural Variant Analysis Genomics	Visit Website» Documentation»
Snippy	Rapid bacterial SNP calling and core genome alignments Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
snp‑dists	converts a FASTA alignment to SNP distance matrix. Keywords: Read Alignment High-Throughput Sequencing	Visit Website»
SnpEff Pablo Cingolani	genomic variant annotation and functional effect prediction toolbox. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation»
SNP‑sites	rapidly extracts SNPs from a multi-FASTA alignment. Keywords: Variant Analysis High-Throughput Sequencing	Visit Website»
somalier	extracts informative sites, evaluates relatedness, and performs quality-control on BAM/CRAM/BCF/VCF/GVCF. Keywords: High-throughput sequencing Variant Analysis	Visit Website»
SomaticSeq	is an ensemble somatic SNV/indel caller that has the ability to use machine learning to filter out false positives from other callers. Keywords: Variant Analysis Genomics	Visit Website»
SortMeRNA	SortMeRNA is a tool for filtering, mapping and OTU-picking NGS reads in metatranscriptomic and metagenomic data. The core algorithm is based on approximate seeds and allows for fast and sensitive analyses of nucleotide sequences. The main application of SortMeRNA is filtering ribosomal RNA from metatranscriptomic data. Additional applications include OTU-picking and taxonomy assignment available through QIIME v1.9+ (http://qiime.org - v1.9.0-rc1). Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
sourmash	quickly searches, compares, and analyzes genomic and metagenomic data sets. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
SpacePHARER	(CRISPR Spacer Phage-Host pAiRs findER) a modular toolkit for sensitive phage-host interaction identification using CRISPR spacers. Keywords: CRISPR/Cas9 Screen Analysis Genomics	Visit Website» Documentation»
spaceranger	Visium Spatial Software Suite for analyzing and visualizing spatial gene and protein expression data Keywords: High-throughput sequencing	Visit Website»
SPAdes Andrey D Prjibelski, Anton Bankevich, Dmitry Antipov, Max A Alekseyev, Pavel A Pevzner, Sergey Nurk, Yana Safonova	(St. Petersburg genome assembler) a genome assembly algorithm designed for single-cell and multi-cell bacterial data sets. Keywords: High-throughput sequencing scDNA-Seq Analysis Single-Cell Assemblers	Visit Website» Documentation» Web Forum»
spaln	Map and align a set of cDNA/EST or protein sequences onto a genome Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
Spark	a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters. Keywords: Workflow Management System Other	Visit Website» Documentation»
spaTyper	computational method for finding spa types. Keywords: DNA-Sequencing High-Throughput Sequencing	Visit Website»
SpliceAI	A deep learning-based tool to identify splice variants. Restriction: SpliceAI models require a license for commercial use. See technical notes for details. Keywords: Genome Annotation High-Throughput Sequencing	Visit Website»
SpliceMap	is a de novo splice junction discovery and alignment tool. It offers high sensitivity and support for arbitrary RNA-seq read lengths. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
SpliceV	provides analysis and publication quality printing of linear and circular RNA splicing, expression and regulation. Keywords: RNA-Sequencing Visualization Visualization Genomics	Visit Website» Documentation» Web Forum»
SPM	SPM is made freely available to the [neuro]imaging community, to promote collaboration and a common analysis scheme across laboratories. The software represents the implementation of the theoretical concepts of Statistical Parametric Mapping in a complete analysis package. The SPM software is a suite of MATLAB (MathWorks) functions and subroutines with some externally compiled C routines. SPM was written to organise and interpret … Keywords: Image-Analysis Libraries Visualization	Visit Website» Documentation» Mailing List»
spoa	SIMD partial order alignment tool/library. Keywords: Read Alignment High-Throughput Sequencing	Visit Website»
Spyder	is a powerful scientific environment written in Python, for Python, and designed by and for scientists, engineers and data analysts. Keywords:	Visit Website» Documentation»
SpydrPick	Mutual information based detection of pairs of genomic loci co-evolving under a shared selective pressure Keywords: Genomics Genomics	Visit Website»
SQLite	SQLite is a C-language library that implements a small, fast, self-contained, high-reliability, full-featured, SQL database engine. Keywords: Other	Visit Website» Documentation»
SRA Toolkit SRA Toolkit Development Team	(Sequence Read Archive Toolkit) a collection of tools and libraries for using data in the INSDC Sequence Read Archives. Keywords: High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum»
srnaMapper	Mapping small RNA data to a genome. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
srprism	Short Read Alignment Tool Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
sscocaller	Haplotyping single-cell DNA sequenced gamete cells. Keywords: scDNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Stacks	a software pipeline for building loci from RAD-seq. Keywords: RADSeq High-Throughput Sequencing	Visit Website»
STAR Alexander Dobin	(Spliced Transcripts Alignment to a Reference) is an ultrafast universal RNA-seq aligner. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
staramr	Scan genome contigs against the ResFinder and PointFinder databases Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
Starcode	Starcode is a DNA sequence clustering software. Starcode clustering is based on all-pairs search within a specified Levenshtein distance (allowing insertions and deletions), followed by a clustering algorithm: Message Passing, Spheres or Connected Components. Typically, a file containing a set of DNA sequences is passed as input, jointly with the desired clustering distance and algorithm. Starcode returns the canonical sequence of the cluster, … Keywords: DNA-Sequencing High-throughput sequencing	Visit Website» Documentation»
STAR‑Fusion	a component of the Trinity Cancer Transcriptome Analysis Toolkit (CTAT), STAR-Fusion uses the STAR aligner to identify candidate fusion transcripts supported by Illumina reads. STAR-Fusion further processes the output generated by the STAR aligner to map junction reads and spanning reads to a reference annotation set. Keywords: High-throughput sequencing RNA-Seq Analysis	Visit Website» Documentation»
stark	A tool for bluntifying a bidirected de Bruijn graph by removing overlaps. Keywords: High-throughput sequencing	Visit Website»
STREAM	(Single-cell Trajectories Reconstruction, Exploration And Mapping) is an interactive computational pipeline for reconstructing complex cellular developmental trajectories from sc-qPCR, scRNA-seq or scATAC-seq data. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Strelka2 Christopher T Saunders, Sangtae Kim	a fast and accurate small variant caller optimized for analysis of germline variation in small cohorts and somatic variation in tumor/normal sample pairs. The germline caller employs an efficient tiered haplotype model to improve accuracy and provide read-backed phasing, adaptively selecting between assembly and a faster alignment-based haplotyping approach at each variant locus. The germline caller also analyzes input sequencing data using a mixture-model … Keywords: Genomics Variant Analysis Genomics	Visit Website» Documentation» Web Forum»
strike	A program to evaluate protein multiple sequence alignments using a single protein structure. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
StringTie Geo Pertea, Mihaela Pertea	a fast and highly efficient assembler of RNA-Seq alignments into potential transcripts. It uses a novel network flow algorithm as well as an optional de novo assembly step to assemble and quantitate full-length transcripts representing multiple splice variants for each gene locus. Its input can include not only the alignments of raw reads used by other transcript assemblers, but also alignments of longer sequences that … Keywords: RNA-Seq Analysis Transcriptomics Genomics High-Throughput Sequencing	Visit Website» Documentation»
strobealign	a read mapper that is typically significantly faster than other read mappers while achieving comparable or better accuracy. Keywords: Read Alignment High-Throughput Sequencing	Visit Website»
structure	Inference of population structure using multilocus genotype data Keywords: Genomics Genomics	Visit Website»
Subread	comprises a suite of software programs for processing next-gen sequencing read data including - Subread: a general-purpose read aligner which can align both genomic DNA-seq and RNA-seq reads. It can also be used to discover genomic mutations including short indels and structural variants. - Subjunc: a read aligner developed for aligning RNA-seq reads and for the detection of exon-exon junctions. Gene fusion events can … Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
SUPPA2	Fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website»
SURVIVOR	a tool set for simulating/evaluating SVs, merging and comparing SVs within and among samples, and includes various methods to reformat or summarize SVs. Keywords: Structural Variant Analysis Genomics	Visit Website» Documentation»
SvABA	Structural variation and indel analysis by assembly. Keywords: Genome Assembly High-throughput sequencing	Visit Website»
SVDSS	a method for structural variation discovery from accurate long reads (e.g., PacBio HiFi), based on the notion of sample-specific strings (SFS, or simply specific strings). Keywords: Structural Variant Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
SViCT	a computational tool for detecting structural variations from cell free DNA (cfDNA) containing low dilutions of circulating tumor DNA (ctDNA). Keywords: Variant Analysis Genomics	Visit Website»
swarm	a robust and fast clustering method for amplicon-based studies. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
SWORD	(Smith Waterman On Reduced Database) is a fast and sensitive software for protein sequence alignment. Keywords: Protein-protein sequence alignment High-Throughput Sequencing	Visit Website»
TakeABreak	tool that can detect inversion breakpoints directly from raw NGS reads, without the need of any reference genome and without de novo assembling the genomes Keywords: High-throughput sequencing	Visit Website»
tantan	tantan masks simple regions (low complexity & short-period tandem repeats) in biological sequences. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
TaxonKit	a command-line toolkit for rapid manipulation of NCBI taxonomy data. Keywords: taxonomy Other	Visit Website» Documentation» Web Forum»
TBProfiler	profiling tool for Mycobacterium tuberculosis to detect drug resistance and lineage from WGS data. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
TelSeq	a software that estimates telomere length from whole genome sequencing data (BAMs). Keywords: Genomics Genomics	Visit Website»
TensorFlow Sanjoy Das, Gunhan Gulsoy, Benoit Steiner	an open source software library for high performance numerical computation. Its flexible architecture allows easy deployment of computation across a variety of platforms (CPUs, GPUs, TPUs), and from desktops to clusters of servers to mobile and edge devices. Originally developed by researchers and engineers from the Google Brain team within Google’s AI organization, it comes with strong support for machine learning and deep learning, … Keywords: High Performance Computing Machine Learning High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum» Mailing List»
TensorFlow Federated	TensorFlow Federated (TFF) is an open-source framework for machine learning and other computations on decentralized data. Keywords: Machine Learning Other	Visit Website»
tensorQTL	a GPU-enabled QTL mapper, achieving ~200-300 fold faster cis- and trans-QTL mapping compared to CPU-based implementations. Keywords: quantitative trait loci (QTLs) mapping/discovery Genomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
terminus	enables the discovery of data-driven, robust transcript groups from RNA-seq data. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
TeX Live Berry Karl, Scarso Luigi, Preining Norbert, Kotucha Reinhard, Kroonenberg Siep, Wawrykiewicz Staszek	a comprehensive TeX distribution that provides the programs, macro packages, and fonts needed to typeset documents, for example to PDF with LaTeX. Offers an easy way to get up and running with the TeX document production system. Keywords: Other	Visit Website» Documentation» Web Forum» Mailing List»
TGS‑GapCloser	A gap-closing software tool that uses error-prone long reads generated by third-generation sequencing techniques (PacBio, Oxford Nanopore, etc.) or preassembled contigs to fill N-gaps in the genome assembly. Keywords: Genome Assembly High-Throughput Sequencing	Visit Website»
TideHunter	efficient and sensitive tandem repeat detection from noisy long reads using seed-and-chain. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
tidk	Identify and find telomeres, or telomeric repeats in a genome. Keywords: Genome Annotation High-Throughput Sequencing	Visit Website»
Tig	is an ncurses-based text-mode interface for git. Keywords:	Visit Website» Documentation» Web Forum»
tigmint	Correct misassemblies using linked or long reads Keywords: High-throughput sequencing	Visit Website»
tmux	is a terminal multiplexer: it enables a number of terminals to be created, accessed, and controlled from a single screen. tmux may be detached from a screen and continue running in the background, then later reattached. Keywords: Other	Visit Website» Web Forum»
TN93	a fast distance calculator that computes pairwise distances between aligned nucleotide sequences in sequential FASTA format using the Tamura Nei 93 distance. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website»
TOBIAS	(Transcription factor Occupancy prediction By Investigation of ATAC-seq Signal) a collection of command-line bioinformatics tools for performing footprinting analysis on ATAC-seq data. Keywords: ATAC-Seq High-Throughput Sequencing	Visit Website» Documentation»
Tombo	a suite of tools primarily for the identification of modified nucleotides from nanopore sequencing data. Keywords: Nanopore High-Throughput Sequencing	Visit Website» Documentation»
TopHat Cole Trapnell, Daehwan Kim, Steven Salzberg	a fast splice junction mapper for RNA-Seq reads that aligns RNA-Seq reads to mammalian-sized genomes using the ultra high-throughput short read aligner Bowtie, and then analyzes the mapping results to identify splice junctions between exons. Keywords: High-throughput sequencing RNA-Sequencing Spliced Read Alignment	Visit Website» Documentation» Mailing List»
ToulligQC	A post sequencing QC tool for Oxford Nanopore sequencers Keywords: Nanopore High-Throughput Sequencing	Visit Website»
TPMCalculator	quantifies mRNA abundance directly from the alignments by parsing BAM files. The input parameters are the same GTF files used to generate the alignments, and one or multiple input BAM file(s) containing either single-end or paired-end sequencing reads. The TPMCalculator output is comprised of four files per sample reporting the TPM values and raw read counts for genes, transcripts, exons and introns respectively. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Tracer	is a software package for visualising and analysing the MCMC trace files generated through Bayesian phylogenetic inference. Tracer provides kernel density estimation, multivariate visualisation, demographic trajectory reconstruction, conditional posterior distribution summary and more. Tracer v1.7.1 can read output files from MrBayes, BEAST, BEAST2, RevBayes, Migrate, LAMARC and possibly other MCMC programs from other domains. Keywords: Phylogenomics Genomics	Visit Website» Documentation» Web Forum»
TractSeg	tool for fast and accurate white matter bundle segmentation from Diffusion MRI. It can create bundle segmentations, segmentations of the endregions of bundles and Tract Orientation Maps (TOMs). Moreover, it can do tracking on the TOMs creating bundle-specific tractogram and do Tractometry analysis on those. Keywords: Image-Analysis Libraries Visualization	Visit Website»
Tracy	basecalling, alignment, assembly and deconvolution of Sanger Chromatogram trace files. Keywords: Read Quality Control Other	Visit Website»
TransDecoder	identifies candidate coding regions within transcript sequences, such as those generated by de novo RNA-Seq transcript assembly using Trinity, or constructed based on RNA-Seq alignments to the genome using Tophat and Cufflinks. Keywords: Genomics High-throughput sequencing Transcriptomics	Visit Website» Documentation» Web Forum»
Transformers	State-of-the-art Natural Language Processing for JAX, PyTorch and TensorFlow Keywords: Machine Learning Other	Visit Website»
TreeBeST	(Tree Building guided by Species Tree) is a versatile program that builds, manipulates and displays phylogenetic trees. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website»
treePL	is a phylogenetic penalized likelihood program. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website»
TreeSAPP	a functional and taxonomic annotation tool for microbial genomes and proteins. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
TreeTime	provides routines for ancestral sequence reconstruction and inference of molecular-clock phylogenies. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website»
TRF	(Tandem Repeats Finder) a program to locate and display tandem repeats in DNA sequences. Keywords: Genome Annotation High-Throughput Sequencing	Visit Website»
trimAl	is a tool for the automated removal of spurious sequences or poorly aligned regions from a multiple sequence alignment. Keywords: High-throughput sequencing	Visit Website» Documentation» Web Forum»
TrimGalore Felix Krueger	a wrapper around Cutadapt and FastQC to consistently apply adapter and quality trimming to FastQ files, with extra functionality for RRBS data. Keywords: High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
Trimmomatic Anthony M Bolger, Bjoern Usadel	a flexible read trimming tool for Illumina NGS data. Keywords: High-Throughput Sequencing	Visit Website» Documentation»
Trinity Alexie Papanicolaou, Aviv Regev, Brian Haas, Manfred Grabherr, Moran Yassour, Nir Friedman, Richard LeDuc, Robert Henschel	a software package comprised of three independent software modules (Inchworm, Chrysalis, and Butterfly) for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data. Keywords: De Novo Transcriptome Assembly High-throughput sequencing RNA-Sequencing	Visit Website» Documentation» Web Forum»
Trinotate	a comprehensive annotation suite designed for automatic functional annotation of transcriptomes, particularly de novo assembled transcriptomes, from model or non-model organisms. Keywords: High-throughput sequencing	Visit Website»
triodenovo	implements a Bayesian framework for calling de novo mutations in trios for next-generation sequencing data. Keywords: Genomics High-Throughput Sequencing	Visit Website»
tRNAscan‑SE	tRNA detection in large-scale genome sequence. Keywords: High-throughput sequencing	Visit Website»
TRUST4	a computational tool to analyze TCR and BCR sequences using unselected RNA sequencing data, profiled from fluid and solid tissues, including tumors. Keywords: High-throughput sequencing RNA-Sequencing	Visit Website» Documentation» Web Forum»
Truvari	Structural variant comparison tool for VCFs Keywords: Variant Analysis High-Throughput Sequencing	Visit Website» Documentation»
UGENE Mikhail Fursov, Olga Golosova, Konstantin Okonechnikov	a free open-source bioinformatics software for macOS and Linux. Keywords: High-throughput sequencing Metagenomic Sequencing Analysis Sequence Alignment Analysis Sequence Alignment Visualization Visualization Visualization Other	Visit Website» Documentation» Web Forum» Mailing List»
Ultraplex	an all-in-one software package for processing and demultiplexing fastq files. Keywords: High-Throughput Sequencing	Visit Website» Documentation»
umap‑learn	(UMAP) is a dimension reduction technique that can be used for visualisation similarly to t-SNE, but also for general non-linear dimension reduction Keywords: Visualization Other	Visit Website»
umis	tools for processing UMI RNA-tag data. Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
UMI‑tools	tools for dealing with Unique Molecular Identifiers (UMIs)/Random Molecular Tags (RMTs) and single cell RNA-Seq cell barcodes. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website»
Unicycler Ryan R Wick, Kathryn E. Holt, Louise M Judd, Claire L. Gorrie	an assembly pipeline for bacterial genomes. Keywords: Bioinformatics Genome Assembly RNA-Sequencing Genomics	Visit Website» Documentation» Web Forum»
UniFrac	Fast phylogenetic diversity calculations Keywords: Phylogenomics High-Throughput Sequencing	Visit Website»
unikmer	toolkit for k-mer with taxonomic information Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation»
UPIMAPI	(UniProt Id Mapping through API) a command line interface for using UniProt's API, which allows access to UniProt's ID mapping programmatically. Keywords: High-Throughput Sequencing	Visit Website»
UShER	a program that rapidly places new samples onto an existing phylogeny using maximum parsimony. It is particularly helpful in understanding the relationships of newly sequenced SARS-CoV-2 genomes with each other and with previously sequenced genomes in a global phylogeny. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website» Documentation»
vapor	is a tool for classification of Influenza samples from raw short read sequence data for downstream bioinformatics analysis. Keywords: Genome Annotation High-Throughput Sequencing	Visit Website»
varlociraptor	flexible, arbitrary-scenario, uncertainty-aware variant calling with parameter free filtration via FDR control. Keywords: Variant Analysis High-Throughput Sequencing	Visit Website» Documentation»
VarScan Daniel C Koboldt	a platform-independent mutation caller for targeted, exome, and whole-genome resequencing data generated on Illumina, SOLiD, Life/PGM, Roche/454, and similar instruments. Restriction: available to non-profit users only. See technical notes for additional information on for-profit user licensing. Keywords: Genomics	Visit Website» Documentation» Web Forum» Mailing List»
vcf2parquet	converts a VCF file to Parquet format. Keywords: High-throughput sequencing	Visit Website»
vcfanno	allows you to quickly annotate your VCF with any number of INFO fields from any number of VCFs or BED files. It uses a simple conf file to allow the user to specify the source annotation files and fields and how they will be added to the info of the query VCF. Keywords: Variant Analysis Genomics	Visit Website»
vcflib	command-line tools for manipulating VCF files. Keywords: Variant Analysis High-Throughput Sequencing	Visit Website»
VCFtools Adam Auton, Petr Danecek, Tony Marcketta	a program package designed to provide easily accessible methods for working with complex genetic variation data in the form of VCF files, such as those generated by the 1000 Genomes Project. Keywords: High-throughput sequencing Variant Aggregation/Summarization WGS Analysis	Visit Website» Documentation» Web Forum»
velocyto	a library for the analysis of RNA velocity. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
Velvet Daniel Zerbino, Ewan Birney	a sequence assembler for very short reads. Keywords: De Novo Sequencing Analysis DNA-Sequencing Genome Assembly High-throughput sequencing	Visit Website» Documentation» Web Forum»
vembrane	simultaneously filters variants based on any INFO field, CHROM, POS, REF, ALT, QUAL, and the annotation field ANN. Keywords: Variant Analysis Genomics	Visit Website»
VerifyBamID2	a robust tool for DNA contamination estimation from sequence reads using an ancestry-agnostic method. Keywords: Genomics High-Throughput Sequencing	Visit Website»
VERSE	a versatile and efficient RNA-Seq read counting tool Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
VeryFastTree	a new tool designed for efficient phylogenetic tree inference, specifically tailored to handle massive taxonomic datasets. It is a highly-tuned implementation based on the FastTree-2 tool that takes advantage of parallelization and vectorization strategies to speed up the inference of phylogenies for huge alignments. Keywords: Phylogenomics High-Throughput Sequencing	Visit Website»
vg	Variation graphs provide a succinct encoding of the sequences of many genomes. A variation graph (in particular as implemented in vg) is composed of: nodes, which are labeled by sequences and ids; edges, which connect two nodes via either of their respective ends; and paths, which describe genomes, sequence alignments, and annotations (such as gene models and transcripts) as walks through nodes connected … Keywords: Genomics Variant Analysis	Visit Website» Documentation» Web Forum»
ViennaRNA	Vienna RNA package -- RNA secondary structure prediction and comparison Keywords: RNA-Seq Analysis High-Throughput Sequencing	Visit Website»
vireosnp	Vireo: Variational Inference for Reconstructing Ensemble Origin by expressed SNPs in multiplexed scRNA-seq data. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
VIRULIGN	a tool for codon-correct pairwise alignments, with augmented functionality to annotate the alignment according to the positions of the proteins. Keywords: Sequence Alignment Analysis High-Throughput Sequencing	Visit Website» Documentation»
Vmatch	a versatile software tool for efficiently solving large scale sequence matching tasks. Vmatch subsumes the software tool REPuter, but is much more general, with a very flexible user interface, and improved space and time requirements. Keywords: Sequence Alignment Analysis Genomics	Visit Website» Documentation»
VSEARCH Torbjørn Rognes	an alternative to the USEARCH tool developed by Robert C. Edgar (2010) for which the source code is not publicly available, VSEARCH is an open source, multithreaded 64-bit tool for processing and preparing metagenomics, genomics, and population genomics nucleotide sequence data. It supports de novo and reference based chimera detection, clustering, full-length and prefix dereplication, rereplication, reverse complementation, masking, all-vs-all pairwise global alignment, exact … Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
vt	A tool set for short variant discovery in genetic sequence data. Keywords: Variant Analysis Genomics	Visit Website» Documentation»
vtk Will Schroeder	(Visualization Tool Kit) a C++ class library and several interpreted interface layers including Tcl/Tk, Java, and Python. VTK supports a wide variety of visualization algorithms including scalar, vector, tensor, texture, and volumetric methods, as well as advanced modeling techniques such as implicit modeling, polygon reduction, mesh smoothing, cutting, contouring, and Delaunay triangulation. Keywords: Bioimaging Image-Analysis Libraries Other	Visit Website»
Wally	Plotting of aligned sequencing reads, assembled contigs or pan-genome graphs in BAM/CRAM/GFA format and visualization of genomic variants. Keywords: High-throughput sequencing Visualization	Visit Website»
WASPQTL	WASP is a suite of tools for unbiased allele-specific read mapping and discovery of molecular QTLs. Keywords: quantitative trait loci (QTLs) mapping/discovery Read Alignment Genomics High-Throughput Sequencing	Visit Website» Documentation»
WebLogo Gary Hon, Gavin E Crooks, John-Marc Chandonia, Steven Brenner	a set of command line tools for sequence logo generation. Keywords: Comparative Genomics Genomics Sequence Logo Generation Other Visualization	Visit Website» Documentation» Web Forum»
WhatsHap	software for phasing genomic variants using DNA sequencing reads, also called read-based phasing or haplotype assembly. Keywords: Genomics	Visit Website»
whitematteranalysis	WhiteMatterAnalysis (WMA) provides fiber clustering and tractography analysis tools.	Visit Website»
WiggleTools	The WiggleTools package allows genomewide data files to be manipulated as numerical functions, equipped with all the standard functional analysis operators (sum, product, product by a scalar, comparators), and derived statistics (mean, median, variance, stddev, t-test, Wilcoxon's rank sum test, etc). Keywords: Genomics High-throughput sequencing	Visit Website» Documentation»
Winnowmap	Winnowmap is a long-read mapping algorithm optimized for mapping ONT and PacBio reads to repetitive reference sequences. Keywords: PacBio Sequencing High-Throughput Sequencing	Visit Website»
Connectome Workbench	an open source, freely available visualization and discovery tool used to map neuroimaging data, especially data generated by the Human Connectome Project. The distribution includes wb_view, a GUI-based visualization platform, and wb_command, a command-line program for performing a variety of algorithmic tasks using volume, surface, and grayordinate data. Keywords: Image-Analysis Libraries MRI Analysis Neuroimaging Other	Visit Website» Documentation»
wot	a software package for analyzing snapshots of developmental processes in scRNA-seq data. Keywords: scRNA-Seq Analysis High-Throughput Sequencing	Visit Website» Documentation»
xAtlas	xAtlas is a fast and retrainable small variant caller that has been developed at the Baylor College of Medicine Human Genome Sequencing Center. Keywords: Variant Analysis High-Throughput Sequencing	Visit Website»
Xenium Ranger	provides flexible off-instrument reanalysis of Xenium In Situ data, including relabeling transcripts, resegmenting cells, importing custom segmentation data, and renaming datasets. Keywords: RNA-Seq Analysis RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation»
XNAT	XNAT client tools comprise a number of command line tools to store and retrieve data from XNAT archives: ArcGet retrieves image data; ArcRead retrieves summary text documents describing imaging data; ArcSim retrieves a list of imaging sessions with similar IDs; StoreXML writes XML documents to the archive. Keywords: Other	Visit Website»
xPore	is a Python package for identification and quantification of differential RNA modifications from direct RNA sequencing Keywords: RNA-Sequencing High-Throughput Sequencing	Visit Website» Documentation»
yacrd	a simple, easy-to-use long-read error-correction tool that can detect and remove chimeras. Keywords: High-throughput sequencing Read Quality Control	Visit Website» Web Forum»
YaHS	YaHS, yet another Hi-C scaffolding tool Keywords: Hi-C High-Throughput Sequencing	Visit Website»
YAP Michael Derby, Michael Steeves, Robin Ge, Steven Litster, Tripti Kulkarni, Varun Shivashankar, Vibhas Aravamuthan	an extensible parallel framework, written in Python using OpenMPI libraries, that allows researchers to quickly build high-throughput big data pipelines without extensive knowledge of parallel programming. Keywords: Python Module Other	Visit Website» Documentation» Web Forum»
Zerone	discretizes several ChIP-seq replicates simultaneously and resolves conflicts between them. After the job is done, Zerone checks the results and tells you whether it passes the quality control. Keywords: ChIP-Sequencing High-Throughput Sequencing Genomics	Visit Website» Documentation»