Supported Applications

Search software:
Search software:
Filter by keywords:
- available keywords
  - All
  - High-Throughput Sequencing
  - Genomics
  - Proteomics
  - Visualization
  - Other
  - Alternative Splicing
  - Association Mapping
  - ATAC-Seq
  - Bioconductor Packages
  - Bioimaging
  - Bioinformatics
  - Bioinformatics Infrastructure
  - bisulfite-Seq
  - Cell Tracking
  - ChIP-Sequencing
  - CLIP-Seq Analysis
  - Comparative Genomics
  - Complex Trait Prediction
  - Computational Chemistry
  - CRISPR/Cas9 Screen Analysis
  - De Novo Sequencing Analysis
  - De Novo Transcriptome Assembly
  - DNA Sequence Data Compression
  - DNA-Sequencing
  - Electron Microscopy
  - Epigenomics
  - Figure Creation
  - Genome Annotation
  - Genome Assembly
  - Genome Visualization
  - Genomics
  - Genotype-Phenotype Analysis
  - Germline SNP Detection
  - GWAS Analysis
  - Hi-C
  - HiChIP
  - High Performance Computing
  - High-throughput sequencing
  - High-Tpeak Calling
  - Homology-Based Taxonomic Classification
  - Image-Analysis Libraries
  - Machine Learning
  - Metabolic Network Analysis
  - Metagenomic Sequencing Analysis
  - Motif Comparison
  - Motif Discovery
  - MRI Analysis
  - Multiple Nucleotide Sequence Alignment
  - Multiple Structure Alignment
  - Nanopore
  - Neuroimaging
  - Normalization/Differential Expression
  - Nucleic Acids
  - Nucleotide Sequence Homology Search
  - Other
  - PacBio Sequencing
  - PCR
  - Phylogenetic Inference
  - Phylogenomics
  - Pipelines
  - PLAC-seq
  - Programming Tools
  - Protein Database Search
  - Protein-Ligand Docking
  - Protein-Protein Interaction Prediction
  - Protein-protein sequence alignment
  - Protein Structure Analysis
  - Proteomics
  - Python Module
  - quantitative trait loci (QTLs) mapping/discovery
  - RADSeq
  - Read Alignment
  - Read Quality Control
  - RNA-Seq Analysis
  - RNA-Sequencing
  - scDNA-Seq Analysis
  - scRNA-Seq Analysis
  - Sequence Alignment Analysis
  - Sequence Alignment Visualization
  - Sequence Logo Generation
  - Single-Cell Assemblers
  - Spliced Read Alignment
  - Statistical Analysis
  - Structural Biology
  - Structural Variant Analysis
  - Structure Visualization & Analysis
  - Target Gene Detection
  - taxonomy
  - Tertiary Structure Prediction
  - Transcriptomics
  - Transcript Quantification
  - Variant Aggregation/Summarization
  - Variant Analysis
  - Virus Sequence Detection
  - Visualization
  - WGS Analysis
  - Workflow Management System
Filter by OS:
- available OS
  - macOS
  - Linux
Filter by member or license type:
- available types
  - Member type
  - Academic
  - Beamline
  - Government
  - Industry
  - Non-profit
  - All
  - License type
  - Commercial
  - Open
  - Registration required
  - All

AppCiter will help you create a bibliography of the programs you wish to cite.

AppCiter Programs:

No programs selected

Clear All

Continue to Step 2

Results:

Name	Description	Links
A5 David Coil, Aaron E Darling, Guillaume Jospin	A5-miseq is a pipeline for assembling DNA sequence data generated on the Illumina sequencing platform. A5-miseq can produce high-quality microbial genome assemblies on a laptop computer without any parameter tuning by automating the process of adapter trimming, quality filtering, error correction, contig and scaffold generation and detection of misassemblies. Keywords: Genome Assembly High-throughput sequencing Metagenomic Sequencing Analysis Genomics	Visit Website» Web Forum»
AKT Rudy Arthur, Jared O’Connell	(Ancestry and Kinship Toolkit) a statistical genetics tool for analysing large cohorts of whole-genome sequenced samples. It provides a handful of useful statistical genetics routines using the htslib API for input/output. This means it can seamlessly read BCF/VCF files and play nicely with bcftools. Keywords: Genomics Statistical Analysis High-Throughput Sequencing Genomics	Visit Website» Documentation»
andi	estimates the evolutionary distance between closely related genomes. These distances can be used to rapidly infer phylogenies for big sets of genomes. Because andi does not compute full alignments, it is so efficient that it scales even up to thousands of bacterial genomes. Keywords: Genomics Genomics	Visit Website» Documentation»
ASCIIGenome	a command-line genome browser running from terminal window and solely based on ASCII characters. Keywords: Genome Visualization Genomics	Visit Website» Documentation» Web Forum»
AUGUSTUS	a gene prediction program for eukaryotes that can be used as an ab initio program, which means it bases its prediction purely on the sequence. Keywords: Genome Annotation Genomics	Visit Website»
Balrog	A universal protein model for prokaryotic gene prediction Keywords: Genome Annotation High-Throughput Sequencing Genomics	Visit Website»
Barrnap	(BAsic Rapid Ribosomal RNA Predictor) predicts the location of ribosomal RNA genes in genomes (bacteria, archaea, metazoan mitochondria and eukaryotes). Keywords: Genomics Genomics	Visit Website»
bcbio‑prioritize	Prioritize small variants, structural variants and coverage based on biological inputs. The goal is to use pre-existing knowledge of relevant genes, domains and pathways involved with a disease to extract the most interesting signal from a set of high quality small or structural variant calls. Given information on coverage, it will be able to identify poorly covered regions in potential genes of interest. Keywords: Genomics Variant Analysis Genomics	Visit Website»
bcbio‑variation	bcbio-variation is a toolkit to analyze genome variation data, built on top of the Genome Analysis Toolkit (GATK) with Clojure. It supports scoring for the Archon Genomics X PRIZE competition and is also a general framework for variant file comparison. It enables validation of variants and exploration of algorithm differences between calling methods by automating the process involved with comparing two sets of variants. … Keywords: Genomics Variant Analysis Genomics	Visit Website»
bcbio‑variation‑recall	Parallel merging, squaring off and ensemble calling for genomic variants. Provide a general framework meant to combine multiple variant calls, either from single individuals, batched family calls, or multiple approaches on the same sample. Splits inputs based on shared genomic regions without variants, allowing independent processing of smaller regions with variant calls. Keywords: Variant Analysis Genomics	Visit Website»
BEAGLE	is a software package for phasing genotypes and imputing ungenotyped markers. Keywords: Genotype-Phenotype Analysis Genomics	Visit Website»
BEAST	is a cross-platform program for Bayesian analysis of molecular sequences using MCMC. Keywords: Sequence Alignment Analysis Genomics	Visit Website»
BEDOPS	BEDOPS is an open-source command-line toolkit that performs highly efficient and scalable Boolean and other set operations, statistical calculations, archiving, conversion and other management of genomic data of arbitrary scale. Tasks can be easily split by chromosome for distributing whole-genome analyses across a computational cluster. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum» Mailing List»
bedtools Aaron R Quinlan	a swiss-army knife of tools for a wide-range of genomics analysis tasks. The most widely-used tools enable genome arithmetic. Bedtools allows one to intersect, merge, count, complement, and shuffle genomic intervals from multiple files in widely-used genomic file formats such as BAM, BED, GFF, VCF. While each individual tool is designed to do a relatively simple task (e.g., intersect two interval files), sophisticated analyses … Keywords: High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum» Mailing List»
bioawk Aaron R Quinlan, Heng Li	an extension to Brian Kernighan's awk, with added support for several common biological data formats, including optionally gzip'ed BED, GFF, SAM, VCF, FASTA/Q, and TAB-delimited formats with column names along with new built-in functions and a command line option to use TAB as the input/output delimiter. When the new functionality is not used, bioawk should behave exactly like the original BWK awk. Keywords: High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum»
Bismark Felix Krueger	a set of tools for the time-efficient analysis of Bisulfite-Seq (BS-Seq) data. Bismark performs alignments of bisulfite-treated reads to a reference genome and cytosine methylation calls at the same time. Keywords: bisulfite-Seq Genomics	Visit Website» Documentation» Web Forum»
BreakDancer Ken Chen	a Perl/Cpp package that provides genome-wide detection of structural variants from next generation paired-end sequencing reads. It includes two complementary programs. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Web Forum»
BVATools Louis Letourneau, Mathieu Bourgey	Bam and Variant Analysis Tools Keywords: Genomics High-throughput sequencing Variant Aggregation/Summarization Genomics	Visit Website»
Canvas	a tool for calling copy number variants (CNVs) from human DNA sequencing data. Keywords: Variant Analysis Genomics	Visit Website» Documentation»
CAVIAR	CAVIAR (CAusal Variants Identication in Associated Regions): a statistical framework that quantifies the probability of each variant to be causal while allowing with arbitrary number of causal variants. Keywords: Statistical Analysis Variant Analysis Genomics High-Throughput Sequencing	Visit Website» Documentation» Mailing List»
CEFCIG	CEFCIG (Computational Epigenetic Framework for Cell Identity Gene Discovery) Keywords: Epigenomics Genomics	Visit Website»
Cell Ranger	a set of analysis pipelines that process Chromium single-cell RNA-seq output to align reads, generate feature-barcode matrices and perform clustering and gene expression analysis. Keywords: scRNA-Seq Analysis Genomics	Visit Website» Documentation»
cellsnp‑lite	Efficient genotyping bi-allelic SNPs on single cells Keywords: scRNA-Seq Analysis Genomics	Visit Website»
cellxgene	an interactive explorer for single-cell transcriptomics data Keywords: scDNA-Seq Analysis scRNA-Seq Analysis Genomics	Visit Website»
Centrifuge	is a very rapid and memory-efficient system for the classification of DNA sequences from microbial samples, with better sensitivity than and comparable accuracy to other leading systems. The system uses a novel indexing scheme based on the Burrows-Wheeler transform (BWT) and the Ferragina-Manzini (FM) index, optimized specifically for the metagenomic classification problem. Centrifuge requires a relatively small index (e.g., 4.3 GB for ~4,100 bacterial … Keywords: Metagenomic Sequencing Analysis Genomics High-Throughput Sequencing	Visit Website» Documentation»
chewBBACA	A complete suite for gene-by-gene schema creation and strain identification. Keywords: Genome Annotation Genomics	Visit Website»
CHISEL	Copy-number Haplotype Inference in Single-cell by Evolutionary Links CHISEL is an algorithm to infer allele- and haplotype-specific copy numbers in individual cells from low-coverage single-cell DNA sequencing data (e.g., those generated by Direct Library Preparation+ (DLP+), 10x Genomics CNV Solution, DOP-PCR, etc.). Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Mailing List»
Chromap	is an ultrafast method for aligning and preprocessing high throughput chromatin profiles. Keywords: High-throughput sequencing Genomics	Visit Website»
CNVkit	a command-line toolkit and Python library for detecting copy number variants and alterations genome-wide from high-throughput sequencing. Keywords: Genomics Genomics	Visit Website» Documentation» Web Forum»
CoNIFER	uses exome sequencing data to find copy number variants (CNVs) and genotype the copy-number of duplicated genes. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation»
Control‑FREEC	Copy number and genotype annotation from whole genome and whole exome sequencing data. Keywords: Genomics Genomics	Visit Website» Documentation»
CRISPRCasFinder Christine Pourcel, David Couvin	a tool that enables the easy detection of CRISPRs and cas genes in user-submitted sequence data (allows sequences up to 50 Mo otherwise download standalone program). This is an update of the CRISPRFinder program with improved specificity and indication on the CRISPR orientation. MacSyFinder is used to identify cas genes, the CRISPR-Cas type and subtype. Keywords: CRISPR/Cas9 Screen Analysis Genomics Genomics Other	Visit Website» Documentation» Web Forum»
CrossMap	a program for genome coordinates conversion between different genome assemblies. Keywords: Genome Assembly Genomics	Visit Website»
DANPOS3	a toolkit for Dynamic Analysis of Nucleosome and Protein Occupancy by Sequencing. Keywords: High-throughput sequencing Nucleic Acids Genomics	Visit Website» Documentation» Web Forum»
dEploid	deconvolutes mixed genomes with unknown proportions. Keywords: Genomics Genomics	Visit Website»
DFAST	is a flexible and customizable pipeline for prokaryotic genome annotation as well as data submission to the INSDC. Keywords: Genome Annotation Genomics	Visit Website»
dicey	In-silico PCR and variant primer design Keywords: PCR High-Throughput Sequencing Genomics	Visit Website»
DWGSIM Nils Homer	a whole genome simulator for next-generation sequencing based off of wgsim found in SAMtools, which was written by Heng Li, and forked from DNAA. It was modified to handle ABI SOLiD and Ion Torrent data, as well as various assumptions about aligners and positions of indels. Many new features have been subsequently added. Keywords: Genomics High-throughput sequencing Genomics	Visit Website»
Eagle	estimates haplotype phase either within a genotyped cohort or using a phased reference panel. Eagle2 is now the default phasing method used by the Sanger and Michigan imputation servers and uses a new, very fast HMM-based algorithm that improves speed and accuracy over existing methods via two key ideas: a new data structure based on the positional Burrows-Wheeler transform and a rapid search algorithm … Keywords: Genomics Genomics	Visit Website» Documentation»
EIGENSOFT	The EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and our EIGENSTRAT stratification correction method (Price et al. 2006). Keywords: Genomics Genomics	Visit Website» Web Forum»
ensembl_vep	predicts the functional effects of genomic variants Keywords: Genomics Genomics	Visit Website» Documentation»
Exomiser Damian Smedley, Peter N Robinson, Sebastian Köhler	a Java program that finds potential disease-causing variants from whole-exome or whole-genome sequencing data. Starting from a VCF file and a set of phenotypes encoded using the Human Phenotype Ontology (HPO), it will annotate, filter and prioritize likely causative variants based on user-defined criteria such as a variant's predicted pathogenicity, frequency of occurrence in a population and also how closely the given phenotype matches … Keywords: Genome Annotation Genomics Genotype-Phenotype Analysis Genomics	Visit Website» Documentation» Web Forum»
FastANI	developed for fast alignment-free computation of whole-genome Average Nucleotide Identity (ANI). Keywords: Genomics Genomics	Visit Website»
FastTree Morgan N Price	infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences. FastTree can handle alignments with up to a million sequences in a reasonable amount of time and memory. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing Genomics	Visit Website» Documentation»
FlashPCA	performs fast principal component analysis (PCA) of single nucleotide polymorphism (SNP) data, similar to smartpca from EIGENSOFT (http://www.hsph.harvard.edu/alkes-price/software/) and shellfish (https://github.com/dandavison/shellfish). FlashPCA is based on the https://github.com/yixuan/spectra/ library. Keywords: Variant Analysis Genomics	Visit Website»
FusionCatcher	finds somatic fusion-genes in RNA-seq data. Keywords: Genomics Genomics	Visit Website»
GangSTR Nima Mousavi	a tool for genome-wide profiling tandem repeats from short reads. A key advantage of GangSTR over existing tools (e.g. lobSTR or hipSTR) is that it can handle repeats that are longer than the read length. GangSTR takes aligned reads (BAM) and a set of repeats in the reference genome as input and outputs a VCF file containing genotypes for each locus. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
gappa	Genesis Applications for Phylogenetic Placement Analysis Keywords: Phylogenomics Genomics	Visit Website»
GCTA Jian Yang, Peter Visscher, Mike Goddard, Andrew Bakshi	(Genome-wide Complex Trait Analysis) a tool for genome-wide complex trait analysis with five main functions: data management, estimation of the genetic relationships from SNPs, mixed linear model analysis of variance explained by the SNPs, estimation of the linkage disequilibrium structure, and GWAS simulation. GCTA estimates the variance explained by all the SNPs on a chromosome or on the whole genome for a complex trait … Keywords: Complex Trait Prediction Genotype-Phenotype Analysis GWAS Analysis Genomics	Visit Website» Documentation» Web Forum»
GenomeBrowse	a free tool offered by Golden Helix that delivers stunning visualizations of your genomic data, enabling you to see what is occurring at each base pair in your samples. Keywords: Genome Visualization Genomics Visualization	Visit Website» Documentation» Web Forum»
Genrich	a peak-caller for genomic enrichment assays (e.g. ChIP-seq, ATAC-seq). Keywords: ATAC-Seq ChIP-Sequencing Genomics	Visit Website» Documentation»
GOR	a tool based on a genomic ordered relational architecture and allows analysis of large sets of genomic and phenotypic tabular data using a declarative query language, in a parallel execution engine. It is very efficient in a wide range of use-cases, including genome wide batch analysis, range-queries, genomic table joins of variants and segments, filtering, aggregation etc. Keywords: Genomics Genomics	Visit Website» Documentation»
GRIDSS	a module software suite containing tools useful for the detection of genomic rearrangements. GRIDSS includes a genome-wide break-end assembler, as well as a structural variation caller for Illumina sequencing data. GRIDSS calls variants based on alignment-guided positional de Bruijn graph genome-wide break-end assembly, split read, and read pair evidence. Keywords: Genomics Sequence Alignment Analysis Structural Variant Analysis Genomics	Visit Website» Documentation»
GSAlign	an ultra-fast sequence alignment algorithm for intra-species genome comparison. Keywords: Genomics Sequence Alignment Analysis Genomics	Visit Website»
GSEApy	a Python/Rust implementation for GSEA and wrapper for Enrichr. GSEApy can be used for RNA-seq, ChIP-seq, Microarray data. It can be used for convenient GO enrichment and to produce publication quality figures in python. Keywords: Genotype-Phenotype Analysis GWAS Analysis High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum»
GSearch	an ultra-fast and scalable microbial genome search program based on MinHash-like metric and graph-based approximate nearest neighbor search Keywords: Genomics Genomics	Visit Website»
gsort	a tool to sort genomic files according to a genomefile. Keywords: High-throughput sequencing Genomics	Visit Website»
GToTree	is a user-friendly workflow for phylogenomics intended to give more researchers the capability to create phylogenomic trees. Keywords: Phylogenomics Genomics	Visit Website» Documentation»
Hail	an open-source, general-purpose, Python-based data analysis library with additional data types and methods for working with genomic data. Keywords: Comparative Genomics GWAS Analysis High-throughput sequencing Genomics	Visit Website» Documentation»
hapLOHseq Paul Scheet, Anthony San Lucas	Developed for the detection of subtle allelic imbalance events from next-generation sequencing data, hapLOHseq is a sequencing-based extension of hapLOH, which is a method for the detection of subtle allelic imbalance events from SNP array data. It is capable of identifying events of 10 mega-bases or greater occurring in as little as 16% of the sample using exome sequencing data (at 80x) and 4% … Keywords: Genomics High-throughput sequencing Variant Analysis Genomics	Visit Website» Documentation»
hatchet	(Holistic Allele-specific Tumor Copy-number Heterogeneity) is an algorithm that infers allele and clone-specific CNAs and WGDs jointly across multiple tumor samples from the same patient, and that leverages the relationships between clones in these samples. Keywords: Genomics Genomics	Visit Website»
hic_breakfinder	a framework that integrates optical mapping, high-throughput chromosome conformation capture (Hi-C), and whole genome sequencing to systematically detect SVs in a variety of normal or cancer samples and cell lines. Keywords: Structural Variant Analysis Genomics	Visit Website» Documentation»
hictk	Blazing fast toolkit to work with .hic and .cool files Keywords: Hi-C Genomics	Visit Website» Documentation»
HipSTR Thomas Willems	(Haplotype inference and phasing for Short Tandem Repeats) a novel haplotype-based method for robustly genotyping and phasing STRs from Illumina sequencing data. HipSTR was specifically developed to deal with short tandem repeats (STRs) in genomic sequences in the hopes of obtaining more robust STR genotypes. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
Hopla	Hopla enables classic genomic single, duo, trio, etc., analysis, by studying a single (multisample) vcf-file, eventually generating interactive visualizations. Keywords: Genomics Variant Analysis Genomics	Visit Website»
HyPhy	(Hypothesis Testing using Phylogenies) an open-source software package for comparative sequence analysis using stochastic evolutionary models. Keywords: Genomics Genomics	Visit Website»
IDR	The IDR (Irreproducible Discovery Rate) framework is a uniﬁed approach to measure the reproducibility of ﬁndings identiﬁed from replicate experiments and provide highly stable thresholds based on reproducibility. Keywords: Statistical Analysis Genomics	Visit Website» Documentation»
Infernal Sean R Eddy, Nawrocki P Eric	(INFERence of RNA ALignment) a program that searches DNA sequence databases for RNA structure and sequence similarities and uses a special case of profile stochastic context-free grammars called covariance models (CMs). In many cases It is more capable of identifying RNA homologs that conserve their secondary structure more than their primary sequence. Keywords: Comparative Genomics Genomics Multiple Nucleotide Sequence Alignment Genomics	Visit Website» Documentation» Web Forum»
intervene	is a tool for intersection and visualization of multiple genomic region and gene sets (or lists of items). Keywords: Genomics Genomics	Visit Website» Documentation»
JBrowse2	a new kind of genome browser that runs on your desktop. Keywords: Visualization High-Throughput Sequencing Visualization Genomics	Visit Website» Documentation» Web Forum» Mailing List»
Jellyfish Marçais Guillaume, Carl Kingsford	a tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers quickly by using an efficient encoding of a hash table and by exploiting the "compare-and-swap" CPU instruction to increase parallelism. Keywords: High-throughput sequencing Genomics	Visit Website» Documentation»
Kleborate	Kleborate: a tool for typing and screening pathogen genome assemblies Keywords: Genomics High-throughput sequencing Metagenomic Sequencing Analysis Genomics	Visit Website»
LDAK	a powerful and computationally efficient method for mixed-model association analysis in genome-wide association studies (GWAS). It is part of the LDAK software, which is written in C. Keywords: GWAS Analysis Genomics	Visit Website» Documentation» Mailing List»
LDSC	a command line tool for estimating heritability and genetic correlation from GWAS summary statistics. ldsc also computes LD Scores. Keywords: Genomics GWAS Analysis Genomics	Visit Website» Documentation»
LiftoffTools	is a toolkit to compare genes lifted between genome assemblies. Keywords: Genome Annotation Genomics	Visit Website»
LoFreq	is a fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data. It makes full use of base-call qualities and other sources of errors inherent in sequencing (e.g. mapping or base/indel alignment uncertainty), which are usually ignored by other methods or only used for filtering. Keywords: Genomics High-throughput sequencing Variant Analysis Genomics	Visit Website» Documentation»
lorax	A long-read analysis toolbox for cancer genomics. Keywords: Genomics High-throughput sequencing Genomics	Visit Website»
LUMPY Ryan Layer, Ira M Hall, Colby Chiang	a probabilistic framework for structural variant discovery. Keywords: Genomics Genomics	Visit Website» Documentation» Web Forum»
Macrel	(Meta)genomic AMP Classification and Retrieval Pipeline to mine antimicrobial peptides (AMPs) from (meta)genomes. Keywords: Metagenomic Sequencing Analysis High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum»
MacSyFinder	a program to model and detect macromolecular systems, genetic pathways in protein datasets. In prokaryotes, these systems have often evolutionarily conserved properties: they are made of conserved components and are encoded in compact loci (conserved genetic architecture). The user models these systems with MacSyFinder to reflect these conserved features and to allow their efficient detection. Keywords: CRISPR/Cas9 Screen Analysis Genomics Genomics	Visit Website» Documentation» Web Forum»
MAGeCK	(Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout) a computational tool to identify important genes from the recent genome-scale CRISPR-Cas9 knockout screens (or GeCKO) technology. Keywords: CRISPR/Cas9 Screen Analysis Genomics	Visit Website» Documentation» Web Forum»
MAGeCK‑VISPR ‑	a comprehensive quality control, analysis and visualization workflow for CRISPR/Cas9 screens. Keywords: CRISPR/Cas9 Screen Analysis Genomics	Visit Website» Web Forum»
Manta Christopher T Saunders	calls structural variants (SVs) and indels from mapped paired-end sequencing reads. Manta is optimized for analysis of germline variation in small sets of individuals and somatic variation in tumor/normal sample pairs. It discovers, assembles, and scores large-scale SVs, medium-sized indels and large insertions within a single efficient workflow. The method is designed for rapid analysis on standard compute hardware: NA12878 at 50x genomic coverage … Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
MarViN Rudy Arthur	a method for rapid genotype refinement for whole-genome sequencing data using multi-variate normal distribution. Whole-genome low-coverage sequencing has been combined with linkage-disequilibrium (LD) based genotype refinement to accurately and cost-effectively infer genotypes in large cohorts of individuals. Keywords: Genomics Genotype-Phenotype Analysis High-throughput sequencing Genomics	Visit Website»
merlin	uses sparse trees to represent gene flow in pedigrees and is a fast pedigree analysis package. Keywords: Genomics Genomics	Visit Website»
mhcflurry	MHC I ligand prediction package with competitive accuracy and a fast and documented implementation. Keywords: Genomics Motif Discovery Genomics	Visit Website»
MinCED	MinCED is a program to find Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) in full genomes or environmental datasets such as assembled contigs from metagenomes. Keywords: CRISPR/Cas9 Screen Analysis Genomics	Visit Website»
Minimac4	a lower memory and more computationally efficient implementation of the genotype imputation algorithms in minimac/mininac2/minimac3. Keywords: Genomics Genomics	Visit Website» Documentation» Mailing List»
MrBayes Huelsenbeck John, Larget Bret, van der Mark Paul, Ronquist Fredrik, Simon Donald	a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. MrBayes uses Markov chain Monte Carlo (MCMC) methods to estimate the posterior distribution of model parameters. Keywords: Genomics Phylogenetic Inference Phylogenomics Genomics	Visit Website» Documentation» Web Forum»
Octopus	Octopus is a mapping-based variant caller that implements several calling models within a unified haplotype-aware framework. Octopus takes inspiration from particle filtering by constructing a tree of haplotypes and dynamically pruning and extending the tree based on haplotype posterior probabilities in a sequential manner. This allows octopus to implicitly consider all possible haplotypes at a given loci in reasonable time. Keywords: Variant Analysis Genomics	Visit Website» Documentation»
OSGenome	an Open Source Web Application for Genetic Data (SNPs) using 23AndMe and Data Crawling Technologies. Keywords: Genotype-Phenotype Analysis Genomics	Visit Website»
Peakachu	an acronym that standands for Unveil Hi-C Anchors and Peaks, Peakachu takes genome-wide contact data as input and returns coordinates of likely interactions such as chromatin loops. Keywords: Hi-C Genomics	Visit Website» Documentation»
peddy	compares familial-relationships and sexes as reported in a PED/FAM file with those inferred from a VCF. Keywords: Variant Analysis Genomics	Visit Website»
PhaseDel	a Java-based variant caller designed for detecting somatic deletions from high-coverage (~30x) single-cell whole-genome sequencing (scWGS) data. Keywords: Variant Analysis Genomics	Visit Website»
phASER	(phasing and Allele Specific Expression from RNA-seq) performs haplotype phasing using read alignments in BAM format from both DNA and RNA based assays, and provides measures of haplotypic expression for RNA based assays. Keywords: RNA-Seq Analysis High-Throughput Sequencing Genomics	Visit Website» Documentation»
PHAST	(Phylogenetic Analysis with Space/Time models) a software package for comparative and evolutionary genomics. Keywords: Comparative Genomics Genomics	Visit Website»
PhenoGPT2	an advanced phenotype recognition model, leveraging the robust capabilities of large language models. It is an improved version of PhenoGPT (Jingye et. al. 2023). It employs a fine-tuned implementation on the synthetic medical data generated by Llama 3.1 70B, MIMIC-IV deidentified clinical notes, and Human Phenotype Ontology Database, to enhance prediction accuracy and alignments. Keywords: Genomics Genotype-Phenotype Analysis Genomics	Visit Website» Documentation» Mailing List»
PHESANT	PHESANT - PHEnome Scan ANalysis Tool Run a phenome scan (pheWAS, Mendelian randomisation (MR)-pheWAS etc.) in UK Biobank. There are three components in this project: Running a phenome scan in UK Biobank Post-processing of results PHESANT-viz: Visualising the results Keywords: Genomics Genomics	Visit Website»
Pindel	can detect breakpoints of large deletions, medium sized insertions, inversions, tandem duplications and other structural variants at single-based resolution from next-gen sequence data. It uses a pattern growth approach to identify the breakpoints of these variants from paired-end short reads. Keywords: DNA-Sequencing Genomics High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
Platypus	Platypus is a tool designed for efficient and accurate variant-detection in high-throughput sequencing data. By using local realignment of reads and local assembly it achieves both high sensitivity and high specificity. Platypus can detect SNPs, MNPs, short indels, replacements and (using the assembly option) deletions up to several kb. It has been extensively tested on whole-genome, exon-capture, and targeted capture data, it has been … Keywords: Variant Analysis Genomics	Visit Website» Documentation»
PLINK	a comprehensive update to Shaun Purcell's PLINK command-line program -- a whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses. Keywords: Association Mapping Genotype-Phenotype Analysis GWAS Analysis Genomics	Visit Website» Web Forum»
plmc Debora S Marks, John Ingraham	a tool that infers undirected graphical models to describe coevolution and covariation in families of biological sequences. With a multiple sequence alignment as an input, plmc can quantify inferred coupling strengths between all pairs of positions (couplingsfile output) or infer a generative model of the sequences for predicting the effects of mutations or designing new sequences (paramfile output). Keywords: Statistical Analysis Genomics	Visit Website» Documentation» Web Forum»
PopDel	(Population-wide Deletion Calling) fast structural deletion calling on population-scale short read paired-end germline WGS data. Keywords: Genomics Structural Variant Analysis Genomics	Visit Website» Documentation»
poppunk	(POPulation Partitioning Using Nucleotide Kmers) Calculate core and accessory distances, cluster genomes, assign new genomes to clusters, make visualisations Keywords: Genomics Metagenomic Sequencing Analysis Genomics	Visit Website» Documentation» Web Forum»
Prodigal	is a fast, reliable protein-coding gene prediction for prokaryotic genomes. Keywords: Genome Annotation Genomics	Visit Website» Documentation»
prodigal‑gv	A fork of Prodigal meant to improve gene calling for giant viruses and viruses that use alternative genetic codes. Keywords: Genome Annotation Genomics	Visit Website»
ProSolo	is a variant caller for single cell data from whole genome amplification with multiple displacement amplification (MDA). It relies on a pair of samples, where one is from an MDA single cell and the other from a bulk sample of the same cell population, sequenced with any next-generation sequencing technology. Keywords: Variant Analysis Genomics	Visit Website»
QTLtools	QTLtools is a tool set for molecular QTL discovery and analysis. Keywords: quantitative trait loci (QTLs) mapping/discovery Genomics	Visit Website»
RGT	(Regulatory Genomics Toolbox) is an open source python library for analysis of regulatory genomics. RGT is programmed in an oriented object fashion and its core classes provide functionality for handling regulatory genomics data. Keywords: ChIP-Sequencing DNA-Sequencing Visualization Genomics	Visit Website» Documentation» Web Forum»
RingMapper Anthony Mustoe, Nicole N. Lama, Kevin M Weeks, Patrick S. Irving, Samuel W. Olson	a code for performing RING-MaP and PAIR-MaP analysis. Keywords: RNA-Seq Analysis RNA-Sequencing Genomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
RODEO	(Rapid ORF Description & Evaluation Online) evaluates one or many genes, characterizing a gene neighborhood based on the presence of profile hidden Markov models (pHMMs). Keywords: Genomics Genomics	Visit Website» Documentation»
sansa	Structural variant (SV) annotation. Keywords: Variant Analysis Genomics	Visit Website»
SCRAMBle	runs as a two-step process. First cluster_identifier is used to generate soft-clipped read cluster consensus sequences. Second, SCRAMBle-MEIs.R analyzes the cluster file for likely Mobile Element Insertions. Keywords: Genomics High-throughput sequencing Genomics	Visit Website»
selscan	a program to calculate EHH-based scans for positive selection in genomes. Keywords: Genome Annotation Genomics	Visit Website»
SEQLinkage	implements a collapsed haplotype pattern (CHP) method to generate markers from sequence data for linkage analysis. Keywords: ChIP-Sequencing Genomics	Visit Website» Documentation»
SHAPEIT5	a software package to estimate haplotypes in large genotype datasets (WGS and SNP array). Keywords: WGS Analysis High-Throughput Sequencing Genomics	Visit Website»
SMALT	SMALT aligns DNA sequencing reads with a reference genome. Reads from a wide range of sequencing platforms can be processed, for example Illumina, Roche-454, Ion Torrent, PacBio or ABI-Sanger. Paired reads are supported. There is no support for SOLiD reads. Keywords: Sequence Alignment Analysis Genomics	Visit Website» Documentation»
smoove	structural variant calling and genotyping with existing tools, but, smoothly. Keywords: Genotype-Phenotype Analysis Structural Variant Analysis Genomics	Visit Website»
Sniffles	a structural variation caller using third generation sequencing. Keywords: Structural Variant Analysis Genomics	Visit Website» Documentation»
SnpEff Pablo Cingolani	genomic variant annotation and functional effect prediction toolbox. Keywords: Genomics High-throughput sequencing Genomics	Visit Website» Documentation»
SomaticSeq	is an ensemble somatic SNV/indel caller that has the ability to use machine learning to filter out false positives from other callers. Keywords: Variant Analysis Genomics	Visit Website»
SpacePHARER	(CRISPR Spacer Phage-Host pAiRs findER) a modular toolkit for sensitive phage-host interaction identification using CRISPR spacers. Keywords: CRISPR/Cas9 Screen Analysis Genomics	Visit Website» Documentation»
SpliceV	provides analysis and publication quality printing of linear and circular RNA splicing, expression and regulation. Keywords: RNA-Sequencing Visualization Genomics Visualization	Visit Website» Documentation» Web Forum»
SpydrPick	Mutual information based detection of pairs of genomic loci co-evolving under a shared selective pressure Keywords: Genomics Genomics	Visit Website»
SRA Toolkit SRA Toolkit Development Team	(Sequence Read Archive Toolkit) a collection of tools and libraries for using data in the INSDC Sequence Read Archives. Keywords: Genomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum»
Strelka2 Christopher T Saunders, Sangtae Kim	a fast and accurate small variant caller optimized for analysis of germline variation in small cohorts and somatic variation in tumor/normal sample pairs. The germline caller employs an efficient tiered haplotype model to improve accuracy and provide read-backed phasing, adaptively selecting between assembly and a faster alignment-based haplotyping approach at each variant locus. The germline caller also analyzes input sequencing data using a mixture-model … Keywords: Genomics Variant Analysis Genomics	Visit Website» Documentation» Web Forum»
StringTie Geo Pertea, Mihaela Pertea	a fast and highly efficient assembler of RNA-Seq alignments into potential transcripts. It uses a novel network flow algorithm as well as an optional de novo assembly step to assemble and quantitate full-length transcripts representing multiple splice variants for each gene locus. Its input can include not only the alignments of raw reads used by other transcript assemblers, but also alignments longer sequences that … Keywords: RNA-Seq Analysis Transcriptomics Genomics High-Throughput Sequencing	Visit Website» Documentation»
structure	Inference of population structure using multilocus genotype data Keywords: Genomics Genomics	Visit Website»
SURVIVOR	a tool set for simulating/evaluating SVs, merging and comparing SVs within and among samples, and includes various methods to reformat or summarize SVs. Keywords: Structural Variant Analysis Genomics	Visit Website» Documentation»
SViCT	a computational tool for detecting structural variations from cell free DNA (cfDNA) containing low dilutions of circulating tumor DNA (ctDNA). Keywords: Variant Analysis Genomics	Visit Website»
TelSeq	a software that estimates telomere length from whole genome sequencing data (BAMs). Keywords: Genomics Genomics	Visit Website»
TensorFlow Sanjoy Das, Gunhan Gulsoy, Benoit Steiner	an open source software library for high performance numerical computation. Its flexible architecture allows easy deployment of computation across a variety of platforms (CPUs, GPUs, TPUs), and from desktops to clusters of servers to mobile and edge devices. Originally developed by researchers and engineers from the Google Brain team within Google’s AI organization, it comes with strong support for machine learning and deep learning, … Keywords: High Performance Computing Machine Learning Genomics High-Throughput Sequencing	Visit Website» Documentation» Web Forum» Mailing List»
tensorQTL	a GPU-enabled QTL mapper, achieving ~200-300 fold faster cis- and trans-QTL mapping compared to CPU-based implementations. Keywords: quantitative trait loci (QTLs) mapping/discovery High-Throughput Sequencing Genomics	Visit Website» Documentation» Web Forum»
Tracer	is a software package for visualising and analysing the MCMC trace files generated through Bayesian phylogenetic inference. Tracer provides kernel density estimation, multivariate visualisation, demographic trajectory reconstruction, conditional posterior distribution summary and more. Tracer v1.7.1 can read output files from MrBayes, BEAST, BEAST2, RevBayes, Migrate, LAMARC and and possibly other MCMC programs from other domains. Keywords: Phylogenomics Genomics	Visit Website» Documentation» Web Forum»
TrimGalore Felix Krueger	a wrapper around Cutadapt and FastQC to consistently apply adapter and quality trimming to FastQ files, with extra functionality for RRBS data. Keywords: High-throughput sequencing Genomics	Visit Website» Documentation» Web Forum»
Unicycler Ryan R Wick, Kathryn E. Holt, Louise M Judd, Claire L. Gorrie	an assembly pipeline for bacterial genomes. Keywords: Bioinformatics Genome Assembly RNA-Sequencing Genomics	Visit Website» Documentation» Web Forum»
VarScan Daniel C Koboldt	a platform-independent mutation caller for targeted, exome, and whole-genome resequencing data generated on Illumina, SOLiD, Life/PGM, Roche/454, and similar instruments. Restriction: available to non-profit users only. See technical notes for additional information on for-profit user licensing. Keywords: Genomics Genomics	Visit Website» Documentation» Web Forum» Mailing List»
vcfanno	allows you to quickly annotate your VCF with any number of INFO fields from any number of VCFs or BED files. It uses a simple conf file to allow the user to specify the source annotation files and fields and how they will be added to the info of the query VCF. Keywords: Variant Analysis Genomics	Visit Website»
vembrane	allows to simultaneously filter variants based on any INFO field, CHROM, POS, REF, ALT, QUAL, and the annotation field ANN. Keywords: Variant Analysis Genomics	Visit Website»
vg	Variation graphs provide a succinct encoding of the sequences of many genomes. A variation graph (in particular as implemented in vg) is composed of: * nodes, which are labeled by sequences and ids * edges, which connect two nodes via either of their respective ends * paths, describe genomes, sequence alignments, and annotations (such as gene models and transcripts) as walks through nodes connected … Keywords: Genomics Variant Analysis Genomics	Visit Website» Documentation» Web Forum»
Vmatch	a versatile software tool for efficiently solving large scale sequence matching tasks. Vmatch subsumes the software tool REPuter, but is much more general, with a very flexible user interface, and improved space and time requirements. Keywords: Sequence Alignment Analysis Genomics	Visit Website» Documentation»
vt	A tool set for short variant discovery in genetic sequence data. Keywords: Variant Analysis Genomics	Visit Website» Documentation»
WASPQTL	WASP is a suite of tools for unbiased allele-specific read mapping and discovery of molecular QTLs. Keywords: quantitative trait loci (QTLs) mapping/discovery Read Alignment Genomics High-Throughput Sequencing	Visit Website» Documentation»
WhatsHap	a software for phasing genomic variants using DNA sequencing reads, also called read-based phasing or haplotype assembly. Keywords: Genomics Genomics	Visit Website»
Zerone	discretizes several ChIP-seq replicates simultaneously and resolves conflicts between them. After the job is done, Zerone checks the results and tells you whether it passes the quality control. Keywords: ChIP-Sequencing Genomics High-Throughput Sequencing	Visit Website» Documentation»

How to use AppCiter?

1. Choose Your Programs

2. Select Citations

3. Export Citations