AlmostSignificant

Simplifying quality control of high-throughput sequencing data

Joseph Ward, Christian Cole (Lead / Corresponding author), Melanie Febrer, Geoffrey Barton (Lead / Corresponding author)

Research output: Other contribution

13 Downloads (Pure)

Abstract

Motivation: The current generation of DNA sequencing technologies produce a large amount of data quickly. All of these data need to pass some form of quality control processing and checking before they can be used for any analysis. The large number of samples that are run through Illumina sequencing machines makes the process of quality control an onerous and time-consuming task that requires multiple pieces of information from several sources. Results: AlmostSignificant is an open-source platform for aggregating multiple sources of quality metrics as well as meta-data associated with DNA sequencing runs from Illumina sequencing machines. AlmostSignificant is a graphical platform to streamline the quality control of DNA sequencing data, to collect and store these data for future reference and to collect extra meta-data associated with the sequencing runs to check for errors and monitor the volume of data produced by the associated machines. AlmostSignificant has been used to track the quality of over 80 sequencing runs covering over 2500 samples produced over the last three years. Availability: The code and documentation for AlmostSignificant is freely available at https://github.com/bartongroup/AlmostSignificant.
Original languageEnglish
Media of outputBioRxiv
PublisherBioRxiv
Number of pages2
DOIs
Publication statusPublished - 16 May 2016

Fingerprint

DNA Sequence Analysis
Quality Control
Documentation
Technology

Cite this

@misc{4f6469bf5040454b861db9bc6afc92e2,
title = "AlmostSignificant: Simplifying quality control of high-throughput sequencing data",
abstract = "Motivation: The current generation of DNA sequencing technologies produce a large amount of data quickly. All of these data need to pass some form of quality control processing and checking before they can be used for any analysis. The large number of samples that are run through Illumina sequencing machines makes the process of quality control an onerous and time-consuming task that requires multiple pieces of information from several sources. Results: AlmostSignificant is an open-source platform for aggregating multiple sources of quality metrics as well as meta-data associated with DNA sequencing runs from Illumina sequencing machines. AlmostSignificant is a graphical platform to streamline the quality control of DNA sequencing data, to collect and store these data for future reference and to collect extra meta-data associated with the sequencing runs to check for errors and monitor the volume of data produced by the associated machines. AlmostSignificant has been used to track the quality of over 80 sequencing runs covering over 2500 samples produced over the last three years. Availability: The code and documentation for AlmostSignificant is freely available at https://github.com/bartongroup/AlmostSignificant.",
author = "Joseph Ward and Christian Cole and Melanie Febrer and Geoffrey Barton",
year = "2016",
month = "5",
day = "16",
doi = "10.1101/053702",
language = "English",
publisher = "BioRxiv",
address = "United States",
type = "Other",

}

TY - GEN

T1 - AlmostSignificant

T2 - Simplifying quality control of high-throughput sequencing data

AU - Ward, Joseph

AU - Cole, Christian

AU - Febrer, Melanie

AU - Barton, Geoffrey

PY - 2016/5/16

Y1 - 2016/5/16

N2 - Motivation: The current generation of DNA sequencing technologies produce a large amount of data quickly. All of these data need to pass some form of quality control processing and checking before they can be used for any analysis. The large number of samples that are run through Illumina sequencing machines makes the process of quality control an onerous and time-consuming task that requires multiple pieces of information from several sources. Results: AlmostSignificant is an open-source platform for aggregating multiple sources of quality metrics as well as meta-data associated with DNA sequencing runs from Illumina sequencing machines. AlmostSignificant is a graphical platform to streamline the quality control of DNA sequencing data, to collect and store these data for future reference and to collect extra meta-data associated with the sequencing runs to check for errors and monitor the volume of data produced by the associated machines. AlmostSignificant has been used to track the quality of over 80 sequencing runs covering over 2500 samples produced over the last three years. Availability: The code and documentation for AlmostSignificant is freely available at https://github.com/bartongroup/AlmostSignificant.

AB - Motivation: The current generation of DNA sequencing technologies produce a large amount of data quickly. All of these data need to pass some form of quality control processing and checking before they can be used for any analysis. The large number of samples that are run through Illumina sequencing machines makes the process of quality control an onerous and time-consuming task that requires multiple pieces of information from several sources. Results: AlmostSignificant is an open-source platform for aggregating multiple sources of quality metrics as well as meta-data associated with DNA sequencing runs from Illumina sequencing machines. AlmostSignificant is a graphical platform to streamline the quality control of DNA sequencing data, to collect and store these data for future reference and to collect extra meta-data associated with the sequencing runs to check for errors and monitor the volume of data produced by the associated machines. AlmostSignificant has been used to track the quality of over 80 sequencing runs covering over 2500 samples produced over the last three years. Availability: The code and documentation for AlmostSignificant is freely available at https://github.com/bartongroup/AlmostSignificant.

U2 - 10.1101/053702

DO - 10.1101/053702

M3 - Other contribution

PB - BioRxiv

ER -