RESEARCH ARTICLE


Comparison of Sequencing Utility Programs



Erik Aronesty*
Expression Analysis, A Quintiles Company, Durham, NC, USA


Article Metrics

CrossRef Citations:
0
Total Statistics:

Full-Text HTML Views: 255
Abstract HTML Views: 2503
PDF Downloads: 499
Total Views/Downloads: 3257
Unique Statistics:

Full-Text HTML Views: 179
Abstract HTML Views: 1400
PDF Downloads: 428
Total Views/Downloads: 2007



© 2013 Erik Aronesty

open-access license: This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International Public License (CC-BY 4.0), a copy of which is available at: https://creativecommons.org/licenses/by/4.0/legalcode. This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

* Address correspondence to this author at the Expression Analysis, a Quintiles Company, Durham, NC, USA; Tel: 919-287-4011; E-mail: erik.aronesty@quintiles.com


Abstract

High throughput sequencing (HTS) has resulted in extreme growth rates of sequencing data. At our lab, we generate terabytes of data every day. It is usually seen as required for data output to be “cleaned” and processed in various ways prior to use for common tasks such as variant calling, expression quantification and assembly.

Two common tasks associated with HTS are adapter trimming and paired-end joining. I have developed two tools at Expression Analysis, Inc. to address these common tasks. The names of these programs are fastq-mcf and fastq-join. I compared the performance of these tools to similar open-source utilities, both in terms of resource efficiency, and effectiveness.

Keywords: Sequencing, utilities, Illumina, clipping, ngs and next-generation.