"9623","15","archive","17",,,"disk0/00/00/96/23","2016-10-14 09:32:34","2016-10-14 09:37:36","2016-10-14 09:32:34","article",,,"show",,,,"","","","","","","","","","",,,,"Katta","M A V S K","","","","","","","Katta","M A V S K","","","","",,,,,"","",,,,,"","","ICRISAT (Patancheru)","India","NGS-QCbox and Raspberry for Parallel, Automated and Rapid Quality Control Analysis of Large-Scale Next Generation
Sequencing (Illumina) Data","pub","s2.4","D3","crp1.5","public",,,"NGS-QCbox, Raspberry for Parallel, Rapid Quality Control",,"Authors are thankful to the CGIAR
Generation Challenge Program for financial support.
This work has been undertaken as part of the CGIAR
Research Program on Grain Legumes. ICRISAT is a
member of the CGIAR Consortium.","Rapid popularity and adaptation of next generation sequencing (NGS) approaches have
generated huge volumes of data. High-throughput platforms like Illumina HiSeq produce
terabytes of raw data that require quick processing. Quality control of the data is an
important component prior to the downstream analyses. To address these issues, we have
developed a quality control pipeline, NGS-QCbox, that scales up to process hundreds or
thousands of samples. Raspberry is an in-house tool, developed in C using
HTSlib (v1.2.1) (http://htslib.org), for computing read/base-level statistics. It can be used as a
stand-alone application and can process both compressed and uncompressed FASTQ format
files. NGS-QCbox integrates Raspberry with other open-source tools for alignment
(Bowtie2), SNP calling (SAMtools) and other utilities (bedtools) to analyze raw NGS
data with higher efficiency and in a high-throughput manner. The pipeline runs batches
of jobs in parallel using Bpipe (https://github.com/ssadedin/bpipe) and, internally, applies
fine-grained task parallelization using OpenMP. It reports read and base statistics along
with genome coverage and variants in a user-friendly format. The pipeline presents
a simple menu-driven interface and can be used in either quick or complete mode. In
addition, the pipeline in quick mode is faster than other similar existing QC
pipelines/tools. The NGS-QCbox pipeline, Raspberry tool and associated scripts are made
available at the URL https://github.com/CEG-ICRISAT/NGS-QCbox and
https://github.com/CEG-ICRISAT/Raspberry for rapid quality control analysis of large-scale next generation
sequencing (Illumina) data.","2015-10","published",,"PLOS One","10","10","Public Library of Science",,"1-9",,,,,,"10.1371/journal.pone.0139868",,,,,"TRUE",,"1932-6203",,,,,,"","http://dx.doi.org/10.1371/journal.pone.0139868","http://scholar.google.co.in/scholar?as_q=NGS-QCbox+and+Raspberry+for+Parallel%2C+Automated+and+Rapid+Quality+Control+Analysis+of+Large-Scale+Next+Generation+Sequencing+%28Illumina%29+Data++&as_epq=&as_oq=&as_eq=&as_occt=title&as_sauthors=&as_publication=&","pub",,"CGIAR Generation Challenge Program","",,,,,,"",,,,,,,"",,,,,"",,,,,"","",,,,,"","",,,,,
"9623",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,"Khan","A W","","",,,,,"Khan","A W","","",,,,,,,,,,,,,,,"The University of Western Australia (Crawley)","Australia",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
"9623",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,"Doddamani","D","","",,,,,"Doddamani","D","","",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
"9623",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,"Thudi","M","","",,,,,"Thudi","M","","",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
"9623",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,"Varshney","R K","","",,,,,"Varshney","R K","","",,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
