All published articles of this journal are available on ScienceDirect.
A Cloud Computing System to Quickly Implement New Microarray Data Pre-processing Methods
Abstract
Background:
Pre-processing of raw microarray data, including normalization, is crucial to microarray data analysis. Building newly developed algorithms into commercial software or locally developed systems takes time and effort. Although most new algorithms appear as shareable R packages, many biologists find it difficult to apply them as soon as they become available. Currently, we rely on statisticians and experienced programmers to develop and implement code that accesses those R packages. We therefore need a robust procedure for implementing pre-processing methods quickly as they appear. The emerging cloud computing concept points to a new way of providing an easily accessible service to biologists without requiring them to have any programming knowledge of R.
Results:
Building on our earlier Java-based software tool, JavaStat, we developed an Internet-based application prototype for uploading data and carrying out pre-processing tasks, including normalization, statistical analyses, and plots. More importantly, R packages, e.g., for newly developed normalization methods and the GC-robust multichip algorithm (RMA) for exon arrays, can be easily incorporated into the system with limited input from a biologist or a programmer. The data are stored in the cloud and the R code runs on the server.
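As a rough illustration of how an R package might be plugged into a Java front end with little extra code, the sketch below registers a method by name and builds the Rscript command a server could run on an uploaded data directory. It is not taken from JavaStat: the class PreprocessingRegistry, its methods, and the file paths are hypothetical, and the R expression assumes Affymetrix CEL files that the affy package can read.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch; none of these names come from JavaStat itself.
class PreprocessingRegistry {

    // Describes one pre-processing method: the R package to load and the function to call.
    static final class RModule {
        final String rPackage;
        final String rFunction;

        RModule(String rPackage, String rFunction) {
            this.rPackage = rPackage;
            this.rFunction = rFunction;
        }
    }

    private final Map<String, RModule> modules = new LinkedHashMap<>();

    // Adding a new method only requires naming its R package and entry function.
    void register(String methodName, String rPackage, String rFunction) {
        modules.put(methodName, new RModule(rPackage, rFunction));
    }

    // Builds the command line a server could run for one uploaded data set.
    // Assumption: celDir holds CEL files and outCsv is a writable path on the server.
    List<String> buildCommand(String methodName, String celDir, String outCsv) {
        RModule m = modules.get(methodName);
        if (m == null) {
            throw new IllegalArgumentException("Unknown method: " + methodName);
        }
        // Paths are embedded in single-quoted R strings, so reject embedded quotes.
        if (celDir.contains("'") || outCsv.contains("'")) {
            throw new IllegalArgumentException("Paths must not contain single quotes");
        }
        String expr = String.format(
            "suppressMessages(library(%s)); "
                + "eset <- %s(affy::ReadAffy(celfile.path='%s')); "
                + "write.csv(Biobase::exprs(eset), file='%s')",
            m.rPackage, m.rFunction, celDir, outCsv);
        return Arrays.asList("Rscript", "-e", expr);
    }
}
```

Under these assumptions, calling register("gcrma", "gcrma", "gcrma") would make the method available, and the list returned by buildCommand could be passed to java.lang.ProcessBuilder on the server to run the normalization.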
Conclusion:
The emerging cloud computing concept offers a new way to provide an easily accessible and up-to-date service to biologists, as evidenced by our JavaStat system, which incorporates new pre-processing packages as they appear. Users can access the application, including any newly incorporated module, through the Web. We expect this and similar systems to greatly decrease turn-around time and improve the accessibility of newly developed R packages for pre-processing algorithms.