PromoterPredict: sequence-based modelling of Escherichia coli σ70 promoter strength yields logarithmic dependence between promoter strength and sequence

This article has 6 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

We present PromoterPredict, a dynamic multiple regression approach to predict the strength of Escherichia coli promoters binding the σ70 factor of RNA polymerase. σ70 promoters are ubiquitously used in recombinant DNA technology, but characterizing their strength is demanding in terms of both time and money. Using a well-characterized set of promoters, we trained a multivariate linear regression model and found that the log of the promoter strength is significantly linearly associated with a weighted sum of the –10 and –35 sequence profile scores. It was found that the two regions contributed almost equally to the promoter strength. PromoterPredict accepts –10 and –35 hexamer sequences and returns the predicted promoter strength. It is capable of dynamic learning from user-supplied data to refine the model construction and yield more confident estimates of promoter strength.

Availability

Open source code and a standalone executable with both dynamic model-building and prediction are available (under GNU General Public License 3.0) at <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/PromoterPredict">https://github.com/PromoterPredict</ext-link>, and require Python 2.7 or greater. PromoterPredict is also available as a web service at <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://promoterpredict.com">https://promoterpredict.com</ext-link>.

Contact

<email>apalania@scbt.sastra.edu</email>

Related articles

Related articles are currently not available for this article.