Guide to Threshold Selection for Motif Prediction Using Positional Weight Matrix

  1. (PDF, 421 KB)
AuthorSearch for: ; Search for:
ConferenceProceedings of the International MultiConference of Engineers and Computer Scientists 2008 (IMECS'08) at The 2008 IAENG International Conference on Bioinformatics (ICB'08), March 19-21, 2008., Hong Kong, China
Subjectsequence motif; positional weight matrix; log-odd score; statistical expectation; goodness-of-fit; matrice position-poids; score log-odd; espérance statistique; validité de la concordance
AbstractIn biological sequence research, the positional weight matrix (PWM) is often used to search for putative transcription factor binding sites. A log-odd score is usually applied to measure the closeness of a subsequence to the PWM. However, the log-odd score is motif-length-dependent and thus there is no universally applicable threshold. In this paper, we propose an alternative scoring index (G) varying from zero, where the subsequence is not much different from the background, to one, where the subsequence fits best to the PWM. We also propose a measure evaluating the statistical expectation at each G index. We investigated the PWMs from the TRANSFAC and found that the statistical expectation is significantly ( p < 0.0001) correlated with both the length of the PWMs and the threshold G value. We applied this method to two PWMs (GCN4_C and ROX1_Q6) of yeast transcription factor binding sites and two PWMs (HIC1-02, HIC1_03) of the human tumor suppressor (HIC-1) binding sites from the TRANSFAC database. Finally, our method compares favorably with the broadly used Match method. The results indicate that our method is more flexible and can provide better confidence.
Publication date
AffiliationNRC Institute for Information Technology; National Research Council Canada
Peer reviewedNo
NRC number49881
NPARC number8913400
Export citationExport as RIS
Report a correctionReport a correction
Record identifier47384da9-c7d2-41af-a3ef-13a49f0b3202
Record created2009-04-22
Record modified2016-05-09
Bookmark and share
  • Share this page with Facebook (Opens in a new window)
  • Share this page with Twitter (Opens in a new window)
  • Share this page with Google+ (Opens in a new window)
  • Share this page with Delicious (Opens in a new window)
Date modified: