Precise physical models of protein-DNA interaction from high-throughput data Journal Article

Author(s): Kinney, Justin B; Tkacik, Gasper; Callan, Curtis G
Article Title: Precise physical models of protein-DNA interaction from high-throughput data
Abstract: A cell's ability to regulate gene transcription depends in large part on the energy with which transcription factors (TFs) bind their DNA regulatory sites. Obtaining accurate models of this binding energy is therefore an important goal for quantitative biology. In this article, we present a principled likelihood-based approach for inferring physical models of TF-DNA binding energy from the data produced by modern high-throughput binding assays. Central to our analysis is the ability to assess the relative likelihood of different model parameters given experimental observations. We take a unique approach to this problem and show how to compute likelihood without any explicit assumptions about the noise that inevitably corrupts such measurements. Sampling possible choices for model parameters according to this likelihood function, we can then make probabilistic predictions for the identities of binding sites and their physical binding energies. Applying this procedure to previously published data on the Saccharomyces cerevisiae TF Abf1p, we find models of TF binding whose parameters are determined with remarkable precision. Evidence for the accuracy of these models is provided by an astonishing level of phylogenetic conservation in the predicted energies of putative binding sites. Results from in vivo and in vitro experiments also provide highly consistent characterizations of Abf1p, a result that contrasts with a previous analysis of the same data.
Keywords: Protein Binding; Likelihood Functions; Binding Sites; Thermodynamics; Models: Chemical; Transcription Factors; Biophysics; DNA-Binding Proteins; Biophysical Phenomena; DNA: Fungal; Protein Array Analysis; Saccharomyces cerevisiae; Saccharomyces cerevisiae Proteins; DNA
Journal Title: PNAS
Volume: 104
Issue 2
ISSN: 1091-6490
Publisher: National Academy of Sciences  
Date Published: 2007-01-09
Start Page: 501
End Page: 506
DOI: 10.1073/pnas.0609908104
Open access: no