Volume 10 Supplement 4
A novel index of protein-protein interface propensity improves interface residue recognition
- Wentao Dai†1, 3,
- Aiping Wu†2,
- Liangxiao Ma1,
- Yi-Xue Li1, 3, 4,
- Taijiao Jiang2, 5Email author and
- Yuan-Yuan Li1, 3, 4Email author
© The Author(s). 2016
Published: 23 December 2016
Protein-protein interface holds important information of protein-protein interactions which play key roles in most biological processes. In the past few years, a lot of efforts have been made to improve interface residue recognition by characterizing protein-protein interfaces and extracting relevant features. However, most previous studies were carried out in a qualitative level, and there are also some inconsistencies between them.
In the present work, to improve interface residue recognition, we built a novel quantitative residue protein-protein interface propensity index (QIPI) and gained a comprehensive picture of protein-protein interface through analyzing protein-protein interfaces on our comprehensive protein-protein interfaces dataset (Astral2.05-40-4506). Furthermore, in order to assess the effect of QIPI in improving the protein-protein interface prediction, we developed an interface residue recognition method SPR (Single domain based Patch Recognition) based on the QIPI. The evaluation results proved that our novel QIPI is able to improve the interface residue recognition.
Through a comprehensive quantitative analysis of protein-protein interface, we constructed a novel quantitative protein-protein interface propensity index (QIPI), which could be easily applied to improve the interface residue recognition and helpful in understanding the protein-protein interface.
QIPI and SPR are available to non-commercial users at our website: http://www.scbit.org/QIPI/.
Protein-protein interactions play crucial roles in many biological functions [1–3]. A detailed characterization of protein-protein interactions may provide crucial information about the function of protein complexes which would be helpful in medicine and drug researches [4–6]. In order to elucidate the mechanisms of protein-protein interactions, a number of biophysical techniques [7, 8] including X-ray crystallography, various spectroscopic techniques, cross-linking methods, mutation studies and so on, have been employed to investigate protein-protein interface properties. Meanwhile, a lot of efforts have been made to find the critical factors determining the specificity and affinity of protein–protein interfaces [3, 9–11].
It is indicated that protein-protein interfaces are characterized by several distinguishing properties from the rest of the surfaces in terms of geometric and chemical complementarities between interfaces, ranging from hydrophobic forces, electrostatic forces, surface planarity, interface biased residue composition to inter-residue contacts [12–15]. Knowledge of these characteristics has enabled the understanding of the interface as a whole. Various hypotheses have been proposed to delineate the interface architecture and explore the mechanisms of protein-protein interactions. The first study is O-Ring theory which concluded that the existence of a hot-spot enriched region at the center surrounded by an outer ring of non-conserved residues to occlude water [16, 17]. Later on, a series of hypotheses were developed to refine the O-Ring theory [18–20]. Another viewpoint proposes that interface should be divided into core and rim area: the former consisting largely of buried atoms and the latter formed mainly by exposed atoms . However, there are some inconsistencies between these studies. Taking basic residues’ interface preference as an example, Arg and His showed positive interface propensity in some studies [14, 15] but opposite preference of these ones were also reported by other researchers [21, 22]. Moreover, qualitative results were given by most previous studies, while the interface residue recognition methods essentially need quantitative interface propensities [14, 15, 23]. There are two main reasons leading to these contradictory conclusions in previous studies: lacking a comprehensive non-redundant protein-protein interface dataset and ignoring the bias effect of solvent accessibility between interfaces and non-interface surfaces. In order to gain a comprehensive picture of protein-protein interface, we first constructed a latest comprehensive protein-protein interfaces dataset (Astral2.05-40-4506) which was extracted from the latest version of Structural Classification of Proteins — extended (SCOPe) database (v2.05) . Then we reassessed the various features excluding the bias effect of solvent accessibility in a suitable manner on the dataset Astral2.05-40-4506.
In this work, we performed a novel analysis of protein-protein interface on our comprehensive protein-protein interfaces dataset (Astral2.05-40-4506). Because the interface and non-interface surfaces have different solvent accessibility, it is not well known whether their difference is due to the differences in solvent accessibility or differences in functionality (such as protein-protein interaction). The bias effect of solvent accessibility should be excluded in the protein-protein interface analysis. We analyzed the interface using non-interface surface as reference to remove the bias effect of solvent accessibility. In a convincing manner, a novel quantitative residue interface propensity index (QIPI) was constructed from our analysis and an interface residue recognition method SPR (Single domain based Patch Recognition) was developed based on the quantitative index to evaluate the interface prediction power of QIPI. The result shows that the QIPI not only characterizes protein-protein interfaces, but also helps to improve the interface residue recognition.
Datasets and interface definition
Protein complexes were retrieved from the latest version of Structural Classification of Proteins — extended (SCOPe) database (v2.05) . A previous study demonstrates that interface properties showed consistency across different datasets, which are from the same raw protein database but with different constraints on sequence similarity and structure quality . Based on the above reason, we constructed the Astral2.05-40 dataset, which is a subset of SCOPe2.05 with less than 40% identity between any two domains, for large-scale analysis of interface propensities.
A dataset of protein-protein interfaces (referred to as Astral2.05-40-4506), which consists of 4506 interfaces, was thus obtained from the Astral2.05-40 dataset.
The Astral2.05-40-4506 was used as the comprehensive interface dataset to analyze characteristics of protein-protein interfaces and develop our interface prediction method. We used the independent dataset Docking Benchmark 2.0  to evaluate the power of new interface features especially the quantitative residue interface propensity index (QIPI) for interface prediction. The Docking Benchmark 2.0, which contained 84 complexes and 168 monomers, consists of 168 interfaces.
Two protein-protein interface datasets were widely used to assess interface residue recognition methods in the previous study. The first dataset consists of 25 CAPRI targets and 176 interfaces. The second dataset Enz35 set consists of 35 protein interfaces  and these proteins in this dataset are all enzymes. In order to compare SPR with the existing popular interface prediction method directly, we carried out the tests based on these two datasets.
For a single domain, the residue whose accessible surface area (ASA) > 1 Å2 is defined as surface residue. Surface residues were classified into two groups: interface and non-interface. The interface is formed by spatially neighboring residues whose ASA between single domain and complex were changed more than 1 Å2 per site and cross-interface contacts distance < 5 Å. The other surface residues are non-interface [14, 26, 27]. The accessible surface area (ASA) of residues was computed using NACCESS (http://www.bioinf.manchester.ac.uk/naccess/). Only surface residues were considered in the analysis and assessment. Similarly, only unbound structures were used for interface prediction.
Relative Interface Ratio (RIR) and contact preferences
Let f i be the number of interface residues of type i, and F i be the number of non-interface surface residues of type i. The frequency of residue i in the interface and non-interface surface were calculated as w i = f i /∑ m f m and W i = F i /∑ m F m (m is the residue type), respectively. The relative interface ratio (RIR) of residue type i was given by (w i /W i ). As the similar criteria, we analyzed the frequency and RIR of secondary structure elements in interface. In order to analyze the independent and cooperation effect of residues and secondary structures, we considered 60 classes of residues as defined by 20 residue types multiplied by 3 secondary structure states and analyzed the frequency and RIR of the 60 kinds of residues at interface.
In order to describe the ASA propensities for interface and non-interface surface residues, we got the ASA threshold A t for residue type i from the Astral2.05-40-4506. The ASA threshold A t was defined that ASA frequency (percentage of residues in the ASA bins) of interface residue type i was very close to the ASA frequency of non-interface surface ones in the A t bin (Additional File 1: Figure S1). The A t of 20 amino acids were calculated and shown in Additional file 2: Table S1. f IS(i) was the number of interface residue type i whose ASA < A t , and f IL(i) was the number of interface residue type i whose ASA ≥ A t . As the similar definition, the f SS(i) and f SL(i) are generated for the non-interface surface residue type i. The relative interface ratio (RIR) of residue type i in ASA was given by (f IL(i)/f IS(i))/(f SL(i)/f SS(i)).
C ij was the number of interface-crossing contacts between residues of types i and j. The raw contact frequency between residues of types i and j was calculated as (C ij /∑ m,n C mn ). Here, m and n are residue types in the interface-crossing contacts. The contact preference between residue types i and j was calculated as log2((C ij /∑ m,n C mn )/(w i × w j )), where w i and w j were defined as above.
Interface size and residue number is calculated separately for each side of an interface. Domain size is also calculated for each domain. The summary of statistic result was shown in histogram and probability density function curve.
Based on characteristics of interface especially the QIPI in our analysis, a novel method SPR (Single domain based Patch Recognition) was developed as an interface predictor to assess the effect of interface features founded by us. Therefore, in SPR, we focus on (i) patches generated on the protein surface as virtual interfaces, which is described in the section of patch generation and (ii) the scoring function to evaluate the quality of a virtual interface, which is described in the section of scoring function.
Patch generation on the protein surface
Step I: Identification of surface residues. As in the above analysis, surface residues are defined as accessible surface area (ASA) > 1 Å2.
Step II: Generation of residue side-chain distance matrix. For a protein sequence, the minimum distance between side-chain atoms of each residue pair (Cα to Cα distance in the case of glycine) was calculated as the element of residue side-chain distance matrix. If the minimum distance of a residue pair >25 Å, the corresponding element in the matrix was 25 Å.
Step III: Construction of candidate interface patches. A random surface residue was selected as the seed residue, and neighboring surface residues whose ASA and distance to the seed residue satisfy the standard in the Table 1A were included in the candidate interface patch. All of the surface residues were sampled and a series of candidate interface patches were constructed.Table 1
Patch generation thresholds
A The ASA and distance with seed residue of patch residue
B Thresholds for patch merging
Step IV: Merging the candidate interface patches. For candidate interface patches in a protein, two patches were merged into a new patch when the ratio of identity residues between two patches was not less than the threshold (Table 1B). The merging process was kept iterating until there wasn’t any candidate patches could be merged.
The final predicted interface is defined as the top-ranked candidate interface patch measured by the following scoring function for interface-residue recognition.
The scoring function for interface-residue recognition
Residue interface propensity score. We use a scoring function to calculate similarity between patch and interface based on the sum of residue interface propensity which is calculated from QIPI. The score for a given patch, whose residue interface propensity score E res was calculated as:
Hydrophobic score. The term E hydro is the hydrophobic score of the query patch, which is given below:
Residue conservation score. Residue conservation was assessed by the self-substitution score based on the sequence profile. Sequence profiles were built by using PSI-BLAST  to search against non-redundant (NR) database with the BLOSUM62  substitution matrix. The conservation score of the given patch was defined as:
Solvation energy score. The E sol was adapted from the one used in Cyscore , which is formulated as follows:
Training and evaluation
The two criteria were used as the performance assessment in our study because a good interface recognition method could identify more real interface residues with less false positives.
The optimization goal was to maximize the cost function F value. This training process could balance the accuracy and coverage to avoid the overfitting of parameters. To evaluate the robustness of the SPR, a 10-fold cross-validation for SPR on Astral2.05-40-4506 dataset was carried out.
After training of SPR using the above process, the performance of SPR was tested on two datasets CAPRI25 and Enz35 using accuracy and coverage compared with several popular interface recognition programs [14, 23].
To gain an overall performance of SPR, we further tested it on two independent datasets, CAPRI25 and Enz35, by making comparison with several popular interface prediction programs, Meta-PPISP , con-PPISP , Promat , PINUP . Meta-PPISP is probably one of most popular programs in this field and widely used as the reference method in the recent research . Meta-PPISP is a meta-server built on scores from other method through linear regression. Con-PPISP combines PSI-Blast sequence profile and solvent accessibility in a neural network. Promate is a naïve Bayesian method consisting of properties such as secondary structure, atom distribution and sequence conservation. PINUP employs solvent accessible area, sequence conservation and side-chain energy in an empirical scoring function.
In this section, we first show the characteristics of protein interfaces in our analysis and develop a novel quantitative residue interface propensity index (QIPI). Secondly, we explore the contribution of the QIPI to improvement of interface-residue recognition. Finally, we demonstrate the performance of SPR by comparing it with several existing popular interface prediction programs.
Characteristics of interface
Each protein surface was divided into two disjoint groups: interface and non-interface. Interface properties including residue composition, secondary structure, solvent accessibility, contact preference and interface size were analyzed using Astral2.05-40-4506.
Residue composition and QIPI
Quantitative residue interface propensity index
Figure 2b compares the 60 residue compositions of interfaces and non-interface surfaces in order to analysis the independent and cooperation effect of residues and secondary structures. Combined with Figs. 1 and 2, we could find that the principal factor of interface propensity is the residue type. Within each residue types, trends of three secondary structure classes are almost as similar as that in Fig. 2a.
In summary, the residue composition is a crucial interface feature and the QIPI could be used in improving the interface-residue recognition.
Solvent accessibility and contact preference
Combined with the RIR of residues and contact preferences, we may conclude that Arg, Phe, Trp and Tyr have the highest interface propensity. The reason is that RIR of these residues >1.2 (as shown in Table 2) and the number of contacts include these residues with high contact preference (more than 1.5 in pink as Fig. 4b) is at least 2. This result further supports that our QIPI grasping the interface feature.
In Fig. 5b, we could find that the size of interface residue number also has a gamma distribution and the average of interface residue numbers is about 20. Figure 5c shows that domain sizes also span a broad range but have a distribution that is very different from interface ones. The average domain size is about 9000 Å2 which is much larger than that of interface. The difference between interface and domain sizes indicates that the interface size and residue number could be used as constraints in generating candidate interface patches for prediction methods.
The QIPI contributes to the improvement of interface residue recognition
Contribution of interface features to interface residue recognition
QIPI + Hydrophobic
To evaluate the robustness of SPR, a 10-fold cross-validation was carried out on the training set Astral2.05-40-4506. The average of coverage and accuracy were 0.506 ± 0.020 and 0.267 ± 0.019 respectively (see Additional file 2: Table S2 for details), which indicates the stable performance of SPR in the recognition of interface residue.
Comparison of interface prediction methods
Comparisons of SPR with several popular interface prediction programs on CAPRI25 dataset
Comparisons of SPR with several popular interface prediction programs on Enz35 dataset
In this study, through exploring the structural and physicochemical characteristics underlying various protein-protein interfaces, we have attempted to investigate various interface features and have successfully constructed a novel quantitative index of residue interface propensity. Identifying key features of protein-protein interface is a crucial step in understanding protein-protein interactions and exploring the function and evolution of protein complexes. At the same time, the quantitative interface propensity could also be used in improving the interface residue recognition, which is important for a series of computational structure biology problems such as docking and protein design. For these reasons, a number of efforts have been devoted to characterize the interface physicochemical properties and propose hypotheses such as O-Ring to depict the mechanism of protein-protein interaction. However, previous studies were limited by lacking a comprehensive non-redundant protein-protein interface dataset and ignoring relative solvent accessibility of interface residues distributions when analyzing interface features. This leads to some inconsistencies in this field. For example, Arg and His showed diverse interface preference in different previous studies, and it is difficult to improve interface residue recognition based on the qualitative knowledge from these analyses [14, 15, 23].
In order to solve the above-mentioned problems, we carried out a new quantitative analysis for exploring various features of protein-protein interface. Compared with previous studies, the main outputs of this study included: 1) a large-scale comprehensive interface dataset Astral2.05-40-4506 for analysis; 2) novel quantitative interface propensities using non-interface surface as reference to remove the bias effect of solvent accessibility; 3) a novel quantitative residue interface propensity index (QIPI) and other interface features improving interface residue recognition confirmed by the interface prediction method SPR.
Previously, lots of researches revealed that the interfaces have more hydrophobic and aromatic residues but puzzled by the observation that Arg and His also present more frequently at interface [14, 21, 22, 40]. For example, in the work of Yan et al. , the normalized interface propensity of residues, which is based on the accessible surface area, is highly consistent with the data based on our RIR. They concluded that the hydrophobic and aromatic residues had high interface propensity, but they were not able to explain the high interface propensities of Arg and His. According to our analysis, it is indicated that residues with long side chain (such as Arg and His) showed interface preference in a convincing manner, which solves the above puzzle. Our observation about interface preference of hydrophobic and aromatic residues is also consistent with some previous studies. For example, Ile, Val and Leu have high positive propensities for interfaces have been reported by Bahadur et al.  and Yan et al. . In summary, we concluded that characteristics of interface residues are as follows: hydrophobic, aromatic and long side chain. These residues could form strong driving forces, such as hydrophobic interactions, which drive the formation of protein complexes and stabilize the resulting complexes.
The interface contact preference contacts in our analysis included three types of contacts: Cys–Cys, contacts between residues with opposite charges, and contacts between hydrophobic residues. The fact that Cys–Cys contacts have one of the highest preferences indicates the important role of this type of contacts in protein–protein interactions. These results are consistent with previous reports which claimed that disulfide bonds, salt bridges, and hydrophobic interactions represent the main forces in protein–protein interactions [13, 41–44]. This is also supported by the observations that at close distances, interactions between pairs of hydrophilic residues are principally important; whereas hydrophobic interactions are crucial at longer distances [13, 42, 43, 45]. Integrated with the interface preference residues and contacts, we found that that Arg, Phe, Trp and Tyr have the highest interface propensity. The residue and contact preference in interfaces observed in this analysis are consistent with the 'Double water exclusion’  which is refined from the O-Ring theory  and roles of interface residues in the previous reports [46, 47].
We analyzed the distributions of interface size, interface number and domain size. As shown in Fig. 5, the average interface size is approximate 800 Å2 and about 86% of interface sizes is in the range of 0-2000 Å2. Our observation is consistent with the interface size distribution reported by previous researches. In these studies, Yan et al. found that the distribution of interface sizes has a peak in the range of 600-800 Å2 (whose average is 1227 Å2)  and Lo Conte et al. reported that the buried area for each side of the interface is about 800 Å2 . Compared with the interface size, the domain size has a different distribution. Our research gives a generating candidate interface patches method using the interface size, interface number and domain size as constraint as Table 1.
Based on the above results, we constructed a novel quantitative residue interface propensity index (QIPI) which could be easily applied in the interface residue recognition approach. We concluded that QIPI shows clearly the effective improvement in interface residue recognition especially the coverage but its expense is losing accuracy as shown in Table 3. In order to further confirm the interface prediction power of QIPI and other interface features in our result, we developed a protein-protein interface residue recognition method SPR based on these characteristics of protein-protein interface. Through rigorous testing on independent datasets, SPR using a simple empirical scoring function shows comparable prediction power with other four popular interface prediction programs that most belong to the machine learning method especially for the coverage criterion. SPR could be applied to most protein-protein interface but its accuracy on enzyme protein interface (Enz35 dataset) is relative poor as shown in Table 5. This result demonstrates that characteristics of protein-protein interface extracted from our analysis, especially the QIPI, are effective in improving protein-protein interface residue recognition. Through analyze the all testing result (Additional file 2: Table S2 and Tables 3 and 4), we could conclude that the main contribution of QIPI is to significantly improve the coverage of interface residue recognition, while the cost is the loss of accuracy for the competition balance between coverage and accuracy.
In conclusion, we constructed a novel quantitative residue interface propensity index (QIPI) through building a comprehensive non-redundant protein-protein interface dataset Astral2.05-40-4506 and quantitatively analyzing the protein-protein interface by considering the effect of relative solvent accessibility of interface residues factors distributions. The QIPI with other interface features from our analysis was helpful to explore protein-protein interfaces, and solved some inconsistent observations in previous studies such as interface propensity of Arg and His. Moreover, the QIPI successfully improved the protein-protein interface residue recognition, which was confirmed by the contribution test (Table 3), performance of SPR (Tables 4 and 5) and 10-fold cross-validation test (Additional file 2: Table S2). Therefore, the QIPI not only depicts the protein-protein interface, but also improves the protein-protein interface residue recognition. Our work provides a systematic study of protein-protein interfaces, and we believe that the quantitative index, QIPI, will contribute to the development of protein-protein interaction research.
This article has been published as part of BMC Systems Biology Volume 10 Supplement 4, 2016: Proceedings of the 27th International Conference on Genome Informatics: systems biology. The full contents of the supplement are available online at http://bmcsystbiol.biomedcentral.com/articles/supplements/volume-10-supplement-4.
This work was supported by the grants from the Shanghai Sailing Program (16YF1408600), the Shanghai Natural Science Foundation (15ZR1430300), the Industrial Generic Technology Platform Construction Project of SITI (SKY2015003) and the National Natural Science Foundation of China (91529302 and 81672736). The cost of publication for this paper was covered by the Shanghai Sailing Program (16YF1408600).
Availability of data and materials
All data generated or analysed during this study are included in this published article and its supplementary information files.
WD and AW contributed to the design and conception of the study, conducted computational experiments, analyzed and interpreted data, developed the software and drafted the manuscript. LM joined in the processing of data materials and wrote part of the computer codes. YXL, YYL and TJ conceived of the project and participated in its design, helped to analyze and interpret the data and drafted the manuscript. All authors have read and approved the manuscript for publication.
The authors’ declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Janin J, Wodak SJ. Protein modules and protein-protein interaction. Introduction. Adv Protein Chem. 2002;61:1–8.View ArticlePubMedGoogle Scholar
- Levy ED, Pereira-Leal JB. Evolution and dynamics of protein interactions and networks. Curr Opin Struct Biol. 2008;18(3):349–57.View ArticlePubMedGoogle Scholar
- Reichmann D, Rahat O, Cohen M, Neuvirth H, Schreiber G. The molecular architecture of protein-protein binding sites. Curr Opin Struct Biol. 2007;17(1):67–76.View ArticlePubMedGoogle Scholar
- Vidal M, Cusick ME, Barabasi AL. Interactome networks and human disease. Cell. 2011;144(6):986–98.View ArticlePubMedPubMed CentralGoogle Scholar
- Davis FP, Barkan DT, Eswar N, McKerrow JH, Sali A. Host pathogen protein interactions predicted by comparative modeling. Protein Sci. 2007;16(12):2585–96.View ArticlePubMedPubMed CentralGoogle Scholar
- Loewenstein Y, Raimondo D, Redfern OC, Watson J, Frishman D, Linial M, Orengo C, Thornton J, Tramontano A. Protein function annotation by homology-based inference. Genome Biol. 2009;10(2):207.View ArticlePubMedPubMed CentralGoogle Scholar
- Lakey JH, Raggett EM. Measuring protein-protein interactions. Curr Opin Struct Biol. 1998;8(1):119–23.View ArticlePubMedGoogle Scholar
- Khan SH, Ahmad F, Ahmad N, Flynn DC, Kumar R. Protein-protein interactions: principles, techniques, and their potential role in new drug development. J Biomol Struct Dyn. 2011;28(6):929–38.View ArticlePubMedGoogle Scholar
- Nooren IM, Thornton JM. Diversity of protein-protein interactions. EMBO J. 2003;22(14):3486–92.View ArticlePubMedPubMed CentralGoogle Scholar
- Janin J, Bahadur RP, Chakrabarti P. Protein-protein interaction and quaternary structure. Q Rev Biophys. 2008;41(2):133–80.View ArticlePubMedGoogle Scholar
- Keskin O, Gursoy A, Ma B, Nussinov R. Principles of protein-protein interactions: what are the preferred ways for proteins to interact? Chem Rev. 2008;108(4):1225–44.View ArticlePubMedGoogle Scholar
- Jones S, Thornton JM. Principles of protein-protein interactions. Proc Natl Acad Sci U S A. 1996;93(1):13–20.View ArticlePubMedPubMed CentralGoogle Scholar
- Zhao N, Pang B, Shyu CR, Korkin D. Charged residues at protein interaction interfaces: unexpected conservation and orchestrated divergence. Protein Sci. 2011;20(7):1275–84.View ArticlePubMedPubMed CentralGoogle Scholar
- Yan C, Wu F, Jernigan RL, Dobbs D, Honavar V. Characterization of protein-protein interfaces. Protein J. 2008;27(1):59–70.View ArticlePubMedPubMed CentralGoogle Scholar
- Sudha G, Nussinov R, Srinivasan N. An overview of recent advances in structural bioinformatics of protein-protein interactions and a guide to their principles. Prog Biophys Mol Biol. 2014;116(2-3):141–50.View ArticlePubMedGoogle Scholar
- Clackson T, Wells JA. A hot spot of binding energy in a hormone-receptor interface. Science. 1995;267(5196):383–6.View ArticlePubMedGoogle Scholar
- Bogan AA, Thorn KS. Anatomy of hot spots in protein interfaces. J Mol Biol. 1998;280(1):1–9.View ArticlePubMedGoogle Scholar
- Li J, Liu Q. ‘Double water exclusion’: a hypothesis refining the O-ring theory for the hot spots at protein interfaces. Bioinformatics. 2009;25(6):743–50.View ArticlePubMedPubMed CentralGoogle Scholar
- Keskin O, Ma B, Nussinov R. Hot regions in protein--protein interactions: the organization and contribution of structurally conserved hot spot residues. J Mol Biol. 2005;345(5):1281–94.View ArticlePubMedGoogle Scholar
- Li X, Keskin O, Ma B, Nussinov R, Liang J. Protein–protein interactions: hot spots and structurally conserved residues often locate in complemented pockets that pre-organized in the unbound states: implications for docking. J Mol Biol. 2004;344(3):781–95.View ArticlePubMedGoogle Scholar
- Chakrabarti P, Janin J. Dissecting protein-protein recognition sites. Proteins. 2002;47(3):334–43.View ArticlePubMedGoogle Scholar
- Bahadur RP, Chakrabarti P, Rodier F, Janin J. A dissection of specific and non-specific protein-protein interfaces. J Mol Biol. 2004;336(4):943–55.View ArticlePubMedGoogle Scholar
- Esmaielbeiki R, Krawczyk K, Knapp B, Nebel J-C, Deane CM. Progress and challenges in predicting protein interfaces. Brief Bioinform. 2015. doi:https://doi.org/10.1093/bib/bbv027.
- Fox NK, Brenner SE, Chandonia J-M. SCOPe: Structural Classification of Proteins—extended, integrating SCOP and ASTRAL data and classification of new structures. Nucleic Acids Res. 2014;42(D1):D304–9.View ArticlePubMedGoogle Scholar
- Mintseris J, Wiehe K, Pierce B, Anderson R, Chen R, Janin J, Weng Z. Protein–protein docking benchmark 2.0: an update. Proteins: Struct, Funct, Bioinform. 2005;60(2):214–6.View ArticleGoogle Scholar
- Zhou H-X, Qin S. Interaction-site prediction for protein complexes: a critical assessment. Bioinformatics. 2007;23(17):2203–9.View ArticlePubMedGoogle Scholar
- Agrawal NJ, Helk B, Trout BL. A computational tool to predict the evolutionarily conserved protein-protein interaction hot-spot residues from the structure of the unbound protein. FEBS Lett. 2014;588(2):326–33.View ArticlePubMedGoogle Scholar
- Kawashima S, Pokarowski P, Pokarowska M, Kolinski A, Katayama T, Kanehisa M. AAindex: amino acid index database, progress report 2008. Nucleic acids research. 2008;36(Database issue):D202–205.PubMedGoogle Scholar
- Janin J, Wodak S. Conformation of amino acid side-chains in proteins. J Mol Biol. 1978;125(3):357–86.View ArticlePubMedGoogle Scholar
- Casari G, Sippl MJ. Structure-derived hydrophobic potential: hydrophobic potential derived from X-ray structures of globular proteins is able to identify native folds. J Mol Biol. 1992;224(3):725–32.View ArticlePubMedGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402.View ArticlePubMedPubMed CentralGoogle Scholar
- Henikoff S, Henikoff JG. Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A. 1992;89(22):10915–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Cao Y, Li L. Improved protein–ligand binding affinity prediction by using a curvature-dependent surface-area model. Bioinformatics. 2014. doi:https://doi.org/10.1093/bioinformatics/btu104.
- Qin S, Zhou H-X. meta-PPISP: a meta web server for protein-protein interaction site prediction. Bioinformatics. 2007;23(24):3386–7.View ArticlePubMedGoogle Scholar
- Chen H, Zhou HX. Prediction of interface residues in protein–protein complexes by a consensus neural network method: test against NMR data. Proteins: Struct, Funct, Bioinform. 2005;61(1):21–35.View ArticleGoogle Scholar
- Neuvirth H, Raz R, Schreiber G. ProMate: a structure based prediction program to identify the location of protein–protein binding sites. J Mol Biol. 2004;338(1):181–99.View ArticlePubMedGoogle Scholar
- Liang S, Zhang C, Liu S, Zhou Y. Protein binding site prediction using an empirical scoring function. Nucleic Acids Res. 2006;34(13):3698–707.View ArticlePubMedPubMed CentralGoogle Scholar
- Hwang H, Vreven T, Weng Z. Binding interface prediction by combining protein-protein docking results. Proteins. 2014;82(1):57–66.View ArticlePubMedGoogle Scholar
- Raih MF, Ahmad S, Zheng R, Mohamed R. Solvent accessibility in native and isolated domain environments: general features and implications to interface predictability. Biophys Chem. 2005;114(1):63–9.View ArticlePubMedGoogle Scholar
- Bahadur RP, Chakrabarti P, Rodier F, Janin J. Dissecting subunit interfaces in homodimeric proteins. Proteins. 2003;53(3):708–19.View ArticlePubMedGoogle Scholar
- Sheinerman FB, Norel R, Honig B. Electrostatic aspects of protein-protein interactions. Curr Opin Struct Biol. 2000;10(2):153–9.View ArticlePubMedGoogle Scholar
- Glaser F, Steinberg DM, Vakser IA, Ben‐Tal N. Residue frequencies and pairing preferences at protein–protein interfaces. Proteins: Struct, Funct, Bioinform. 2001;43(2):89–102.View ArticleGoogle Scholar
- Ofran Y, Rost B. Analysing six types of protein–protein interfaces. J Mol Biol. 2003;325(2):377–87.View ArticlePubMedGoogle Scholar
- McCoy AJ, Epa VC, Colman PM. Electrostatic complementarity at protein/protein interfaces. J Mol Biol. 1997;268(2):570–84.View ArticlePubMedGoogle Scholar
- Bahar I, Jernigan RL. Inter-residue potentials in globular proteins and the dominance of highly specific hydrophilic interactions at close separation. J Mol Biol. 1997;266(1):195–214.View ArticlePubMedGoogle Scholar
- Keskin O, Bahar I, Jernigan R, Badretdinov A, Ptitsyn O. Empirical solvent‐mediated potentials hold for both intra‐molecular and inter‐molecular inter‐residue interactions. Protein Sci. 1998;7(12):2578–86.View ArticlePubMedPubMed CentralGoogle Scholar
- Swapna LS, Bhaskara RM, Sharma J, Srinivasan N. Roles of residues in the interface of transient protein-protein complexes before complexation. Sci Rep. 2012;2.Google Scholar
- Conte LL, Chothia C, Janin J. The atomic structure of protein-protein recognition sites. J Mol Biol. 1999;285(5):2177–98.View ArticlePubMedGoogle Scholar