Proteins with internal repeat structures present particular challenges to methods of classification. Major repeat patterns are straightforward to identify and tend to dominate the annotation of sequences conforming to them. However, it may be difficult to find sub-levels within such patterns that can be correlated with specific functions. Leucine-rich repeat (LRR) proteins provide a typical example. Their canonical repeat pattern is well established, but it remains difficult to establish specific markers for subcategories. Different protein databases (SMART, InterPro, PRINTS, Pfam, etc.) usually define the canonical leucine-rich repeat, but in addition they describe different subtypes of repeats to account for specific characteristics: bacterial type, cysteine-rich type, ribonuclease inhibitor type, etc. [1, 2]. Many LRR proteins contain characteristic Cys-rich capping motifs conserved across species and lineages, and the most common N-terminal and C-terminal LRR-capping motifs have been described in different databases. Recently we determined the crystal structure of decorin, which is the archetypal representative of the extracellular LRR subfamily of small leucine-rich repeat proteins and proteoglycans (SLRPs). The decorin structure shows a unique C-terminal capping motif that does not conform to the most commonly observed type. We have been able to define a consensus pattern that correctly and uniquely identifies all known sequences containing this capping motif, which we propose is the defining characteristic of the entire SLRP subfamily. The collection of sequences allows us to trace the evolutionary path of SLRPs across the vertebrate lineage (Figure 1). This pattern will be useful in the automatic sequence annotation of LRR proteins belonging to the SLRP subfamily.
Faculty of Life Sciences, the University of Manchester, Manchester, UK
Enkhbayar P, Kamiya M, Osaki M, Matsumoto T, Matsushima N: Structural principles of leucine-rich repeat (LRR) proteins. Proteins. 2004, 54: 394-403. 10.1002/prot.10605
Kobe B, Kajava AV: The leucine-rich repeat as a protein recognition motif. Curr Opin Struct Biol. 2001, 11: 725-732. 10.1016/S0959-440X(01)00266-4
McEwan PA, Scott PG, Bishop PN, Bella J: Structural correlations in the family of small leucine-rich repeat proteins and proteoglycans. J Struct Biol. 2006, 155: 294-305. 10.1016/j.jsb.2006.01.016
Scott PG, McEwan PA, Dodd CM, Bergmann EM, Bishop PN, Bella J: Crystal structure of the dimeric protein core of decorin, the archetypal small leucine-rich repeat proteoglycan. Proc Natl Acad Sci USA. 2004, 101: 15633-15638. 10.1073/pnas.0402976101