- Research article
- Open Access
The whole-organism heavy chain B cell repertoire from Zebrafish self-organizes into distinct network features
© Ben-Hamo and Efroni; licensee BioMed Central Ltd. 2011
- Received: 19 September 2010
- Accepted: 10 February 2011
- Published: 10 February 2011
The adaptive immune system is based on selected populations of molecularly distinct individual B and T cell clones. However, it has not been possible to characterize these clones in a comprehensive and informatics manner to date; attempts have been limited by the number of cells in the adaptive immune system and an inability to quantify them. Recently, using the Zebrafish (ZF) Danio rerio as a model organism and parallel sequencing as the quantifying technology, Weinstein et al. overcame this major hurdle and quantified the entire heavy chain B-cell repertoire in ZF. Here, we present a novel network analysis of the data from the Weinstein group, providing new insights into the network structure of the B-cell repertoire.
Using a collection of computational methods, the IgM sequences from 14 fish were analyzed. This analysis demonstrated that the B-cell repertoire of the ZF is structured along similar lines to those previously detected in limited parts of the human B-cell immune system. The analysis confirms the validity of the global data and the evolutionary placement of the ZF based on known sequence motifs. Recombination events in the repertoire were quantified, and demonstrated a lack of shared recombined V, J groups across fish. Nevertheless, it was demonstrated that a similar network architecture is shared among fish. However, the network analysis identified two distinct populations within the group; these findings are compatible with the occurrence of an immune response in a subset of the fish. The emerging connectivity network was demonstrated and quantified, and mutation drifts within the groups were characterized. Dissection of sequence data revealed common network features of the B-cell repertoire as well as individual differences.
The ZF B-cell repertoire reveals an underlying order that is compatible with self-organization representing every portion of the sequence-based network. This pattern varies in individual specimens, perhaps as a response to an immune challenge. However, a sequence-non-specific network that maintains a common architecture of sequence diversity was detected.
The common feature among different individuals can be captured by the network architecture and characteristics, rather than specific clones. We believe that further study of the dynamics of this network could provide insight into modes of operation of the immune system.
- Network Architecture
- Random Network
- Adaptive Immune System
- Germline Sequence
The immune system is a remarkably adaptive defense and maintenance system that has evolved in vertebrates to protect against invading pathogenic microorganisms and to maintain homeostasis. The immune system has two arms: the innate arm, which is activated by innate ligands, and the adaptive arm, which includes T cells and B cells that recognize antigens via their specific antigen receptors (TCR and BCR) [1, 2].
B cells, a component of the adaptive immune system, mature within the bone marrow; when they exit to the periphery as naive B cells they express a unique antigen-binding receptor, immunoglobulin (Ig), on their membrane. When activated by the antigen specific to its membrane-bound antibody, a B cell proliferates and differentiates to generate plasma cells that secrete Ig molecules, and also memory cells [1, 3, 4].
B-cell maturation depends on rearrangement of the Ig in a process known as V (D) J rearrangement. By randomly choosing V, D and J genes among many alleles, the recombination provides a variety of antigen sequences. The process is highly conserved in jawed vertebrates [5, 6]. The whole collection of various, rearranged immune receptors is known as the B-cell repertoire.
Additional variability within the B-cell repertoire arises from somatic hypermutation (SHM) - a recombination process that occurs in germinal centers in which the recombined immunoglobulin undergoes error-prone replication during an in vivo selection process. These mutations are several orders of magnitude more frequent than in genes encoding other proteins [7–11]. Several mutations yield amino acid substitutions that improve antigen binding by increasing the antigen affinity and diversity .
The Zebrafish (ZF), Danio rerio, offers unique opportunities for studying the ontogenetic development of the immune system. A great advantage in studying this organism is the optical transparency of ZF during early development and the fact that it shares many orthologous genes with mouse and man (e.g., rag1 and rag2). This gives the species considerable relevance over other traditional developmental models [13–15]. The ZF immune system has approximately 300,000 antibody-producing cells. This is a small number compared to an estimated 1012 cells in humans. Therefore, ZF is an excellent model organism for global quantitative analysis of the immune system, including the B-cell repertoire.
Technically, questions concerning the B cell repertoire have been limited by: (1) the scale of the system, with more than 1012 cells in humans and a similar order of magnitude in mice; (2) an inability to measure diversity within the whole population of B cells. The scale of the system is solved by using ZF as a model organism, with 105 B cells generating an immune system of comparable complexity. Quantifying diversity is currently approachable using platforms of massive parallel sequencing. While such systems cannot sequence recombined regions (~200 bp in length) from 1012 cells within a reasonable time frame, they are capable of sequencing recombined regions from the approximate 105 cells of ZF.
Recently, Weinstein et al.  constructed the sequence reads of 640 million bases of zebrafish antibody heavy chain cDNA from 14 zebrafish in four families, and the data are publicly available. The authors characterized and analyzed the antibody repertoire of zebrafish by analyzing complementary determining region 3 (CDR3) of the heavy chain.
The concept of network architecture has changed the perspective on biological phenomena recently [17–19]. Research has demonstrated the importance of network architecture in terms of phenotype classification, emergence and function (for a recent review, see ). Such research suggests, for example, that the topological prominence of a protein in a protein interaction is a good predictor of its biological importance. Network architecture analysis provides the tools to study structural properties in networks. One important method is centrality analysis. Such analysis ranks vertices in a network according to their connectivity within a network structure [18, 21, 22]. Centrality indices of a network include the Degree of each vertex (the number of physical connections a vertex has) and the Betweenness of a vertex which measure the number of shortest paths that the vertex lies on (high Betweenness signifies this vertex as one that resides on several short paths). Betweenness of a node is the proportion of shortest paths among all pairs of reachable nodes that go through the node; Betweenness measures the centrality of a node in the global network structure .
Protein-Protein interaction networks (PPI) becoming a key strategy for uncovering the inner mechanism of a cell, these networks systematically determine both the potential and actual protein interactions in selected model organisms, such as the S. cerevisiae. PPI networks identify every node with a specific protein and the edges stands for identified direct physical interaction. The protein interactions and its position in the network are likely to correlate with the protein functional properties. The nodes themselves possess characteristics that carry important information about their role in the system, (e.g highly connected nodes tend to be more essential) [23–26].
In the present study, the dataset of Weinstein et al. was analyzed to characterize the whole recombined sequence, and to investigate V (D) J distribution among all 14 fish and the mutations that occur within the sequences. In addition, the Weinstein data were extended to characterize the connectivity network of the ZF B-cell repertoire and study mutation drifts as multiple nodes within the groups. Unlike PPI networks here every node represent a different sequence in the repertoire and edges represents mutations or indels (insertion or deletion) that may occurred. This form of network-based interpretation facilitates the identification of unique sequences versus groups of sequences and indicates their centrality in the network. Using these computational tools that facilitate the dissection of sequence data, the fundamental repertoire architecture was examined to investigate how the repertoire varies among individuals and the common network features that dictate the B-cell repertoire. Further, we report that in the dataset of Weinstein et al. there are two distinct immune behaviors, which have not been identified in previous analyses [16, 27]. We hypothesize that these distinct immune behaviors are immune responses that may be the result of external perturbations such as a yet unknown pathogenic response.
The Ig network that emerged from the Ig sequences in each fish was analyzed. In addition, to highlight the specific choices in the selection process, the observed network was compared with an artificial, randomly-based network. The Random network algorithm randomly chose a combination of three germline sequences (V, D and J) in order to build a "naive" sequence, then the algorithm randomly put point mutations, deletions and insertions into every sequence. The random network does not represent a population of B cell but the lowest level of organization, one with no bias toward any specific germline combination or mutation.
A metric evaluation of the Ig network architecture demonstrated major differences between the groups.
Each vertex in the network refers to a different sequence and the size of the vertex represents the number of identical sequences. A red edge indicates a point mutation between the two sequences, a blue edge indicates one deletion and a green edge represents deletion and mutation. Each component is marked by a distinct color, so that the node color ascribes the node to a specific connected component of the network.
Fish in the highly connected group exhibit large clusters of components. Such clusters may be the network equivalent of an immune system response to a foreign antigen. The rationale behind this categorization of inflamed vs. naive fish is that the basis for network behavior is similarity between clones, or clonal expansion. The components are highly connected, and the vertices in these clusters are bigger than those of relative networks. In the poorly connected group of fish (such as fish D) the same distribution of all sequences is observed and the highly connected, massive components are noticeably absent from such networks. We would expect to observe 975 (39V × 5D × 5J) connected clusters, representing all the possible V (D) J combinations. However, hundreds of thousands of components were identified. Therefore, the mutation process is such that once sequences have been generated, the network algorithm cannot connect them to one another in a single step.
As described previously, networks can be assessed by studying and comparing their centrality indices, and in the present study we examined the Degree and Betweenness of the vertex in every network and the size of the vertex. This analysis demonstrated that poorly connected fish have vertices that contain a maximum of 10% of their sequences, while highly connected fish have vertices that contain 50% of their sequences. Furthermore, the vertices in connected fish were of much higher degree than in poorly connected fish, as were levels of network Betweenness centrality, indicating several connected clones that could be the result of an immune response in those fish.
V/J segment analysis
The Ig sequences from the original paper by Weinstein et al. were obtained from 14 fish, with each family being raised in separate aquaria and having normal interactions with the environment including the development of natural internal flora . V and J segments from all 14 fish were analyzed, and germline sequences were matched with sequence reads to demonstrate their distribution among individual fish.
One of the hallmarks of the adaptive immune system is its ability to recognize large numbers of antigens. B-cell repertoire diversity is generated by two distinct processes that modify the Ig sequence.
The primary repertoire is generated in the bone marrow by a process of V (D) J recombination, with further diversification via two pathways: (1) somatic hyper-mutation (SHM), a process that introduces a high frequency of point mutations into the variable region gene of the Ig; (2) relatively rare insertions and deletions of nucleotides. Recent studies have demonstrated that the molecular processes of SHM and class-switch recombination (CSR) require the activity of activation-induced cytidine deaminase (AID), which removes C to form a U nucleotide on the mRNA sequence. This change activates proteins that mediate SHM and CSR [28–30].
ZF share several orthologous genes with mouse and human, which gives this species considerable relevance over other traditional developmental models . Moreover, previous works have identified several regions of the ZF and human genome that encode the same or similar genes [33–35]. In addition, the results presented herein reinforce the similarity between these two species at the level of the mutation process and base specificity.
In addition to the DH-JH region motifs identified, the specific motif EDTAVYYCAR was present in the VH region; part of this motif (AVYYCAR) was previously identified in the TCR cDNA V region of Horn shark (Heterodontus francisci ) . These findings support the idea that the Ig heavy chain in teleosts is evolutionarily stable.
Comprehensive analysis of the first sequenced whole organism Ig repertoire, as provided by Weinstein et al., resulted in the following conclusions:
By comparing somatic mutation exchange across silent and replacement mutations, a remarkable similarity between human and ZF was evident, a similarity that places the ZF within a proper context as a relevant model for studying selection mechanisms.
The ZF evolutionary context was investigated by comparing sequence motifs identified in ZF to evolutionary adjacent fish species.
VJ combination analysis across individuals was carried out in order to investigate how recombined VJ regions are used in different individuals. This analysis revealed that specific portions of the repertoire are utilized by the recombination mechanisms, while others are clearly absent (figure 8).
Mora et al. demonstrated that the data from Weinstein et al. prove that "antibody diversity is not limited by the sequences encoded in the genome" . Furthermore, the authors demonstrated an intriguing set of properties that they derived from the data. However, the original paper by Weinstein et al. and the paper by Mora et al. do not demonstrate the clear stratification of the group of fish, as presented in this study. This research has demonstrated that there are two distinct groups of fish. One group presents a uniform use of VJ recombination and a connected, uniform-like network of Ig molecules, while the other presents a much higher frequency of a subset of VJ recombination and a highly connected sub-network of Ig molecules.
We hypothesize that these two groups represent two distinct populations. The group that represents a random-like pattern of behavior is composed of fish with a basal immune response, while the second set of fish represents those with a more complex immune response that may be the result of external perturbations. The network transformation performed over sequences demonstrates this visually by highlighting the special connectivity patterns, and quantifiably by using network architecture metrics that demonstrate major differences between the two groups.
The mechanisms that determine the connections made by individual sequences across the network, and the meaning of these network connections, are the subject of future work. The temporal behavior of the network during an immune response will undoubtedly shed light on the specific choices made among clones.
The shape of the immunological repertoire is at the heart of our understanding of an immune response, self-non-self discrimination, autoimmunity and immune modulation. As current technologies facilitate organism-wide measurements, a better understanding of this fundamental system is tied to our ability to influence the immunological repertoire in a clinically related manner.
All sequences were obtained from supporting online information from the report published by Weinstein et al. . A total of 14 fish and approximately one million sequences were used. Germline sequences were obtained from Danilova et al. .
The Zebrafish heavy chain has 39 V genes, 5 D genes and 5 J genes: overall, there are 975 (39 × 5 × 5) optional combinations of V (D) J sequences . Here, using BLAST  alignment, V and J germline sequences were matched. From a sequence perspective, the five D genes were very close. Therefore, D gene alignment was not used as an indicator and the D region was defined for analyses as the segment between the identifiable parts of V and J .
All sequences with a BLAST E-value higher than 10-8 were removed from the analysis owing to insufficient fit to either V or J (overall, less than 0.5% sequences were extracted). Results presented have been normalized across individual fish.
Mutation analysis was carried out by aligning each sequence to its germline and tallying base substitutions, deletions and insertions in the sequence. Once all mutations were determined, the ORF was identified and the type of mutation was tagged as silent or replacement, thus tallying the amount of silent and replacement mutations and their frequencies within the four nucleotides. The results were compared with the analysis previously carried out on humans . The nucleotide sequences were translated into amino acid sequences in order to be analyzed and to allow the search for specific motifs that are consistent in the majority of sequences. This analysis revealed a pattern that was previously found in other vertebrates.
Network analysis was carried out using Pajek , a program for network layout and analyses. Each vertex in the network represents a different sequence. The size of the vertex is comparable to the number of sequences matching a specific vertex perfectly. The distance between a pair of connected nodes in the network is one mutation (red edge), one deletion (blue edge) or mutation and deletion (green edge). Nodes in every cluster are colored differently, such that each cluster represents a connected component inside the whole network. In other words, in every component there is a path connecting every two vertices. Network centrality indices, which are a key measurement in assessing any network, were calculated using Pajek and are presented in the results section.
The authors wish to thank Irun Cohen for his help.
Dr. Sol Efroni is supported by the European Union through the IRG program.
This work has been supported by the Israel Science Foundation (ISF) 1601/10, ISF-Bikura.
- Richard A, Goldsby TJK, Barbara A, Osborne : KUBY IMMUNOLOGY. W.H. Freeman & Company; 2000.Google Scholar
- Cohen IR: Tending Adam's garden: evolving the cognitive immune self. San Diego, CA: Academic Press; 2000.Google Scholar
- Parham P: The immune System. New York, Garland Publishing Press; 2009.Google Scholar
- Andersson E, Matsunaga T: Evolutionary stability of the immunoglobulin heavy chain variable region gene families in teleost. Immunogenetics 1998,47(3):272-277. 10.1007/s002510050357View ArticlePubMedGoogle Scholar
- Hansen JD, McBlane JF: Recombination-activating genes, transposition, and the lymphoid-specific combinatorial immune system: a common evolutionary connection. Curr Top Microbiol Immunol 2000, 248: 111-135.PubMedGoogle Scholar
- Li Z, Chang Y: V(D)J recombination in zebrafish: Normal joining products with accumulation of unresolved coding ends and deleted signal ends. Mol Immunol 2007,44(7):1793-1802. 10.1016/j.molimm.2006.07.295PubMed CentralView ArticlePubMedGoogle Scholar
- Clark LA, Ganesan S, Papp S, van Vlijmen HW: Trends in antibody sequence changes during the somatic hypermutation process. J Immunol 2006,177(1):333-340.View ArticlePubMedGoogle Scholar
- Kepler TB, Perelson AS: Somatic hypermutation in B cells: an optimal control treatment. J Theor Biol 1993,164(1):37-64. 10.1006/jtbi.1993.1139View ArticlePubMedGoogle Scholar
- Foster SJ, Dorner T, Lipsky PE: Somatic hypermutation of VkappaJkappa rearrangements: targeting of RGYW motifs on both DNA strands and preferential selection of mutated codons within RGYW motifs. Eur J Immunol 1999,29(12):4011-4021. 10.1002/(SICI)1521-4141(199912)29:12<4011::AID-IMMU4011>3.0.CO;2-WView ArticlePubMedGoogle Scholar
- Berek C, Milstein C: The dynamic nature of the antibody repertoire. Immunol Rev 1988, 105: 5-26. 10.1111/j.1600-065X.1988.tb00763.xView ArticlePubMedGoogle Scholar
- Kleinstein SH, Louzoun Y, Shlomchik MJ: Estimating hypermutation rates from clonal tree data. J Immunol 2003,171(9):4639-4649.View ArticlePubMedGoogle Scholar
- Jolly CJ, Wagner SD, Rada C, Klix N, Milstein C, Neuberger MS: The targeting of somatic hypermutation. Semin Immunol 1996,8(3):159-168. 10.1006/smim.1996.0020View ArticlePubMedGoogle Scholar
- Yoder JA, Nielsen ME, Amemiya CT, Litman GW: Zebrafish as an immunological model system. Microbes Infect 2002,4(14):1469-1478. 10.1016/S1286-4579(02)00029-1View ArticlePubMedGoogle Scholar
- Willett CE, Cherry JJ, Steiner LA: Characterization and expression of the recombination activating genes (rag1 and rag2) of zebrafish. Immunogenetics 1997,45(6):394-404. 10.1007/s002510050221View ArticlePubMedGoogle Scholar
- Trede NS, Langenau DM, Traver D, Look AT, Zon LI: The use of zebrafish to understand immunity. Immunity 2004,20(4):367-379. 10.1016/S1074-7613(04)00084-6View ArticlePubMedGoogle Scholar
- Weinstein JA, Jiang N, White RA, Fisher DS, Quake SR: High-throughput sequencing of the zebrafish antibody repertoire. Science 2009,324(5928):807-810. 10.1126/science.1170020PubMed CentralView ArticlePubMedGoogle Scholar
- Tavazoie S, Hughes JD, Campbell MJ, Cho RJ, Church GM: Systematic determination of genetic network architecture. Nat Genet 1999,22(3):281-285. 10.1038/10343View ArticlePubMedGoogle Scholar
- Jeong H, Mason SP, Barabasi AL, Oltvai ZN: Lethality and centrality in protein networks. Nature 2001,411(6833):41-42. 10.1038/35075138View ArticlePubMedGoogle Scholar
- Ozier O, Amin N, Ideker T: Global architecture of genetic interactions on the protein network. Nat Biotechnol 2003,21(5):490-491. 10.1038/nbt0503-490View ArticlePubMedGoogle Scholar
- Barabasi AL: Scale-free networks: a decade and beyond. Science 2009,325(5939):412-413. 10.1126/science.1173299View ArticlePubMedGoogle Scholar
- Girvan M, Newman ME: Community structure in social and biological networks. Proc Natl Acad Sci USA 2002,99(12):7821-7826. 10.1073/pnas.122653799PubMed CentralView ArticlePubMedGoogle Scholar
- Zotenko E, Mestre J, O'Leary DP, Przytycka TM: Why do hubs in the yeast protein interaction network tend to be essential: reexamining the connection between the network topology and essentiality. PLoS Comput Biol 2008,4(8):e1000140. 10.1371/journal.pcbi.1000140PubMed CentralView ArticlePubMedGoogle Scholar
- He XL, Zhang JZ: Why do hubs tend to be essential in protein networks? Plos Genetics 2006,2(6):826-834. 10.1371/journal.pgen.0020088View ArticleGoogle Scholar
- Jeong H, Mason SP, Barabasi AL, Oltvai ZN: Lethality and centrality in protein networks. Nature 2001,411(6833):41-42. 10.1038/35075138View ArticlePubMedGoogle Scholar
- Park JY, Barabasi AL: Distribution of node characteristics in complex networks. Proceedings of the National Academy of Sciences of the United States of America 2007,104(46):17916-17920. 10.1073/pnas.0705081104PubMed CentralView ArticlePubMedGoogle Scholar
- Yook SH, Oltvai ZN, Barabasi AL: Functional and topological characterization of protein interaction networks. Proteomics 2004,4(4):928-942. 10.1002/pmic.200300636View ArticlePubMedGoogle Scholar
- Mora T, Walczak AM, Bialek W, Callan CG Jr: Maximum entropy models for antibody diversity. Proc Natl Acad Sci USA 2010,107(12):5405-5410. 10.1073/pnas.1001705107PubMed CentralView ArticlePubMedGoogle Scholar
- Zheng NY, Wilson K, Jared M, Wilson PC: Intricate targeting of immunoglobulin somatic hypermutation maximizes the efficiency of affinity maturation. J Exp Med 2005,201(9):1467-1478. 10.1084/jem.20042483PubMed CentralView ArticlePubMedGoogle Scholar
- Papavasiliou FN, Schatz DG: The activation-induced deaminase functions in a postcleavage step of the somatic hypermutation process. J Exp Med 2002,195(9):1193-1198. 10.1084/jem.20011858PubMed CentralView ArticlePubMedGoogle Scholar
- Muramatsu M, Sankaranand VS, Anant S, Sugai M, Kinoshita K, Davidson NO, Honjo T: Specific expression of activation-induced cytidine deaminase (AID), a novel member of the RNA-editing deaminase family in germinal center B cells. J Biol Chem 1999,274(26):18470-18476. 10.1074/jbc.274.26.18470View ArticlePubMedGoogle Scholar
- Tanaka A, Shen HM, Ratnam S, Kodgire P, Storb U: Attracting AID to targets of somatic hypermutation. J Exp Med 207(2):405-415. 10.1084/jem.20090821Google Scholar
- Hershberg U, Shlomchik MJ: Differences in potential for amino acid change after mutation reveals distinct strategies for kappa and lambda light-chain variation. Proc Natl Acad Sci USA 2006,103(43):15963-15968. 10.1073/pnas.0607581103PubMed CentralView ArticlePubMedGoogle Scholar
- Barbazuk WB, Korf I, Kadavi C, Heyen J, Tate S, Wun E, Bedell JA, McPherson JD, Johnson SL: The syntenic relationship of the zebrafish and human genomes. Genome Res 2000,10(9):1351-1358. 10.1101/gr.144700PubMed CentralView ArticlePubMedGoogle Scholar
- Postlethwait JH, Woods IG, Ngo-Hazelett P, Yan YL, Kelly PD, Chu F, Huang H, Hill-Force A, Talbot WS: Zebrafish comparative genomics and the origins of vertebrate chromosomes. Genome Res 2000,10(12):1890-1902. 10.1101/gr.164800View ArticlePubMedGoogle Scholar
- Liu TX, Zhou Y, Kanki JP, Deng M, Rhodes J, Yang HW, Sheng XM, Zon LI, Look AT: Evolutionary conservation of zebrafish linkage group 14 with frequently deleted regions of human chromosome 5 in myeloid malignancies. Proc Natl Acad Sci USA 2002,99(9):6136-6141. 10.1073/pnas.072560099PubMed CentralView ArticlePubMedGoogle Scholar
- Andersson E, Matsunaga T: Evolution of immunoglobulin heavy chain variable region genes: a VH family can last for 150-200 million years or longer. Immunogenetics 1995,41(1):18-28. 10.1007/BF00188428View ArticlePubMedGoogle Scholar
- Kabat EA: The paucity of species-specific amino acid residues in the variable regions of human and mouse Bence-Jones proteins and its evolutionary and genetic implications. Proc Natl Acad Sci USA 1967,57(5):1345-1349. 10.1073/pnas.57.5.1345PubMed CentralView ArticlePubMedGoogle Scholar
- Andersson E, Tormanen V, Matsunaga T: Evolution of a VH gene family in low vertebrates. Int Immunol 1991,3(6):527-533. 10.1093/intimm/3.6.527View ArticlePubMedGoogle Scholar
- Rast JP, Litman GW: T-cell receptor gene homologs are present in the most primitive jawed vertebrates. Proc Natl Acad Sci USA 1994,91(20):9248-9252. 10.1073/pnas.91.20.9248PubMed CentralView ArticlePubMedGoogle Scholar
- Mora T, Walczak AM, Bialek W, Callan CG Jr: Maximum entropy models for antibody diversity. Proc Natl Acad Sci USA 107(12):5405-5410. 10.1073/pnas.1001705107Google Scholar
- Danilova N, Bussmann J, Jekosch K, Steiner LA: The immunoglobulin heavy-chain locus in zebrafish: identification and expression of a previously unknown isotype, immunoglobulin Z. Nat Immunol 2005,6(3):295-302. 10.1038/ni1166View ArticlePubMedGoogle Scholar
- McGinnis S, Madden TL: BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res 2004, (32 Web Server):W20-25. 10.1093/nar/gkh435Google Scholar
- Batagelj VMA: Pajek: A program for large network analysis. Connections 1998, 47-57.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.