World Scientific
  • Search
  •   
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More
×

System Upgrade on Tue, May 28th, 2024 at 2am (EDT)

Existing users will be able to log into the site and access content. However, E-commerce and registration of new users may not be available for up to 12 hours.
For online purchase, please visit us again. Contact us at [email protected] for any enquiries.

GI-Cluster: Detecting genomic islands via consensus clustering on multiple features

    https://doi.org/10.1142/S0219720018400103Cited by:4 (Source: Crossref)

    The accurate detection of genomic islands (GIs) in microbial genomes is important for both evolutionary study and medical research, because GIs may promote genome evolution and contain genes involved in pathogenesis. Various computational methods have been developed to predict GIs over the years. However, most of them cannot make full use of GI-associated features to achieve desirable performance. Additionally, many methods cannot be directly applied to newly sequenced genomes. We develop a new method called GI-Cluster, which provides an effective way to integrate multiple GI-related features via consensus clustering. GI-Cluster does not require training datasets or existing genome annotations, but it can still achieve comparable or better performance than supervised learning methods in comprehensive evaluations. Moreover, GI-Cluster is widely applicable, either to complete and incomplete genomes or to initial GI predictions from other programs. GI-Cluster also provides plots to visualize the distribution of predicted GIs and related features. GI-Cluster is available at https://github.com/icelu/GI_Cluster.

    References

    • 1. Vernikos GS, Parkhill J, Resolving the structural features of genomic islands: A machine learning approach, Genome Res 18 (2) :331–342, 2008. Crossref, MedlineGoogle Scholar
    • 2. Langille MGI, Hsiao WWL, Brinkman FSL, Detecting genomic islands using bioinformatics approaches, Nat Rev Microbiol 8 (5) :373–382, 2010. Crossref, MedlineGoogle Scholar
    • 3. Dobrindt U, Hochhut B, Hentschel U, Hacker J, Genomic islands in pathogenic and environmental microorganisms, Nature Rev Microbiol 2 (5) :414–424, 2004. Crossref, MedlineGoogle Scholar
    • 4. Jani M, Mathee K, Azad RK, Identification of novel genomic islands in liverpool epidemic strain of Pseudomonas aeruginosa using segmentation and clustering, Front Microbiol 7 :1210, 2016. Crossref, MedlineGoogle Scholar
    • 5. Hacker J, Blum-Oehler G, Mühldorfer I, Tschäpe H, Pathogenicity islands of virulent bacteria: Structure, function and impact on microbial evolution, Mol Microbiol 23 (6) :1089–1097, 1997. Crossref, MedlineGoogle Scholar
    • 6. Schmidt H, Hensel M, Pathogenicity islands in bacterial pathogenesis, Clin Microbiol Rev 17 (1) :14–56, 2004. Crossref, MedlineGoogle Scholar
    • 7. Juhas M, van der Meer JR, Gaillard M, Harding RM, Hood DW, Crook DW, Genomic islands: Tools of bacterial horizontal gene transfer and evolution, FEMS Microbiol Rev 33 (2) :376–393, 2009. Crossref, MedlineGoogle Scholar
    • 8. Williams KP, Integration sites for genetic elements in prokaryotic tRNA and tmRNA genes: Sublocation preference of integrase subfamilies, Nucleic Acids Res 30 (4) :866–875, 2002. Crossref, MedlineGoogle Scholar
    • 9. Bellanger X, Payot S, Leblond-Bourget N, Guédon G, Conjugative and mobilizable genomic islands in bacteria: Evolution and diversity, FEMS Microbiol Rev 38 :720–760, 2014. Crossref, MedlineGoogle Scholar
    • 10. Lu B, Leong HW, Computational methods for predicting genomic islands in microbial genomes, Comput Struct Biotechnol J 14 :200–206, 2016. Crossref, MedlineGoogle Scholar
    • 11. Vernikos GS, Parkhill J, Interpolated variable order motifs for identification of horizontally acquired DNA: Revisiting the Salmonella pathogenicity islands, Bioinf 22 (18) :2196–2203, 2006. Crossref, MedlineGoogle Scholar
    • 12. Lu B, Leong HW, GI-SVM: A sensitive method for predicting genomic islands based on unannotated sequence of a single genome, J Bioinform Comput Biol 14 (1) :1640003, 2016. LinkGoogle Scholar
    • 13. Waack S, Keller O, Asper R, Brodag T, Damm C, Fricke WF, Surovcik K, Meinicke P, Merkl R, Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models, BMC Bioinf 7 :142, 2006. Crossref, MedlineGoogle Scholar
    • 14. Bertelli C, Laird MR, Williams KP, Lau BY, Hoad G, Winsor GL, Brinkman F, Islandviewer 4: Expanded prediction of genomic islands for larger-scale datasets, Nucleic Acids Res 45 (W1) :W30–W35, 2017. Crossref, MedlineGoogle Scholar
    • 15. Yoon SH, Hur CG, Kang HY, Kim YH, Oh TK, Kim JF, A computational approach for identifying pathogenicity islands in prokaryotic genomes, BMC Bioinf 6 :184, 2005. Crossref, MedlineGoogle Scholar
    • 16. Hsiao W, Wan I, Jones SJ, Brinkman FSL, IslandPath: Aiding detection of genomic islands in prokaryotes, Bioinf 19 (3) :418–420, 2003. Crossref, MedlineGoogle Scholar
    • 17. Hudson CM, Lau BY, Williams KP, Islander: A database of precisely mapped genomic islands in tRNA and tmRNA genes, Nucleic Acids Res 43 (D1) :D48–D53, 2015. Crossref, MedlineGoogle Scholar
    • 18. Che D, Hockenbury C, Marmelstein R, Rasheed K, Classification of genomic islands using decision trees and their ensemble algorithms, BMC Genomics 11 (Suppl 2) :S1, 2010. Crossref, MedlineGoogle Scholar
    • 19. Che D, Wang H, Fazekas J, Chen B, An accurate genomic island prediction method for sequenced bacterial and archaeal genomes, J Proteomics Bioinform 7 (8) :214, 2014. Google Scholar
    • 20. Lee CC, Chen YPP, Yao TJ, Ma CY, Lo WC, Lyu PC, Tang CY, GI-POP: A combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects, Gene 518 (1) :114–123, 2013. Crossref, MedlineGoogle Scholar
    • 21. Azad RK, Lawrence JG, Towards more robust methods of alien gene detection, Nucleic Acids Res 39 (9) :e56, 2011. Crossref, MedlineGoogle Scholar
    • 22. Vega-Pons S, Ruiz-Shulcloper J, A survey of clustering ensemble algorithms, Int J Pattern Recognit 25 (3) :337–372, 2011. LinkGoogle Scholar
    • 23. Zhang R, Zhang CT, A systematic method to identify genomic islands and its applications in analyzing the genomes of Corynebacterium glutamicum and Vibrio vulnificus CMCP6 chromosome I, Bioinf 20 (5) :612–622, 2004. Crossref, MedlineGoogle Scholar
    • 24. Arvey AJ, Azad RK, Raval A, Lawrence JG, Detection of genomic islands via segmental genome heterogeneity, Nucleic Acids Res 37 (16) :5255–5266, 2009. Crossref, MedlineGoogle Scholar
    • 25. Tsirigos A, Rigoutsos I, A new computational method for the detection of horizontal gene transfer events, Nucleic Acids Res 33 (3) :922–933, 2005. Crossref, MedlineGoogle Scholar
    • 26. Monti S, Tamayo P, Mesirov J, Golub T, Consensus clustering: A resampling-based method for class discovery and visualization of gene expression microarray data, Mach Learn 52 (1–2) :91–118, 2003. CrossrefGoogle Scholar
    • 27. Wilkerson MD, Hayes DN, ConsensusClusterPlus: A class discovery tool with confidence assessments and item tracking, Bioinf 26 (12) :1572–1573, 2010. Crossref, MedlineGoogle Scholar
    • 28. Wang H, Song M, Ckmeans. 1d. dp: Optimal k-means clustering in one dimension by dynamic programming, R J 3 (2) :29–33, 2011. Crossref, MedlineGoogle Scholar
    • 29. Scrucca L, Fop M, Murphy TB, Raftery AE, mclust 5: Clustering, classification and density estimation using gaussian finite mixture models, R J 8 (1) :289–317, 2016. Crossref, MedlineGoogle Scholar
    • 30. Wei W, Gao F, Du MZ, Hua HL, Wang J, Guo FB, Zisland explorer: Detect genomic islands by combining homogeneity and heterogeneity properties, Brief Bioinf 18 (3) :357–366, 2017. MedlineGoogle Scholar
    • 31. Winstanley C, Langille MGI, Fothergill JL, Kukavica-Ibrulj I, Paradis-Bleau C, Sanschagrin F, Thomson NR, Winsor GL, Quail MA, Lennard N, Bignell A, Clarke L, Seeger K, Saunders D, Harris D, Parkhill J, Hancock REW, Brinkman FSL, Levesque RC, Newly introduced genomic prophage islands are critical determinants of in vivo competitiveness in the liverpool epidemic strain of Pseudomonas aeruginosa, Genome Res 19 (1) :12–23, 2009. Crossref, MedlineGoogle Scholar
    • 32. Bi D, Xu Z, Harrison EM, Tai C, Wei Y, He X, Jia S, Deng Z, Rajakumar K, Ou HY, ICEberg: A web-based resource for integrative and conjugative elements found in Bacteria, Nucleic Acids Res 40 (D1) :D621–D626, 2012. Crossref, MedlineGoogle Scholar
    • 33. Chun J, Grim CJ, Hasan NA, Lee JH, Choi SY, Haley BJ, Taviani E, Jeon YS, Kim DW, Lee JH, et al., Comparative genomics reveals mechanism for short-term and long-term clonal transitions in pandemic Vibrio cholerae, Proc Natl Acad Sci USA 106 (36) :15442–15447, 2009. Crossref, MedlineGoogle Scholar
    • 34. Langille MGI, Hsiao WWL, Brinkman FSL, Evaluation of genomic island predictors using a comparative genomics approach., BMC Bioinf 9 :329, 2008. Crossref, MedlineGoogle Scholar
    • 35. Langille MG, Brinkman FS, IslandViewer: An integrated interface for computational identification and visualization of genomic islands, Bioinf 25 (5) :664–665, 2009. Crossref, MedlineGoogle Scholar
    • 36. Flannery EL, Mody L, Mobley HL, Identification of a modular pathogenicity island that is widespread among urease-producing uropathogens and shares features with a diverse group of mobile elements, Infect Immun 77 (11) :4887–4894, 2009. Crossref, MedlineGoogle Scholar