Show simple item record Riley, Michael Charles Clare, Amanda King, Ross Donald 2008-08-19T07:56:37Z 2008-08-19T07:56:37Z 2007-03-30
dc.identifier.citation Riley , M C , Clare , A & King , R D 2007 , ' Locational distribution of gene functional classes in Arabidopsis thaliana ' BMC Bioinformatics , vol 8 , 112 . DOI: 10.1186/1471-2105-8-112 en
dc.identifier.issn 1471-2105
dc.identifier.other PURE: 77378
dc.identifier.other PURE UUID: 3bc3d511-5c0f-4ce8-905a-733fffef9375
dc.identifier.other dspace: 2160/634
dc.identifier.other DSpace_20121128.csv: row: 448
dc.identifier.other Scopus: 34247478148
dc.identifier.uri en
dc.description Riley, M. C., Clare, A., King, R. D. (2007). Locational distribution of gene functional classes in Arabidopsis thaliana. BMC Bioinformatics 8, Article No: 112 Sponsorship: EPSRC / RAEng en
dc.description.abstract Background: We are interested in understanding the locational distribution of genes and their functions in genomes, as this distribution has both functional and evolutionary significance. Gene locational distribution is known to be affected by various evolutionary processes, with tandem duplication thought to be the main process producing clustering of homologous sequences. Recent research has found clustering of protein structural families in the human genome, even when genes identified as tandem duplicates have been removed from the data. However, this previous research was hindered as they were unable to analyse small sample sizes. This is a challenge for bioinformatics as more specific functional classes have fewer examples and conventional statistical analyses of these small data sets often produces unsatisfactory results. Results: We have developed a novel bioinformatics method based on Monte Carlo methods and Greenwood's spacing statistic for the computational analysis of the distribution of individual functional classes of genes (from GO). We used this to make the first comprehensive statistical analysis of the relationship between gene functional class and location on a genome. Analysis of the distribution of all genes except tandem duplicates on the five chromosomes of A. thaliana reveals that the distribution on chromosomes I, II, IV and V is clustered at P = 0.001. Many functional classes are clustered, with the degree of clustering within an individual class generally consistent across all five chromosomes. A novel and surprising result was that the locational distribution of some functional classes were significantly more evenly spaced than would be expected by chance. Conclusion: Analysis of the A. thaliana genome reveals evidence of unexplained order in the locational distribution of genes. The same general analysis method can be applied to any genome, and indeed any sequential data involving classes. en
dc.language.iso eng
dc.relation.ispartof BMC Bioinformatics en
dc.rights en
dc.title Locational distribution of gene functional classes in Arabidopsis thaliana en
dc.type /dk/atira/pure/researchoutput/researchoutputtypes/contributiontojournal/article en
dc.contributor.institution Department of Computer Science en
dc.contributor.institution Bioinformatics and Computational Biology Group en
dc.description.status Peer reviewed en

Files in this item

Aside from theses and in the absence of a specific licence document on an item page, all works in Cadair are accessible under the CC BY-NC-ND Licence. AU theses and dissertations held on Cadair are made available for the purposes of private study and non-commercial research and brief extracts may be reproduced under fair dealing for the purpose of criticism or review. If you have any queries in relation to the re-use of material on Cadair, contact

This item appears in the following Collection(s)

Show simple item record

Search Cadair

Advanced Search