Utilize este identificador para referenciar este registo: http://hdl.handle.net/10198/4357
Título: Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy
Autor: Deusdado, Sérgio
Carvalho, Paulo
Palavras-chave: Language models
Genomic sequences modeling
DNA entropy estimation
Data: 2010
Editora: Springer-Verlag
Citação: Deusdado, Sérgio; Carvalho, Paulo (2010) - Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy. In Rocha, Miguel P. [et tal.] 4th International Workshop on Practical Applications of Computational Biology & Bioinformatics. Guimarães. p. 143-150. ISBN 978-3-642-13214-8
Resumo: Probabilistic models of languages are fundamental to understand and learn the profile of the subjacent code in order to estimate its entropy, enabling the verification and prediction of “natural” emanations of the language. Language models are devoted to capture salient statistical characteristics of the distribution of sequences of words, which transposed to the genomic language, allow modeling a predictive system of the peculiarities and regularities of genomic code in different inter and intra-genomic conditions. In this paper, we propose the application of compact intra-genomic language models to predict the composition of genomic sequences, aiming to achieve valuable resources for data compression and to contribute to enlarge the similarity analysis perspectives in genomic sequences. The obtained results encourage further investigation and validate the use of language models in biological sequence analysis.
Peer review: yes
URI: http://hdl.handle.net/10198/4357
DOI: 10.1007/978-3-642-13214-8_19
ISBN: 978-3-642-13214-8
Aparece nas colecções:ESA - Publicações em Proceedings Indexadas à WoS/Scopus

FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote Degois 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.