Repository logo
 
Publication

Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy

dc.contributor.authorDeusdado, Sérgio
dc.contributor.authorCarvalho, Paulo
dc.date.accessioned2011-05-18T10:25:35Z
dc.date.available2011-05-18T10:25:35Z
dc.date.issued2010
dc.description.abstractProbabilistic models of languages are fundamental to understand and learn the profile of the subjacent code in order to estimate its entropy, enabling the verification and prediction of “natural” emanations of the language. Language models are devoted to capture salient statistical characteristics of the distribution of sequences of words, which transposed to the genomic language, allow modeling a predictive system of the peculiarities and regularities of genomic code in different inter and intra-genomic conditions. In this paper, we propose the application of compact intra-genomic language models to predict the composition of genomic sequences, aiming to achieve valuable resources for data compression and to contribute to enlarge the similarity analysis perspectives in genomic sequences. The obtained results encourage further investigation and validate the use of language models in biological sequence analysis.por
dc.identifier.citationDeusdado, Sérgio; Carvalho, Paulo (2010). Employing compact intra-genomic language models to predict genomic sequences and characterize their entropy. In Rocha, Miguel P. [et tal.] 4th International Workshop on Practical Applications of Computational Biology & Bioinformatics. Guimarães. p. 143-150. ISBN 978-3-642-13214-8por
dc.identifier.doi10.1007/978-3-642-13214-8_19
dc.identifier.isbn978-3-642-13214-8
dc.identifier.urihttp://hdl.handle.net/10198/4357
dc.language.isoengpor
dc.peerreviewedyespor
dc.publisherSpringer-Verlagpor
dc.subjectLanguage modelspor
dc.subjectGenomic sequences modelingpor
dc.subjectDNA entropy estimationpor
dc.titleEmploying compact intra-genomic language models to predict genomic sequences and characterize their entropypor
dc.typeconference paper
dspace.entity.typePublication
oaire.citation.conferencePlaceBerlinpor
oaire.citation.endPage150por
oaire.citation.startPage143por
oaire.citation.titleAdvances in Bioinformaticspor
person.familyNameDeusdado
person.givenNameSérgio
person.identifier.ciencia-id1D14-2CBC-54F2
person.identifier.orcid0000-0003-2638-2230
person.identifier.scopus-author-id15764598600
rcaap.rightsopenAccesspor
rcaap.typeconferenceObjectpor
relation.isAuthorOfPublication1363c41f-0861-40ea-a87a-4a24d9658f03
relation.isAuthorOfPublication.latestForDiscovery1363c41f-0861-40ea-a87a-4a24d9658f03

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Employing Compact Intra-genomic Language Models to Predict Genomic Sequences and Characterize Their Entropy IWPACBB2010.pdf
Size:
199.46 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: