Main content


Date created: | Last Updated:


Creating DOI. Please wait...

Create DOI

Category: Project

Description: Many researchers have shown that formal language theory is an appropriate tool in analyzing various biological sequences. Markov model is most closely related to regular grammars, because an n-gram is a subsequence of n items from a given sequence, and language models that are built from n-grams are actually (n-1)-order Markov models. We investigated whether some subsets of the annotated ENCODE /RoadmapGenomics 15-state model can be predicted by simply creating n-gram models of DNA sequences, in reverse. To achieve this, ChromHMM blocks of human genome were initially dissected into a nucleosome resolution of 200-bp units and, by analyzing the BED files of ChromHMM, each individual unit was assigned one dominant chromatin state.


Loading files...


Recent Activity

Loading logs...

OSF does not support the use of Internet Explorer. For optimal performance, please switch to another browser.
This website relies on cookies to help provide a better user experience. By clicking Accept or continuing to use the site, you agree. For more information, see our Privacy Policy and information on cookie use.

Start managing your projects on the OSF today.

Free and easy to use, the Open Science Framework supports the entire research lifecycle: planning, execution, reporting, archiving, and discovery.