Automated Classi cation of DNA Structure from Sequence Information

semanticscholar(2018)

引用 0|浏览3
暂无评分
摘要
We introduce an algorithm, lllama, which combines simple pattern recognizers into a general method for estimating the entropy of a sequence. Each pattern recognizer exploits a partial match between subsequences to build a model of the sequence. Since the primary features of interest in biological sequence domains are subsequences with small variations in exact composition, lllama is particularly suited to such domains. We describe two methods, lllama-length and lllama-alone, which use this entropy estimate to perform maximum a posteriori classi cation. We apply these methods to several problems in three-dimensional structure classi cation of short DNA sequences. The results include a Email: loewenst@paul.rutgers.edu. Phone: 201-761-5949. Fax: 908-445-0537. y Email: berman@adenine.rutgers.edu. z Email: hirsh@cs.rutgers.edu.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要