why nucleotides is more than 4?
1 次查看(过去 30 天)
显示 更早的评论
hi as I know the no. of nucleotides is 4 letters. why in matlab consider it 17 letters as in table here:
thanks
0 个评论
采纳的回答
Walter Roberson
2011-11-16
The table there looks pretty straight-forward to me: http://www.mathworks.com/help/toolbox/bioinfo/ref/int2nt.html#bp_rekb-1 . It has codes for situations in which particular sets of nucleotides are known to be present or known to be absent.
Besides, the number of known nucleotides is not 4: it is currently 8. The 7th and 8th were announced in July 2011, with the 5th and 6th having been announced in April 2005.
更多回答(1 个)
Lucio Cetto
2011-11-19
Ambiguous nucleotide symbols are used to characterize sequences that can have variations. It was introduced in the 80's and they are useful nowadays in certain cases, for example describing restriction enzymes. (e.g. http://www.chem.qmul.ac.uk/iubmb/misc/naseq.html). In my personal opinion I think that there are other situations in which we have better options, such as sequence motifs, sequence profiles and the more elaborated profile HMMs. If you plan to convert to aa, Matlab can actually use also ambiguous aa codes when possible, although this is no longer a standard practice; most people now uses only ACGT.
0 个评论
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Genomics and Next Generation Sequencing 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!