the number of occurences of each character of one string,in another

Question

hiva 2014-12-28

1
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another

编辑： Luuk van Oosten 2015-1-24

i have a string of more than 100 characters (fasta format of a protein sequence. like

'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH'

which is being shortened here for simplicity) and i want to find out whether or not it is hydrophobic. so i have to check the number of occurrences of each of the characters in the set 'A C F I L M P V W Y'(hydrophob amino acids) in my fasta string. considering the very long length of fasta strings, is there any easy way to do that by matlab string functions?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Azzi Abdelmalek 2014-12-28

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163456

编辑：Azzi Abdelmalek 2014-12-28

在 MATLAB Online 中打开

str='MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH'
p={'A' 'C' 'F' 'I' 'L' 'M' 'P' 'V' 'W' 'Y'}'
out=[p cellfun(@(x) nnz(ismember(str,x)),p,'un',0)]

2 个评论
显示无隐藏无

hiva 2014-12-29

thanks a lot.i guess this works well for a lot of similar cases that are supposed to work the same way in my code(since it is feature extraction and there are lots of features). also tells me how much i don't know from matlab.thanks.

Stephen23 2014-12-30

编辑：Stephen23 2014-12-30

在 MATLAB Online 中打开

This could be simplified and speeded-up by using arrayfun instead of cellfun, and removing the ismember:

>> t = 'ACFILMPVWY';
>> arrayfun(@(x)sum(str==x), t)
ans =
     6     2     4     6    13     2     7     7     1     7

请先登录，再进行评论。

Answer 2

Peter Perkins 2014-12-29

2
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163537

在 MATLAB Online 中打开

Another possibility:

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> n = hist(double(s),1:90);
>> n(t)
ans =
     6     2     4     6    13     2     7     7     1     7

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

Jan 2014-12-30

This is a histogram problem, so histc is an efficient and direct solution.

请先登录，再进行评论。

Answer 3

Luuk van Oosten 2015-1-24

2
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_165835

编辑：Luuk van Oosten 2015-1-24

在 MATLAB Online 中打开

I reckon you are using the BioInformatics Toolbox. In that case you can probably use:

aacount('SEQ')

Where SEQ is of course your sequence of interest: MEQNGLDHDSRSSIDTTINDTQKTFLEF....

and using

nr_A = All.A
nr_C = All.C
nr_F = All.F

etc. (you get the idea)

you get the numbers of your hydrophobic residues. Sum these and you have your hydrophobic score. You might want to 'normalize' this number by dividing this number by the total amount of amino acids in the sequence.

Of course you can write a loop for this and calculate the hydrophobic score for all your sequences in your FASTA file.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

Answer 4

Shoaibur Rahman 2014-12-28

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163455

在 MATLAB Online 中打开

s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
numA = sum(s=='A')
numC = sum(s=='C')
numF = sum(s=='F')
numI = sum(s=='I')
numL = sum(s=='L')
numM = sum(s=='M')
numP = sum(s=='P')
numV = sum(s=='V')
numW = sum(s=='W')
numY = sum(s=='Y')

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

hiva 2014-12-29

very simple and delicate. really thanks

请先登录，再进行评论。

Answer 5

Stephen23 2014-12-30

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/168299-the-number-of-occurences-of-each-character-of-one-string-in-another#answer_163616

编辑：Stephen23 2014-12-30

在 MATLAB Online 中打开

A neat solution using bsxfun :

>> s = 'MEQNGLDHDSRSSIDTTINDTQKTFLEFRSYTQLSEKLASSSSYTAPPLNEDGPKGVASAVSQGSESVVSWTTLTHVYSILGAYGGPTCLYPTATYFLMGTSKGCVLIFNYNEHLQTILVPTLSEDPSIH';
>> t = 'ACFILMPVWY';
>> sum(bsxfun(@eq,s.',t))
ans =
     6     2     4     6    13     2     7     7     1     7

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

hiva 2014-12-30

编辑：hiva 2014-12-30

wow!!! just wonderful. it works pretty well.thanks a lot.

请先登录，再进行评论。

the number of occurences of each character of one string,in another

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

2 个评论
显示无隐藏无

更多回答（4 个）

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

另请参阅

类别

标签

Community Treasure Hunt

the number of occurences of each character of one string,in another

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

2 个评论 显示 无隐藏 无

更多回答（4 个）

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

1 个评论 显示 -1更早的评论隐藏 -1更早的评论

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

2 个评论
显示无隐藏无

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论