This question is closed. Reopen it to edit or answer.
How to use regexp to search for separated sequences?
Say I have a text file with tons of random letters and I want to search it for the sequence ABBA[AGTC]DABA. I want ABBA and DABA at the ends of the sequence, with a variable sequence in the middle. The middle part can be 0, 4, or 6 letters long and can be any combination of the letters A, G, T, and C.
Any idea how to search for such sequences using regexp? Maybe another search command?
2 Comments
Star Strider
2016-12-12
Is the ‘variable sequence in the middle’ always enclosed within square brackets []?
Can you provide a sample sequence, or a file with some sample sequences, in the format you expect to use them?
per isakson
2016-12-12
Edited: per isakson
2016-12-12
Are you looking for something like this?
>> regexp( 'ABBAAGTGTCDABA', 'ABBA([AGTC]{0}|[AGTC]{4}|[AGTC]{6})DABA', 'match' )
ans =
'ABBAAGTGTCDABA'
>> regexp( 'ABBADABA', 'ABBA([AGTC]{0}|[AGTC]{4}|[AGTC]{6})DABA', 'match' )
ans =
'ABBADABA'
>> regexp( 'xxxABBAAGTGTCDABAzzz', 'ABBA([AGTC]{0}|[AGTC]{4}|[AGTC]{6})DABA', 'match' )
ans =
'ABBAAGTGTCDABA'
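To apply the same pattern to a longer text with several occurrences, a minimal sketch along these lines may help. The sample string and the variable names (txt, pat, midLen) are made up for illustration; for a real file, fileread could load the contents into txt first. The 'start' option is requested as well so that the position of each hit is reported.

```matlab
% Hypothetical sample text (an assumption, not from the original post).
% For a real file, txt = fileread('yourfile.txt') would load it instead;
% the file name here is only a placeholder.
txt = 'xxABBADABAyyABBAAGTCDABAzzABBAAGTGTCDABAqqABBAAGDABA';

% ABBA, then a middle of exactly 0, 4, or 6 letters from AGTC, then DABA.
pat = 'ABBA([AGTC]{0}|[AGTC]{4}|[AGTC]{6})DABA';

% Outputs come back in the order the options are listed.
[mat, idx] = regexp(txt, pat, 'match', 'start');

% Length of the variable middle part: total length minus the two
% fixed 4-letter ends.
midLen = cellfun(@numel, mat) - 8;

% List every hit with its start position and middle length.
for k = 1:numel(mat)
    fprintf('start %d: %s (middle length %d)\n', idx(k), mat{k}, midLen(k));
end
```

The last candidate in the sample text, ABBAAGDABA, has a two-letter middle and is therefore not reported.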
Answers (0)