Looking for alternative of extractBetween function
4 次查看(过去 30 天)
显示 更早的评论
close all
clear all
clc
name = 'Covid_strains';
file = strcat(name,'.fasta');
sequences = fastaread(file);
The code extract the sequences from the fasta file "Covid_strains". The file contains multiple sequences. The fastaread function create a structure containing Header and Sequences. Now I want to extract the some part of the header and store it in a table. The table will be the response variable for the predictor. I wanted to use extractbetween function for this. But since I am using older version of MATLAB, the function is not available to me. Please help.
2 个评论
Rik
2021-11-4
You didn't enter your release when you posted your question, which makes it harder to help you.
I suspect the best choice would be to use regexp. With a regular expression you can capture the tokens.
采纳的回答
Rik
2021-11-4
You need to come up with a regular expression that will exactly match what you want to extract. Below is my guess for what you want.
RE=['\|',... match a literal |
'(.*)',... capture any character in a token
'($|\n)'];% end of matched substring should be a newline or the end of the string
txt='OK584715.1 |B.1.2';
x=regexp(txt,RE,'tokens');x{1}
4 个评论
Rik
2021-11-7
Why did you remove all the semicolons as well as all documentation for the regular expression? And why are you closing all figures?
更多回答(0 个)
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Text Files 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!