How to extract information from the filename?

54 次查看(过去 30 天)
I have almost 6000 files with names like Arenosillo.2005.344.13.49.G13.txt where 2005 is the year, 344 is the number of day of that year and 13:49 is the time. I want to extract all these information from the filename. Please help me.

回答(2 个)

jonas
jonas 2018-5-9
If you know something about the structure, then this is quite simple. Let's say the structure is [year.day.hour.min], with some arbitrary string before.
Use regexp to find the separate digits:
string='Arenosillo.2005.344.13.49.G13.txt';
[ind1,ind2]=regexp(string,'\d+')
This gives you the index at start and end of all digits, respectively. Then extract that information:
y=string(a(1):b(1))
d=string(a(2):b(2))
h=string(a(3):b(3))
m=string(a(4):b(4))
It's a bit more complicated if you have other numbers in your string, but can still be solved with regexp

Pawel Jastrzebski
编辑:Pawel Jastrzebski 2018-5-9
This will get you the structure of all of the text files in your current folder:
x = dir('*.txt') % structure
In my case it's:
x =
6×1 struct array with fields:
name
folder
date
bytes
isdir
datenum
And this will extract all of the text file names from the structure to the cell:
y = {x.name}
Once you've got this far, it should be easy to extract the name of the file i.e with the FOR loop and break it down to the information that you need.
  2 个评论
SGMukherjee
SGMukherjee 2018-5-9
I already got this. list=dir('*.txt'); for n=1:length(list); filename=list(n).name; end But I want to extract the year, day number and time from the filename.
Pawel Jastrzebski
Try strsplit :
>> s = 'Arenosillo.2005.344.13.49.G13.txt'
s =
'Arenosillo.2005.344.13.49.G13.txt'
>> c = strsplit(s,'.')
c =
1×7 cell array
Columns 1 through 6
{'Arenosillo'} {'2005'} {'344'} {'13'} {'49'} {'G13'}
Column 7
{'txt'}

请先登录,再进行评论。

类别

Help CenterFile Exchange 中查找有关 Cell Arrays 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by