Pull out strings and its values from a text file.

4 次查看(过去 30 天)
Hi
Please find the attachment *.txt file. I want to analyze the whole text file .
Thanks
-Sriram

采纳的回答

Guillaume
Guillaume 2019-6-11
HI Sriram, sorry I was away last week.
Parsing the the first part of each message (date, level, source) is trivial. It's the part after that that is difficult due to the variations of format. I don't fully understand the algorithm you've written and I don't think you can use : indiscriminately as a delimiter. For example on line 2, it's part of https://www....
Here is how I would start the parsing:
filecontent = string(fileread('File.txt')); %read whole file as STRING (for easier text comparison later)
messages = regexp(filecontent, '^(?<date>[^ ]+) (?<level>[^ ]+) (?<source>[^:]+):\s+(?<content>[^\r\n]+)', 'names', 'lineanchors'); %parse all lines according to common format
dates = num2cell(datetime([messages.date], 'InputFormat', 'yyyy-MM-dd''T''HH:mm:ss.SSSSSSZZZZZ', 'TimeZone', 'UTC')); %decode date
[messages.date] = dates{:}; %and put back into structure
%parsing of kernel messages
iskernel = [messages.source] == "kernel";
parsedkernel = regexp([messages(iskernel).content], '\[\s*(?<cputime>[^\]]+)]\s+(?<message>.*)', 'names'); %parse kernel messages. Not sure of the rule
parsedkernel = [parsedkernel{:}]; %convert into structure array
cputime = num2cell(str2double([parsedkernel.cputime])); %convert cputime to numeric
[parsedkernel.cputime] = cputime{:}; %and put back into structure
parsedkernel = num2cell(parsedkernel); %convert to cell array to put back into messages structure
[messages(iskernel).content] = parsedkernel{:};
  6 个评论
Guillaume
Guillaume 2019-6-14
Sriram's comment mistakenly posted as an answer (please use comments!):
Thanks a lot. I works.
Guillaume
Guillaume 2019-6-14
Then consider changing your accepted answer, particularly after all the hard work that has gone in getting you there.

请先登录,再进行评论。

更多回答(1 个)

Dimitar Georgiev
Dimitar Georgiev 2019-5-26
cell = readcell('filename.xlsx','Range','......');
stringname = '......';
variable = strcmp(stringname,cell);
  12 个评论
Guillaume
Guillaume 2019-6-1
As I wrote:
So, I'm afraid, the task is back onto you You first need to define rules (there's going to be several due to the complex formatting of the lines) on how to split a line into various components. Only once you've done that can we think about writing the code to do it

请先登录,再进行评论。

类别

Help CenterFile Exchange 中查找有关 String Parsing 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by