How do I compare two data sets of unequal length?

21 次查看(过去 30 天)
I have two sets of data, taken on different days, from the same sensor. The temperature was swept from 26C to -30C to 80C and back to 26C. The sensor was read periodically during the temperature sweep. The data sets consist of a temperature column, and another column representing the sensor readings. I would like to take a difference between the two sets of sensor readings, generating another data set having a column of temperatures, and a column of differences between the two original sets of sensor readings. If each data set had exactly the same vector of temperatures, I could just subtract one vector of sensor readings from the other. However, the temperature vectors do not contain exactly the same temperatures, and they don't even have the same number of elements. I would like to interpolate one set of temperatures and sensor readings to match the temperatures of the other, so I have two data sets of the same size, at the same temperatures. One complicating factor is that, due to sensor hysteresis with respect to temperature, the sensor readings are different on the downward temperature ramps from those on the upward temperature ramp. Therefore I can't sort the data on temperature, because that would mix the upward and downward ramps. If I could sort the data on temperature, I could use timeseries objects, with temperature in place of time. However, that won't work in this case.
  8 个评论
DH
DH 2018-8-9
It would not work to average the sensor data for two identical temperatures, because the identical temperatures may not be on the same ramp - one may be as the temperature is going up, and the other as the temperature is going down. Due to hysteresis, the sensor data on the upward ramp may be different from that on the downward ramp. I need to preserve the hysteresis. However, it would be acceptable to drop one of each pair of exact duplicates - something like this:
[uniquetemps, it, iu] = unique(data.Temperature, 'stable');
datareduced = data(it,:);
This would give me a dataset with no duplicate temperatures, and the order would be preserved. However, the temperatures are still not monotonic.
DH
DH 2018-8-9
I think I see how to do it. Your mention of interp1 clued me in.
data1 = readtable([tstfldr 'dataset1.csv']);
data2 = readtable([tstfldr 'dataset2.csv']);
[uniquetemps, it, iu] = unique(data1.Temperature, 'stable');
data1reduced = data1(it,:);
interpSnsr1 = interp1(data1reduced.Temperature, ...
data1reduced.SensorData, data2.Temperature);
df = interpSnsr1 - data2.SensorData;
Thank you for your help.

请先登录,再进行评论。

采纳的回答

DH
DH 2018-8-14
Problem solved - see response from Adam Danz on 8 Aug 2018 at 20:10, and my response - DH on 9 Aug 2018 at 12:15.

更多回答(2 个)

Yuvaraj Venkataswamy
  2 个评论
DH
DH 2018-8-8
编辑:DH 2018-8-8
Those methods finds members of one array that are equal to members of another array, right? My two temperature arrays may not have any members that are exactly equal. I want to interpolate one to the other.

请先登录,再进行评论。


Yuvaraj Venkataswamy
if true
id = ismember(dataset1', dataset2', 'rows');
X = 1:size(dataset1, 2);
Y = X(id);
end

类别

Help CenterFile Exchange 中查找有关 Geodesy and Mapping 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by