Interpolation schemes that produce positive second derivatives of the interpolant

Question

SA-W 2024-1-18

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2071531-interpolation-schemes-that-produce-positive-second-derivatives-of-the-interpolant

编辑： SA-W 2024-2-12

Given a set of x-values and y-values. The interpoland to this data should have non-negative second derivatives, allowed to be discontinuous.

The y-values are strictly increasing and the interpoland is allowed to be a C1 function.

Take this data set for example:

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];

y = [0, 0.0111, 0.0293, 0.0521, 0.0787, 0.1086, 0.1416, 0.1774, 0.2155, 0.2556, 1.0248];

plot(x,y)

The first 10 points are equally distributed, only the last segment is bigger. Another characteristic of my y-values is that they are close to linear in the beginning, and nonlinear after that.

As for the cubic interpolation family, I tried makima, pchip, as well as csape, but they all result in negative second derivatives at some points.

Is there an interpolation scheme for my purpose? If not, are there easy adjustments to the linear system for the spline coefficients to enforce positive second derivatives?

Thank you!

UPDATE:

As stated in the comments below, I implemented an optimization problem to find the unknown deivatives at the break points using a C1 cubic hermite interpolant. On the interval

, we have

where

and

are the interpolation values and

and

the unknown derivatives at the break points.

The basis functions are given by

and

is the mapping to transform from the "real" interval to the unit interval

. All these equations can be taken from wikipedia "cubic hermite interpolation".

In the optimization problem, I want to find the unknown derivatives

at the break points under the constraints that the second derivative of the interpolant is positive everywhere and that the second derivative is as smooth as possible. That said, the objective function measures the change in the second derivative at the break points. Also, I decided to implement "second derivative > 0" as a penalty term in the objective function, although these are linear constraints.

I solved the problem using fmincon, but as stated in the comments, linprog or quadprog could work too. If this is indeed a linear optimization problem, I expect that the derivative of the interpolant w.r.t the interpolation values is the unit spline. My overall goal is to get these sensitivities, and the question is how to get them, if my problem were non-linear.

Here is my code:

% inerpolation points (break points)
x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
% interpolation values)
y = [0, 0.0198463, 0.0397084, 0.0596019, 0.0795445, 0.0995199, 0.119908, 0.140298, 0.160773, 0.181772, 6.64478];
% create points at which the change of the 2nd derivative is measured
%   for every breakpoint x(i) in the interior, i.e., x(2:end-1), 
%   two points are created x(i)-e and x(i)+e
xObj = zeros(2*(numel(x)-2), 1);
% offset "e" in x(i)+e
offset = min(diff(x))/100;
ctr = 1;
for i=2:numel(x)-1
    xObj(ctr) = x(i) - offset;
    xObj(ctr+1) = x(i) + offset;
    ctr = ctr + 2;
end
% objective function handle
objectiveFunc = @(yprime)Objective_function(yprime, x, y, xObj);
% start vector
yprime0 = ones(size(x));
% call optimizer
yprime = fmincon(objectiveFunc, ...
                    yprime0, [], []);          
function fval = Objective_function(yprime, pointsx, pointsy, pointsxObj)
        
    % minimize the jump of the second derivative at the
    % break points (to obtain best smoothness of 2nd derivative)
    fval = 0.0;
    ctr = 1;
    for i = 1:numel(pointsxObj)/2
        % calculate second derivative at point (x(i) - e)
        idxL = find(pointsxObj(ctr) >= pointsx, 1, "last");
        hiL = pointsx(idxL + 1) - pointsx(idxL);
        tL = (pointsxObj(ctr) - pointsx(idxL)) / hiL;
        secDerivL = Evaluate_cubicSegment(tL, 2, hiL, ...
                         pointsy(idxL), pointsy(idxL + 1), ...
                         yprime(idxL), yprime(idxL + 1));
        % calculate second derivative at point (x(i) + e)
        idxR = find(pointsxObj(ctr+1) >= pointsx, 1, "last");
        hiR = pointsx(idxR + 1) - pointsx(idxR);
        tR = (pointsxObj(ctr+1) - pointsx(idxR)) / hiR;
        secDerivR = Evaluate_cubicSegment(tR, 2, hiR, ...
                         pointsy(idxR), pointsy(idxR + 1), ...
                         yprime(idxR), yprime(idxR + 1));
        % add squared difference to objective function
        fval = fval + (secDerivL - secDerivR)^2;
        ctr = ctr + 2;
    end
    % ----------------------------- 2nd derivative > 0 (lienar constraints) .......................
    % impose the constraint at discrete points in the interpolation domain
    %   choose 100 points in each cubic segment
    nConstraints = 100*(numel(pointsx)-1);
    xCtr = linspace(min(pointsx), max(pointsx)-1e-3, nConstraints);
    for i=1:nConstraints
        % find polynomial segment in which xCtr(i) is located
        idx = find(xCtr(i) >= pointsx, 1, "last");
        hi = pointsx(idx+1) - pointsx(idx);
        t = (xCtr(i) - pointsx(idx)) / hi;
        % calculate second derivatives 
        %   - of basis functions H
        H = Evaluate_basisFunctions(t, 2);
        %   - of the interpolant
        secondDeriv = pointsy(idx)*H(1) + pointsy(idx+1)*H(2) + hi*yprime(idx)*H(3) + hi*yprime(idx+1)*H(4);
        % add contribution to objective function if derivative is negative 
        tmp = max(0.0, -secondDeriv);
        fval = fval + 10*tmp^2;
    
    end
end
function H = Evaluate_basisFunctions(t, order)
    
    if order==0
        % value    
        H1 = 1 - 3*t^2 + 2*t^3;
        H2 = 3*t^2 - 2*t^3;
        H3 = t - 2*t^2 + t^3;
        H4 = -t^2 + t^3;
    
    elseif order==1
        % first derivative
        H1 = -6*t + 6*t^2;
        H2 = 6*t - 6*t^2;
        H3 = 1 - 4*t + 3*t^2;
        H4 = -2*t + 3*t^2;
    elseif order==2
        % second derivative
        H1 = -6 + 12.*t;
        H2 = 6 - 12.*t;
        H3 = -4 + 6.*t;
        H4 = -2 + 6.*t;
        
    elseif order==3
        % third derivative
        H1 = 12;
        H2 = -12;
        H3 = 6;
        H4 = 6;
    else
        assert(false, "Evaluations up to 3rd derivative possible");
    end
    H = [H1; H2; H3; H4];
    
end
function pi = Evaluate_cubicSegment(t, order, hi, yi, yi1, yiprime, yi1prime)
    
    H = Evaluate_basisFunctions(t, order);
    chainRuleFactor = (1/hi)^order;
    pi = chainRuleFactor*( H(1).*yi + H(2).*yi1 + H(3).*hi*yiprime + H(4).*hi*yi1prime );
      
end

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Bruno Luong 2024-1-23

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2071531-interpolation-schemes-that-produce-positive-second-derivatives-of-the-interpolant#answer_1395646

编辑：Bruno Luong 2024-1-23

在 MATLAB Online 中打开

ppder.m

You can use approximation spline, just put more (denser) knots than the data points so it actually interpolates.

For example using my FEX BSFK to get cubic spline interpolant, C^2, and convex interpolant functtion:

Note that the "knee" location of the interpolant seems to be quite unstable.

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
y = [0, 0.0198463, 0.0397084, 0.0596019, 0.0795445, 0.0995199, 0.119908, 0.140298, 0.160773, 0.181772, 1.02478];
% FEX https://www.mathworks.com/matlabcentral/fileexchange/25872-free-knot-spline-approximation?s_tid=srchtitle_BSFK_1
opt = struct('KnotRemoval','none','sigma',1e-10,'shape',struct('p',2,'lo',0,'up',inf));
pp = BSFK(x,y,4,20,[],opt);
% prepare graphical data
xi = linspace(min(x),max(x),1025);
yi = ppval(pp,xi);
xb = pp.breaks(2);
yb = ppval(pp,xb);
% Check if approximation is close to interpolation
norm(y-ppval(pp,x),'inf')/norm(y,'inf') % 2.6938e-05
% Evaluate secpn derivative
fdd = ppder(ppder(pp));
yddi = ppval(fdd,xi);
all(yddi >= 0) % true
figure(1)
clf
plot(x, y, 'or')
hold on
plot(xi, yi, 'b')
xlabel('x')
ylabel('y')
yyaxis right
plot(xi, yddi) % second derivative is postive
ylabel('y''''')
legend('data', 'spline interpolation', 'second derivative')

Note: you can regularize the interpolant by setting parameter d and lambda, however function will not interpolate the data as stricty as the above

opt = struct('KnotRemoval','none','sigma',1e-10,'d',2,'lambda', 1e-6,'shape',struct('p',2,'lo',0,'up',inf));

6 个评论
显示 4更早的评论隐藏 4更早的评论

SA-W 2024-1-24

I see.

So if you use the approach with lsqlin and f''>=0, a solution always exists and is unique as you said above. If I go with your FEX, I can achieve similar results by playing with the regullarization. Right?

Bruno Luong 2024-1-24

Yes.

There is a subtility you should be aware of. My code make a trade-off between fitness and regularized solution with the lambda value. Ideally if the solution of non-regularized is not unique, you should rather formulate theproblem as the interpolation values as hard constraints and minimize the second derivative energy.

请先登录，再进行评论。

Answer 2

Torsten 2024-1-18

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2071531-interpolation-schemes-that-produce-positive-second-derivatives-of-the-interpolant#answer_1392441

移动：Torsten 2024-1-18

If you do function approximation (e.g. by splines), there are methods to enforce f''>=0. But for function interpolation, it doesn't make sense in my opinion.

46 个评论
显示 44更早的评论隐藏 44更早的评论

SA-W 2024-1-25

编辑：SA-W 2024-1-25

在 MATLAB Online 中打开

@Torsten or @Bruno Luong

I tried to implement the fintite difference approach to calculate the derivative of f w.r.t the y-values.

x = linspace(3, 18, 17); %numel(x) = 17
y = [0 0.0915    0.2564    0.4722    0.7258    1.0146    1.3345    1.6827    2.0519   2.4502    2.8733    3.3100    3.7742    4.2586    4.7577    5.2692    5.8130];
plot(x,y);
% fit curve f with lsqlin and f''>=0 and f'(x(1))>0
slmF = slmengine(x,y,'knots', x, 'concaveup', 'on', 'leftminslope', 0);
% forward differencing to calculate df/dyi
h = 1e-6;
unitVectors = eye(numel(x));
for i=1:numel(x)
    yi = y + h.*unitVectors(:,i);
    slmi = slmengine(x,yi,'knots', x, 'concaveup', 'on', 'leftminslope', 0);
    normCoef = (slmi.coef - slmF.coef) ./ h;
    % normCoef are the coefficients for the function si(x) =: df/dyi (x)
    % ...
end

The .coef matrices have numel(x) rows and 2 columns, where the first column stores the values of the fitted curve at the points, and the second column the first derivatives.

Given that the x-y data are convex, I expected that the step size h has negligible influence on df/dyi = f(y + h*e_i) - f(y) / h. However, this is not the case. Consider normCoef for i=1:

h = 1e-6
normCoef =
0.999995033031503  -1.317473322631768
0.000009659231748  -0.504431128950378
-0.000013950784972   0.135169090947906
0.000019731993817  -0.036213002152508
-0.000017326473589   0.009672115941495
0.000007386535827  -0.002514967512024
0.000006979528067   0.000465534388816
-0.000062704286208   0.000428542923370
0.000286064505417  -0.001286629025543
-0.001068024335638   0.001500947199862
0.001697645135579  -0.000200101823999
-0.001211744038443  -0.001160455398441
0.000550326451076   0.001170506247483
-0.000295791835470  -0.000590524629196
0.000131358923738  -0.000149107171055
-0.000021653789872   0.002064198922902
-0.000012998491172  -0.008569634557531
h = 1e-4
normCoef =
   0.999995491566257  -1.349519663240706
   0.000007276449859  -0.495840698605254
  -0.000009334372719   0.132867014661087
   0.000015788467200  -0.035600121559565
  -0.000015832012368   0.009512679141621
   0.000007386422585  -0.002477481512164
   0.000006079643455   0.000467364215662
  -0.000058511488987   0.000397151335152
   0.000268930491210  -0.001214846808706
  -0.001009108188121   0.001420326427026
   0.001605728923160  -0.000188703885295
  -0.001144649104390  -0.001099241813685
   0.000515927571776   0.001098306983138
  -0.000269104241113  -0.000492242546724
   0.000111552225235  -0.000423337971345
  -0.000015869225933   0.002995946402073
  -0.000011753211737  -0.011955025051025
  
h = 1e-2
normCoef =
   0.999994228845148  -1.193527414114345
   0.000011898524646  -0.537634109765617
  -0.000009258211109   0.144052694596855
   0.000006093611810  -0.038595244342654
  -0.000005391433910   0.010340656461227
   0.000002375859265  -0.002779278310200
   0.000001377334069   0.000798116836970
   0.000012155281559  -0.000381894886553
  -0.000082785041489   0.000460143107456
   0.000320083377625  -0.000473307636045
  -0.000511022516925   0.000062727514960
   0.000376935793556   0.000404325307030
  -0.000191000298022  -0.000655957641682
   0.000120700500794   0.001399552322212
  -0.000074713115872  -0.004570132664994
   0.000030995909484   0.016593923645802
  -0.000002674421129  -0.061575038096129

E.g., the derivative of s1=:df/dy1 at x(1) varies between -1.19 and -1.31 if the step size alters between 1e-2 and 1e-6. For my real code, I evaluate ds1/dx, where the differences are even more visible.

How would you choose the step size h here given the large variations? Does that make sense at all in your opinion?

SA-W 2024-1-25

在 MATLAB Online 中打开

@Bruno Luong Here is what I get. The variations are similar.

h = -1e-6
normCoef =
   0.999995830302609  -1.360127859353910
   0.000006013509135  -0.492996190398776
  -0.000006815048526   0.132104157335444
   0.000013115675213  -0.035397712150331
  -0.000014465095788   0.009462210348588
   0.000007326805829  -0.002469653426207
   0.000004652056518   0.000477580253122
  -0.000053457682725   0.000364824837007
   0.000252192933203  -0.001144739469883
  -0.000951206224897   0.001341342537042
   0.001514661285285  -0.000180731041244
  -0.001080445510837  -0.001031955965125
   0.000487524243198   0.001021727369377
  -0.000252407872381  -0.000405212974286
   0.000101975317079  -0.000634624464091
  -0.000013292478229   0.003708900631061
  -0.000011200818051  -0.014563127259670
  
h = -1e-4
normCoef =
   0.999995557095890  -1.344854551437596
   0.000007355945575  -0.497090932684574
  -0.000009569136594   0.133201878235456
   0.000015747003146  -0.035689728855592
  -0.000015648105034   0.009537584476083
   0.000007524463275  -0.002486921180100
   0.000004888007776   0.000475815832290
  -0.000055428370782   0.000382208811822
   0.000261331662976  -0.001184031336865
  -0.000983200907356   0.001385044410829
   0.001564531446618  -0.000185907001460
  -0.001116373882226  -0.001067569649904
   0.000505343344948   0.001066783836823
  -0.000266497268697  -0.000479960434729
   0.000112867839519  -0.000402863722382
  -0.000016596670704   0.002891097362623
  -0.000011832490543  -0.011560566747226
  
h = -1e-2
normCoef =
   0.999997290366472  -0.936422807636378
   0.000006362756698  -0.606526258139201
  -0.000006834026561   0.162514642135314
   0.000004531791242  -0.043538169491225
  -0.000001453809995   0.011655252522630
  -0.000001651546766  -0.003102627281132
   0.000007493624055   0.000783888390993
  -0.000021443238807  -0.000096259697407
   0.000065923896475  -0.000211872729772
  -0.000226974935025   0.000286049188003
   0.000353742383163  -0.000011304865039
  -0.000229165389465  -0.000247839180934
   0.000072819274166   0.000103707640642
  -0.000040335123863   0.000437265468689
   0.000029608248386  -0.001991044797922
  -0.000006640745109   0.007634735736906
  -0.000003273524651  -0.028653119822264

SA-W 2024-1-25

编辑：SA-W 2024-1-25

在 MATLAB Online 中打开

If you remove the constraint 'concaveup', 'on' and set h = 1e-2, ae-6 what finite difference do you get?

It should become linear and the finite difference should give the same numerical answer

Yes, this is exactly what happens. h = {1e-2, 1e-4, 1e-6,...} give the coefficients below. Interestingly, the derivative of df/dy1 at x(1) is roughly -1 while it was somehow -1.3 with f''>=0.

The slmengine imposes two BC internally if none are povided.

It is probably non-sense to calculate the coefficients for f with 'concaveup', 'on', but for the sensitivities without. They constraints and BCs must match, right?

normCoef =
   0.999999999773452  -1.000317634407111
   0.000000000416375  -0.589403113383802
  -0.000000000242162   0.157930087892399
   0.000000000066819  -0.042317239304165
  -0.000000000018496   0.011338870040206
   0.000000000005018  -0.003038241054493
  -0.000000000001377   0.000814094232565
   0.000000000000333  -0.000218135890789
  -0.000000000000044   0.000058449334900
  -0.000000000000089  -0.000015661450015
  -0.000000000000044   0.000004196465159
   0.000000000000133  -0.000001124410148
   0.000000000000089   0.000000301175940
  -0.000000000000089  -0.000000080293905
   0.000000000000089   0.000000019999180
   0.000000000000089   0.000000000297629
   0.000000000000178  -0.000000021189644

Bruno Luong 2024-1-30

编辑：Bruno Luong 2024-1-30

@SA-W " Assuming "traditional cubic spline interpolation" results in f''<0 somewhere, is it possible that the constrained problem results in f''>=m ?"

IMO it is not possible. Since the fitting score is convex function. Here is the outline of the proof

By contradiction, If you draw a line in the space of functions that link the solution of the unconstrained solution fu to the constrained solution fc that is fc''>=m > 0, the score function will have directional derivative negative started from fc (since the function is convex, and score(fu) < score(fc) ), that means there is small step h > 0 such as

fh := fc + h g
score(fh) < score(fc), since g is negative
fh''(x) >= 0 for all (x) by continuity of f''

which contradicts with the fact that fc is the minimum of the contrained problem. End of the proof.

I'm sorry but you seem to want something that look mathematically impossible.

PS: For the same reason I think why I think SLM solution posted by John is not fuly converged yet since it has f''(x) > 0 for all x.

SA-W 2024-1-30

@Bruno Luong

PS: For the same reason I think why I think SLM solution posted by John is not fuly converged yet since it has f''(x) > 0 for all x.

I just double-checked that the x-y-data I posted and John used in his answer produces f''>0 using traditional cubic spline interpolation such as csape. Hence no active constraints.

Indeed, for all the fits I made so far, there is a f'' at a knot amounted to 1e-5. This supports your proof.

I would like to understand the gist of your sketched proof.

You make the assumption that unconstrained solution fu has a lower score (objective function value) than the constrained solution fc. Makes sense, unconstrained solution should have numerical zero score.

the score function will have directional derivative negative started from fc

Not fully clear to me. Lets make a 1d example and minimize f(x) = x^2. fu is at x=0 (since f' = 2x). Say the constraint is x=3, then the "minimum" is at x=3 and the directional derivative at x=3 is 2*3 = 6 > 0.

Is that example too simple?

SA-W 2024-1-30

Ok. But where in your proof becomes fc''>=m > 0 relevant? Are not the steps and the conclusion the same if we only assume fc''>=0 ? What we wanted to show is that the constrained solution can not fulfill fc''>=m > 0.

Bruno Luong 2024-1-30

编辑：Bruno Luong 2024-1-31

"Are not the steps and the conclusion the same if we only assume fc''>=0 ? "

Because I can only show a strict smaller lower bound, strictly smaller than m (with epsion). If m is 0 (your case) the whole proof cannot work, since m-epsilon is negative and fh no longer meet the constraint, so I canot tell it is an admissible solution.

请先登录，再进行评论。

Answer 3

John D'Errico 2024-1-18

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2071531-interpolation-schemes-that-produce-positive-second-derivatives-of-the-interpolant#answer_1392541

编辑：John D'Errico 2024-1-18

在 MATLAB Online 中打开

Are there easy adjustments? Not really. At least not totally trivial. It sounds as if you want an interpolant, NOT an approximant, so you want a curve that truly passes exactly through the points. That may not always be possible of course.

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];

y = [0, 0.0111, 0.0293, 0.0521, 0.0787, 0.1086, 0.1416, 0.1774, 0.2155, 0.2556, 1.0248];

plot(x,y,'-o')

In this case, an interpolant with a positively curved second serivative should work, though I would need to look very carefully at the data to be sure of that.

How would I do it? I would start with a C1 cubic interpolant, probably one formulated in terms of a set of unknown first derivatives at each break point. Then choose the set of first derivatives such that the second derivative is everywhere positive, and that minimizes the integral of the square of the second derivative over the support of the interpolant. You could write it using a quadratic programming tool to solve the problem, so it would be efficient. That square of the integral would be written as a simple quadratic form.

So, not utterly trivial to write, but not hard either. It that would depend on your skill in writing and formulating things like quadratic programming, and working with piecewise interpolants.

Can it be done? Using my SLM toolbox, I built this curve:

With a second derivative plot as shown below:

As you can see, it is everywhere non-negative as required.

33 个评论
显示 31更早的评论隐藏 31更早的评论

SA-W 2024-1-18

Yes, I want an interpolant.

I have some experience with the optimization toolbox and want to give your scheme a try.

I would start with a C1 cubic interpolant, probably one formulated in terms of a set of unknown first derivatives at each break point. Then choose the set of first derivatives such that the second derivative is everywhere positive, and that minimizes the integral of the square of the second derivative over the support of the interpolant.

A cubic segment can be written as

p(x) = a + b(x-x_i) + c(x-x_i)^2 + d(x-x_i)^3

p'(x) = b + 2c(x-x_i) + 3d(x-x_i)^2

p''(x) = 2c + 6dx

It sounds you would minimize an objective function like

min

i.e., minimizing the second derivatives. But why? f''(x) > 0 is required, but the absolute (positive) value can be as big as it is.

Also, it is not clear to me what the parameters are of the optimization problem. For cubic splines, we have 4 unknowns per segment, but it sounds you only assume the first derivatives to be unknowns.

Maybe I did not fully understand your idea and you can clarify my concerns.

SA-W 2024-1-20

编辑：SA-W 2024-1-20

@John D'Errico

I implemented a slightly different approach than you suggested.

I started with a C1 cubic hermite interpolant formulated in terms of the derivatives at the break points. However, in my objective function, I minimized the jump in the second derivatives of the interpolant over the break points. Say x(2) is breakpoint 2, then I created two points in the vincinity of x(2), x(2) - delta, and x(2) + delta, evaluated the second derivative there, build the difference. I do this for all interior break points and sum the squared values up.

The constraints that the second derivative must be greater than zero are implemented in linear constraints (A and b, in matlab terminology).

I solved the problem using fmincon optimization algorithm to get the derivatives at the break points.

That worked, however, for my application, I also need to derivative of the interpolant w.r.t the y-values. You have answered this question here and mentioned that, in most cases, the interpolant is linear w.r.t y-values because all operations to solve for the spline are linear.

But, in my case, the optimization problem I set up is non-linear, right? At least, fmincon takes several iterations to find the derivatives. Do you see a way to calculate the sensitivities in that case? Or do you think I can treat my problem as a linear optimization problem? ( you mentioned quadprog in your answer)

John D'Errico 2024-1-20

编辑：John D'Errico 2024-1-20

在 MATLAB Online 中打开

@SA-W

i.e., minimizing the second derivatives. But why? f''(x) > 0 is required, but the absolute (positive) value can be as big as it is.

You want to minimize the square of the second derivatives to insure the resulting interpolant is as smooth as possible. Small second deritvatives do that.

But, in my case, the optimization problem I set up is non-linear, right? At least, fmincon takes several iterations to find the derivatives. Do you see a way to calculate the sensitivities in that case? Or do you think I can treat my problem as a linear optimization problem? ( you mentioned quadprog in your answer)

No. The optimization is nonlinear, but only in the sense that it would be a quadratic functional of the unknown parameters, thus the unknown first derivatives at the breaks. Again, you can write the integral of the square of the second derivative as a quadratic form. I said EXACTLY that as you should recall.

And that can be formulated as a QUADRATIC PROGRAMMING problem. Ergo, quadprog. That it took fmincon more than one iteration is because a quadratic is not a linear objective. This means it takes fmincon a few iterations just to figure out the shape of the surface it is wandering around.

However, in my objective function, I minimized the jump in the second derivatives of the interpolant over the break points. Say x(2) is breakpoint 2, then I created two points in the vincinity of x(2), x(2) - delta, and x(2) + delta, evaluated the second derivative there, build the difference. I do this for all interior break points and sum the squared values up.

That will do something approximate. Not at all valid, and surely not the smoothest possible curve that passes through the data, yet satisfies the requirements on the second derivative. Probably not obscenely terrible either. It is not the same thing as the integral of the square of the second derivative, which will produce different results.

Why do I say the approach you took is not at all the best? Suppose you had a second derivative that was perfectly continuous, AND always positive, but also a truly jagged sawtooth? If you did, then your solution would love it! But that sawtooth second derivative would be pure crap in terms of being smooth.

Also, it is not clear to me what the parameters are of the optimization problem. For cubic splines, we have 4 unknowns per segment, but it sounds you only assume the first derivatives to be unknowns.

Suppose you have a cubic spline that passes through two points, (x1,y1), (x2,y2). This means the value of the spline interpolant at the end points of that interval are fully KNOWN. Do you aggree with that? So you can then easily formulate a cubic segment using a variation of a Hermite form, where only the end point first derivatives are unknown. Again, do you follow?

For example, write a cubic segment on the interval [0,h] as I do below:

syms t h
syms y1 y2 d1 d2
P1(t) = 2*t^3/h^3 - 3*t^2/h^2 + 1;
P2(t) = 3*t^2/h^2 - 2*t^3/h^3;
P3(t) = t - 2*t^2/h + t^3/h^2;
P4(t) = t^3/h^2 - t^2/h;

Now build a cubic that passes through the points (0,y1), (h,y2), and has first derivatives as d1 and d2, where d1 and d2 are currently unknown. That is simply:

P(t) = y1*P1 + y2*P2 + d1*P3 + d2*P4

P(t) =

We can verify it has the necessary properties.

P(0)

ans =

P(h)

ans =

dP = diff(P,t);

dP(0)

ans =

dP(h)

ans =

Do you understand that P(t), on the interval [0,h] has some nice properties? It interpolates a function with known function values at each end. And you can set the first derivatives to be anything you wish.

Do the same for each interval between breaks. Do you see that the function will be C1 continuous?

I've given you enough information now to build a C1 curve where only the first derivatives are unknowns. It will be fully C1, but by varying those first derivatives are the breaks, you can easily have a curve that is more r less smooth and well-behaved. The curve you will want is the one with minimum integral of the second derivative squared.

SA-W 2024-1-20

@John D'Errico

Suppose you have a cubic spline that passes through two points, (x1,y1), (x2,y2). This means the value of the spline interpolant at the end points of that interval are fully KNOWN. Do you aggree with that? So you can then easily formulate a cubic segment using a variation of a Hermite form, where only the end point first derivatives are unknown. Again, do you follow?

Yes, makes totally sense. When I wrote this comment, I actually was not aware of hermite form of a cubic polynomial.

That will do something approximate. Not at all valid, and surely not the smoothest possible curve that passes through the data, yet satisfies the requirements on the second derivative. Probably not obscenely terrible either. It is not the same thing as the integral of the square of the second derivative, which will produce different results.

Why do I say the approach you took is not at all the best? Suppose you had a second derivative that was perfectly continuous, AND always positive, but also a truly jagged sawtooth? If you did, then your solution would love it! But that sawtooth second derivative would be pure crap in terms of being smooth.

My goal is to obtain a smooth second derivative. By smooth, I think about a C2 interpolant, that is, a continuous second derivative. But whether the absolute value of the second derivative is of order 1e2 or 1e5, does not matter to me. By minimizing the square of the second derivative, I think it can easily happen that the second derivative has jumps at the breaks, a property I want to avoid if possible.

I do not understand your example with the sawtooth second derivative? Using my approach, a second derivative which is "sawtooth shaped" is not possible. Maybe you can comment on that again.

If I stick to the approach (minimizing the jumps of the second derivative), is it also possible to formulate it as a quadprog problem?

SA-W 2024-1-20

编辑：SA-W 2024-1-20

@John D'Errico

If this is your goal, then by implication you think you want to generate a traditional cubic spline. That is, the almost unique curve that is C2. I say almost unique, because the only free parameters are two parameters, and there are several common ways to choose those parameters. HOWEVER, you cannot find a cubic spline (in general) which will always have non-negative second derivatives. (For some problems it can exist. In fact, I showed that I was able to generate such a curve in my answer for your data.) You simply do not have sufficent degrees of freedom in a piecewise cubic interpolant to create a C2 interpolant that has everywhere non-negative second derivatives, thus fitting any set of data.

I got your point. However, the property of positive second derivatives is crucial for me. If I obsere that a traditional cubic spline interpolant violates that, I have to come up with a different interpolant. Your suggestion (minimizing the square of the second derivatives) and my approach (minimizing the jumps in the second derivatives at the breaks) are ways to better ensure f''>0.

As I also wrote, I need to calculate the derivative of the interpolant w.r.t interpolation values y. Since we solde a non-linear optimization problem to completely define the interpolant, the resulting interpolant is probably not linear w.r.t the interpolation values y. In such a case, how can we compute those sensitivities?

Torsten 2024-1-20

编辑：Torsten 2024-1-20

Unknowns are eps_i and yprime_i.

(2)-(5), I guess, is just a summary of important relations.

(1),(1a),(2)-(5) are the constraints of the optimization problem. They can be formulated in terms of yprime - the main part of the solution vector.

You will have to think about useful boundary conditions of your approximating function (for i=1 and i=n).

Why would you impose boundary conditions for i=1 and i=n? The optimization problem can be solved boundary conditions there.

In the usual spline interpolation, two boundary conditions for i = 1 and i = n are necessary to make the solution of the problem unique. I don't know what further conditions might be necessary to make the solution of your or my formulation of the optimization problem unique. An existing and unique solution is necessary to build the derivatives you requested. If you don't have a function that uniquely maps a vector of y-values to a vector of yprime-values, you cannot build derivatives.

SA-W 2024-1-22

Yes, but in "usual quartile spline interpolation", I have no control over the second derivative. Then, I do not see a benefit compared to "usual cubic spline interpolation".

And what do you mean by "prescribe the second derivative" ? Does it mean "keep continuous" ?

For the cubic hermite form, we made an ansatz of the form

p_i(t) = H_0(t)y_i + H_1(t)y_i+1 + H_2(t)y'_i + H_3(t)y'_i+1.

A cubic polynomial has four degrees of freedom such that we can prescribe the function values (y_i and y_i+1) as well as the first derivative (y'_i and y'_i+1) at the left and right end to have a C1-interpolant.

Similar, for a quintic polynomial, we can make an ansatz with size summands

p_i(t) = H_0(t)y_i + H_1(t)y_i+1 + H_2(t)y'_i + H_3(t)y'_i+1 + H_4(t)y''_i + H_5(t)y''_i+1

which gives two more degrees of freedom to prescribe the second derivatives (y''_i and y''_i+1). Since we can prescribe the second derivatives at the breaks, the interpolant is a C2-function.

I think the whole debate is in vain because proving that a unique solution for an optimization problem exists (even if it's a linear one) will be hopeless.

Probably yes. At least I do not have the mathematical background to come up with a proof of the uniqueness of the problem. However, I know my y-values result from sampling of a convex curve, so I can at least give anything of what we discussed before a try.

Above, you said

If this is not that important and you only want to test the approach, I'd start with quartic splines, demand continuity of 0th, 1st and 2nd derivatives and positivity of the second derivative in the breakpoints and minimize sum p_i''(0) over the breakpoints in the objective function.

that you would implement this approach. However, it is not clear to me how you can demand continuity of the second derivative with a quartic spline. A quartic spline in hermite form has only five degrees of freedom, but we need six (two function values, two first derivatives, two second derivatives). Maybe you can comment again on how you want to realize the continuity of the second derivative with a quartic spline.

Torsten 2024-1-22

编辑：Torsten 2024-1-22

Maybe it's a language problem, but the word "prescribe" is wrong for y',y'',... at the breakpoints in usual spline interpolation. The only thing you prescribe are x and y. All other parameters (y',y'',... at the breakpoints) are computed, namely such that the spline function has continuous first, second, third,... derivatives at the breakpoints. So usually you don't have control over the numerical values of y',y'',... at the breakpoints.

But you want to have some sort of control over y'', namely y'' should be >=0. Since the usual cubic spline only has the property that the second derivative is continuous in the breakpoints, you either have to weaken smoothness properties of the cubic spline or add one or more polynomial degree(s) to gain degrees of freedom for the additional conditions p_i''(0) >= 0 in the breakpoints. That's why I suggested the next higher polynomial degree, namely quartic. In usual spline interpolation, for each additional degree of the local polynomials, you can demand 1 degree higher smoothness in the breakpoints. So while in usual spline interpolation, cubic polynomials lead to C^2 functions, quartic polynomials lead to C^3 functions. But maybe if you reduce the demand for quartic splines to C^2 instead of C^3, you gain degrees of freedom for the additional conditions p_i''(0) >= 0. I don't know - it has to be programmed to see what comes out.

请先登录，再进行评论。

Answer 4

David Goodmanson 2024-1-26

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2071531-interpolation-schemes-that-produce-positive-second-derivatives-of-the-interpolant#answer_1397816

在 MATLAB Online 中打开

Hi SW,

Here is one way to go about it. The data has 10 closely spaced points and one well separated point x(11). Fit the first 10 points with a cubic polynomial. It so happens that the second deriviative is positive in that interval. Then fit the last interval, from x(10) to x(11), with a quadratic that has the right values at the end points and the same first deriviative at x(10) as does the cubic fit (three eqns, three unknowns). That way C1 is good, and by eye (and calculation) the second derivative is going to be positive in that interval.

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
xshort = x(1:end-1);
xfin = x(end);
y = [0, 0.0111, 0.0293, 0.0521, 0.0787, 0.1086, 0.1416, 0.1774, 0.2155, 0.2556, 1.0248];
yshort = y(1:end-1);
yfin = y(end);
% fit all but the last point with a cubic
p = polyfit(xshort,yshort,3)
% make sure that y''> 0
d2ydx2 = 6*p(1)*xshort + 2*p(2);
figure(1); grid on
plot(xshort,d2ydx2)             % it does stay positive
xs1 = xshort(1);
xse = xshort(end);
yse = yshort(end);
% find the derivative at xse, and fit the interval from xse to xfin
% by a quadratic that matches the derivative at xse
dydx = sum(p.*[3*xse^2 2*xse 1 0]) 
q = [xse^2 xse 1; xfin^2 xfin 1; 2*xse 1 0]\[yse; yfin; dydx];
q = q'
xfit1 = linspace(xs1,xse,50);
yfit1 = polyval(p,xfit1);
xfit2 = linspace(xse,xfin,50);
yfit2 = polyval(q,xfit2);
xfit = [xfit1 xfit2];
yfit = [yfit1 yfit2];
figure(2); grid on
plot(x,y,'o-',xfit,yfit)

p =  -0.0002    0.0058   -0.0186    0.0074
dydx =  0.0496
q = 0.0151   -0.2611    1.3464

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

SA-W 2024-1-26

Thank you for that new idea.

Yes, it might be a good approach for cases where the last point is "far away" from the others. But, also, there is no real control about f''.

请先登录，再进行评论。

Answer 5

Alex Sha 2024-1-31

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/2071531-interpolation-schemes-that-produce-positive-second-derivatives-of-the-interpolant#answer_1400251

移动：Bruno Luong 2024-1-31

在 MATLAB Online 中打开

How about to replace interpolation with a fitting function, which ensure non-negative second derivatives:

1: For data:

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
y = [0, 0.0111, 0.0293, 0.0521, 0.0787, 0.1086, 0.1416, 0.1774, 0.2155, 0.2556, 1.0248];

Result:

Sum Squared Error (SSE): 1.81612553970697E-5
Root of Mean Square Error (RMSE): 0.00128492148317141
Correlation Coef. (R): 0.999990141249137
R-Square: 0.999980282595468
Parameter	Best Estimate      
---------	-------------      
p1       	0.260835290168589  
p2       	-17.6776599023055  
p3       	0.00142484062709233
p4       	0.400795422709269 

2: For data:

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
y = [0, 0.0198463, 0.0397084, 0.0596019, 0.0795445, 0.0995199, 0.119908, 0.140298, 0.160773, 0.181772, 6.64478];

Result:

Sum Squared Error (SSE): 7.99353965884897E-5
Root of Mean Square Error (RMSE): 0.00269571033965396
Correlation Coef. (R): 0.999999018472319
R-Square: 0.999998036945602
Parameter	Best Estimate      
---------	-------------      
p1       	-0.468107959471592 
p2       	-12.9882511576562  
p3       	2.13196222726725E-6
p4       	0.88833547002951

3: For data:

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
y = [0, 0.0198463, 0.0397084, 0.0596019, 0.0795445, 0.0995199, 0.119908, 0.140298, 0.160773, 0.181772, 1.02478];

Result:

Sum Squared Error (SSE): 8.04747420418266E-5
Root of Mean Square Error (RMSE): 0.00270478938924384
Correlation Coef. (R): 0.999953749555241
R-Square: 0.999907501249586
Parameter	Best Estimate     
---------	-------------     
p1       	-0.467984356631285
p2       	-12.993660082635  
p3       	9.9621944291978E-6
p4       	0.736411490505736 

19 个评论
显示 17更早的评论隐藏 17更早的评论

Bruno Luong 2024-2-1

编辑：Bruno Luong 2024-2-1

We already discuss about some aspects of your queston. Let me summarize:

for unconstrained optimization that has uniqueness solution, the solution is differentiable wrt input data (y)
for constrained optimization the solution is NOT differentiable wrt input data (y) if one of the inequality constraint is active
IMO transforming the parameters only change the way you solve numerical the optimiztion problem, it does not change the nature of the dependency of solution wrt input, thus the derivative.
Using "model" (such as exponential formula here, or quadratic spline or subic spline) you will compute the dependency of solutions on the model space. For example you can decide the model is f(x) = cst. The solution is f(x) = mean(yi) = 1/n sum_i y(i) for all x, n is number of data. It satisfies f">=0, poorly fit the data, but you have the derivative df(dyi) = yi/n. If that makes sense for you then OK. This example is in the spirit of what proposed by Alex, just push to extreme to demonstrate the point of model dependency.

Bruno Luong 2024-2-1

编辑：Bruno Luong 2024-2-1

" IMO, it affects the quality of the fit a lot, but it is unconstrained optimization."

It should not affect the fit quality. With transformation you simply reparametrize the feasible space differently. Now in pratice the numerical optimizer can prefer some parametrization or in contrary migh fail to find the solution, but that is another issue.

Alex Sha 2024-2-2

在 MATLAB Online 中打开

The fitting function I provided previously will produce negative values for second derivatives in some cases (for example, second data set). Try the function below, which will ensure all positive values of second derivatives.

Fitting function:

The second derivatives of function:

It is easy to see that the above function will always be greater than zero.

1: For data:

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
y = [0, 0.0111, 0.0293, 0.0521, 0.0787, 0.1086, 0.1416, 0.1774, 0.2155, 0.2556, 1.0248];

Result:

Sum Squared Error (SSE): 8.47449963637154E-5
Root of Mean Square Error (RMSE): 0.00277562435832365
Correlation Coef. (R): 0.999949332317241
R-Square: 0.999898667201696
Parameter	Best Estimate    
---------	-------------    
p1       	-74.8659942748385
p2       	-2.31846362397161
p3       	0.056083091099568
p4       	4.31132419470017 

2: For data:

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
y = [0, 0.0198463, 0.0397084, 0.0596019, 0.0795445, 0.0995199, 0.119908, 0.140298, 0.160773, 0.181772, 6.64478];

Result:

Sum Squared Error (SSE): 1.46665162719541E-7
Root of Mean Square Error (RMSE): 0.000115469461810764
Correlation Coef. (R): 0.999999998124083
R-Square: 0.999999996248166
Parameter	Best Estimate      
---------	-------------      
p1       	-22.141281236849   
p2       	1.89538261915128   
p3       	0.00134010429376403
p4       	1.38071369179591

For data:

x = [3, 3.8125, 4.6250, 5.4375, 6.25, 7.0625, 7.8750, 8.6875, 9.5, 10.3125, 16];
y = [0, 0.0198463, 0.0397084, 0.0596019, 0.0795445, 0.0995199, 0.119908, 0.140298, 0.160773, 0.181772, 1.02478];

Result:

Sum Squared Error (SSE): 7.55692231606095E-8
Root of Mean Square Error (RMSE): 8.28850371191159E-5
Correlation Coef. (R): 0.999999954352124
R-Square: 0.99999990870425
Parameter	Best Estimate      
---------	-------------      
p1       	-18.4830172399681  
p2       	1.74240209734898   
p3       	0.00155605799596946
p4       	1.05985027372439 

请先登录，再进行评论。

Interpolation schemes that produce positive second derivatives of the interpolant

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

6 个评论
显示 4更早的评论隐藏 4更早的评论

更多回答（4 个）

46 个评论
显示 44更早的评论隐藏 44更早的评论

33 个评论
显示 31更早的评论隐藏 31更早的评论

1 个评论
显示 -1更早的评论隐藏 -1更早的评论

19 个评论
显示 17更早的评论隐藏 17更早的评论

另请参阅

类别

标签

Community Treasure Hunt

Interpolation schemes that produce positive second derivatives of the interpolant

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

6 个评论 显示 4更早的评论隐藏 4更早的评论

更多回答（4 个）

46 个评论 显示 44更早的评论隐藏 44更早的评论

33 个评论 显示 31更早的评论隐藏 31更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

6 个评论
显示 4更早的评论隐藏 4更早的评论

46 个评论
显示 44更早的评论隐藏 44更早的评论

33 个评论
显示 31更早的评论隐藏 31更早的评论