datasample/bootstrap procedure

3 次查看(过去 30 天)
Basically I need to ensure for all rows in the variable t1, that they are resampled in the same way.
That means that if the first draw is observation 2, then for all t1, I need observation 2 to be the first draw. This continues, such that I will have b resamples that resamples so that the samples are reordered, but for all t1 they has to be re-ordered the same way. By using rng I achieve this. I also need the variable "carhartt" to follow the same procedure.
However, I am questioning whether there is a better method for doing this? It is very important that i resample all t1 in the same way.
It is also important that each bootstrap/datasample is random.
Do any of you experts have a better solution, or do you approve of the one I use here?
I am trying to bootstrap/resample the best appropriate way:
y=+aDanmark()-bDanmark(:,5);
t1=y(1:t,:);
nans = any(isnan(t1),1);
t1(:,nans)=[];
for i =1:size(t1,2)
resultst1=ols(t1(1:12,i),carhart(1:12,:));
estimatest1(:,i)=resultst1.beta(1);
residuals=resultst1.resid;
carhartt=carhart(1:12,:);
b=99;
for B=1:b
rng(B);
bootresiduals=datasample(residuals,size(residuals,1));
rng(B);
bootcarhart=datasample(carhartt,size(residuals,1));
B
end
end
  3 个评论
Anton Sørensen
Anton Sørensen 2020-3-14
编辑:Anton Sørensen 2020-3-14
Hi, I am sorry for that, didn't think it was important in relation to creating the resample.
I will explain each variable. T1 represents mutual funds performance in the first 12 months of the original sample, and I will use the residuals for each mutual fund in the resample.
Carhartt is the benchmark data for the first 12 month of the sample.
I will illustrate what I need in the following:
If the first resample for mutual fund 1' residuals are drawn from the following observations: Residual(1),Residual(3), Residual(10), Residual(8), Residual(3), Residual(8), Residual (10), Residual (10), Residual (12), Residual(1), Residual(1) and Residual (12) Then I basically need that the first draw for both all mutual funds and the Carhartt benchmark data are drawn in the same way, such that the orders are equal. Meaning, that fund 10 for instance draw the same residuals, but from its own residual distribution (same goes for carhart benchmarks)
Does it make more sense now?
Thanks.
Anton Sørensen
Anton Sørensen 2020-3-14
And important to mention, rng(B) seem to do this, however the resample procedure will not be totally random, since it will dependent on seed B=1...b.
So I wondered if this is an approved solution, or if there is a better way?

请先登录,再进行评论。

采纳的回答

Guillaume
Guillaume 2020-3-14
In the following, I'm assuming that residuals and carhartt are both matrices. It looks like they are from your code. If not the code should be slightly different but the principle still stands.
However, I find it a bit strange that you're drawing K rows with replacement from a matrix with K rows. I would have expected the 2 K to be different.
Anyway, instead of drawing the samples directly, draw their row indices. You can then use the indices for both matrices:
samplerows = datasample(1:size(residuals, 1), size(residuals, 1));
%or without any need of the stat toolbox:
%sampleindices = randi(size(residuals, 1), size(residuals, 1)); %1st size is the size of the input, 2nd size is the number of samples
bootresiduals = residuals(samplerows, :);
bootcarhart = carhartt(samplerows, :);
  3 个评论
Guillaume
Guillaume 2020-3-14
I know nothing about finance computation, so have no idea what "OLS estimates on my resampled residuals" actually mean and don't have any inkling as to what you're asking.
Anton Sørensen
Anton Sørensen 2020-3-14
No problem.
Thanks alot for your help.
I might have a future problem on this datasampling procedure, since some of my funds doesn't have data for the entire sample periods and therefore I am dealing with NaN.

请先登录,再进行评论。

更多回答(0 个)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by