BitShift, Pow2 or standard multiplication? Code Optimization
I was looking to optimize a code base, only to be surprised by the results of comparing these different MATLAB methods.
While I was expecting bitshift or pow2 to be orders of magnitude faster, the standard multiplication was actually about 100x faster.
Am I missing something here? How can a standard multiplication be faster than the dedicated functions?
n = 1e6; x = floor(rand(1,n)*2^8);
tic, for k = 1:1:n, x1(k) = bitshift(x(k),16); end; toc
tic, for k = 1:1:n, x2(k) = pow2(x(k),16); end; toc
tic, for k = 1:1:n, x3(k) = x(k)*2^16; end; toc
Results:
bitshift: Elapsed time is 0.156905 seconds.
pow2: Elapsed time is 0.146554 seconds.
*: Elapsed time is 0.001691 seconds.
Vectorized code shows the same ordering:
n = 1e8; x = floor(rand(1,n)*2^8);
tic, x1 = bitshift(x,16); toc
tic, x2 = pow2(x,16); toc
tic, x3 = x*2^16; toc
bitshift: Elapsed time is 0.279544 seconds.
pow2: Elapsed time is 0.070829 seconds.
*: Elapsed time is 0.029266 seconds.
Answers (1)
埃博拉酱 on 2024-11-1, 1:45
Edited: 埃博拉酱 on 2024-11-1, 1:48
If these functions did literally what their names suggest, bitshift would simply add to the exponent field of the double, and pow2 would do the same thing as the multiplication operator. A bit shift should in theory be marginally cheaper, but the difference is tiny and far smaller than the overhead of the function call itself.
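As a quick sanity check of that equivalence (my own sketch, not from the original answer): for nonnegative integer-valued doubles whose shifted values stay below flintmax, all three expressions return identical results.
% Sketch: verify the three expressions agree on small integer-valued doubles
x = floor(rand(1,10)*2^8);    % integer values in [0, 255]
assert(isequal(bitshift(x,16), pow2(x,16), x*2^16))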
Yes, most of the time you observe for bitshift and pow2 is spent on the function calls themselves. MATLAB function calls are expensive because, unlike C++, MATLAB does not inline simple functions.
If you time the vectorized versions instead:
n = 1e6;
x = floor(rand(1,n)*2^8);   % integer-valued doubles in [0, 255]
tic;
x1 = bitshift(x,16);        % one vectorized bitshift call
toc
tic;
x2 = pow2(x,16);            % one vectorized pow2 call
toc
x = uint32(x);              % note: the multiplication below runs on uint32
tic;
x3 = x*2^16;                % plain multiplication
toc
You will see that, as I said, pow2 and the operator take almost the same time, while bitshift is still the slowest, possibly because MATLAB performs extra input checking and branching inside bitshift.
An interesting, if rough, way to compare them is to use the doc command and look at how long the documentation for bitshift and pow2 is; bitshift's page is longer. That is not an absolute rule, but in general a longer document suggests more complex logic: bitshift has to handle more distinct and more complicated cases than pow2, and those extra checks take time.
The lesson from MATLAB's high function-call overhead is to avoid calling functions repeatedly inside loops and to make the most of the vectorization that the functions themselves support.
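To put a rough number on that overhead (a sketch of mine, not from the answer; timings depend on your machine and MATLAB version), you can compare a single tiny call against one large vectorized call with timeit:
% One tiny call is dominated by call overhead; one big call amortizes it.
f = @() bitshift(1,16);              % single scalar call
g = @() bitshift(ones(1,1e6),16);    % one call over 1e6 elements
perCall  = timeit(f);
perBatch = timeit(g);
fprintf('per call: %.3g s, per element (vectorized): %.3g s\n', ...
    perCall, perBatch/1e6);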
2 comments
埃博拉酱 on 2024-11-5, 0:52
Edited: 埃博拉酱 on 2024-11-5, 0:54
There may also be some overhead from logic checks, such as checking whether the input is floating point. But I agree with you that these built-in functions should not be this much slower than a handwritten implementation. If you open the source files of these built-ins, you will see they do not appear to have been updated in more than ten years. Perhaps their internals are based on outdated techniques that were never re-optimized, perhaps the input-argument checking is excessive and inefficient, or perhaps their performance tests only covered large array inputs and never frequent calls on small data.
In my experience, MATLAB does tend to optimize vectorized computation on big data while sacrificing, almost without limit, the performance of frequent calls on small data. Even if the developers know there is room to optimize the small-call path, the lack of performance tests for that case gives them little motivation to do it.
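One simple way to see that trade-off (my own sketch, not part of the original comment) is to sweep the array size and watch the per-element cost of bitshift fall as the fixed call overhead is amortized:
% Per-element cost drops sharply as the fixed call overhead is amortized.
for n = 10.^(1:6)
    x = floor(rand(1,n)*2^8);
    t = timeit(@() bitshift(x,16));
    fprintf('n = %8d: %.3g s per element\n', n, t/n);
end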