Troubles with data types: integers, doubles, scientific notation, and type casting

69 次查看(过去 30 天)
I have a vector of integer elements, and most/all elements are in the thousands. When I look at the vector, some numbers are display in the normal format (eg, 5037) but some are displayed in scienfitic notation (eg 4.2890e+03). When I call the isinteger() function on my variable, it returns false. I think it is probably due to the way that some elements are stored in scientific notation, but I havent found any way, looking online, to force the variable to save all elements in the normal format. If I call isinteger(round()) on my variable, it still returns false. I also cannot typecast my vector to be eg. int16 (which I will get to in a minute).
The reason I need integer values is that some of these values will be used to index an array. I am using the "S = sparse(i,j,v,m,n,nz)" function, and the values m,n,nz all come from my variable. If I try to run it as is, where all of the values are actually integers but Matlab does not recognize them as such, I get the error message
Every one of the elements should be integers, but it throws me an error regardless. Just so you can see what my variables are,
n = numel(x);
idx = 1:n;
x is my integer-valued double vector, and k is a particular element from x.
The documentation for sparse indicates that it shouldn't be problematic, as it says the first two terms (i,j aka idx, x) can be "Data Types: single | double | int8 | int16 | int32 | int64 | uint8 | uint16 | uint32 | uint64 | logical", the third term (v aka 1) can be "Data Types: double | logical", the fourth and fifth terms (m,n aka n,k) can be "Data Types: double" and the last term (nz aka n) can be "Data Types: double"
Based on the documentation, I don't understand why it won't accept my values -- it claims the type "double" is supported for every single input term.
If I try casting my vector, x, to be something else, like an int16, I get the error
even though the documentation claims that it accepts "single | double | int8 | int16 | int32 | int64 | uint8 | uint16 | uint32 | uint64 | logical"
How can I resolve this? Is there any way to force all of the elements in my vector to be saved in the normal format and not scientific notation? Why do I get an error about input types when the documentation claims that data type is supported?
  3 个评论
Hannah D
Hannah D 2020-4-30
Perhaps I should have been clearer; I suspect that the values displayed in scientific notation are the ones with more floating-point error that makes Matlab think they're not actually integer valued.
And I did your more thorough check for integer values, and even though the documentation claims to be able to support double type inputs for all inputs, I get errors where its asking for integers.
Yes, I can see that one value in one of the input arrays has enough floating point error that it thinks its not an integer, but I don't know why Matlab has a problem with this because 1) the documentation claims to be able to support double type, 2) it still doesn't work if i call round() on the value before feeding it in, and 3) this code has worked before, many times over, when my arrays had values in the 100s, but now the values are in the 1000s and only now do I begin to get errors. I did not change anything in the function except the order of magnitude of the inputs. It had all the same data types, steps, functions, size of arrays, everything.
James Tursa
James Tursa 2020-4-30
I am asking again, please show us a complete small example that demonstrates the problem, not just code snippets. And not pictures of code ... post your code as text and format it with the CODE button.

请先登录,再进行评论。

采纳的回答

Guillaume
Guillaume 2020-4-30
编辑:Guillaume 2020-4-30
left = 100*round(left,2);
In theory, this should indeed give you integers. Unfortunately, with floating points (double), this is not always going to work and you will get non-integers values. See for example:
>> x = 1.127;
>> y = round(x, 2) %appears to be 1.13. It is not!
y =
1.13
>> z = 100*y %appears to be 113. It is not!
z =
113
>> z - 113 %it is off by a tiny amount
ans =
-1.4210854715202e-14
Even though, mathematically, the below is the same, it is much safer:
left = round(left * 100); %do the multiplication before rounding, then round to nearest integer
The reason is that there are many numbers with just two decimals, such as 1.13, that cannot be stored as double. So instead of storing 1.13 matlab stores the nearest possible number (about 1.12999999999999989342...) which of course when multiplied by 100 is not exactly 113 but 112.99999999999998579... (note the change in the latter digits).
Note that this is not dependent on the version of matlab, and actually applies to any code that uses double representation, not just matlab.

更多回答(3 个)

James Tursa
James Tursa 2020-4-29
编辑:James Tursa 2020-4-29
You are confusing integer "types" with integer "values". Integer types are int8, uint8, ... int16, uint64. Integer values a 1, 2, 3, etc. You can store integer "values" in floating point types (single or double) or in integer types (int8, uint16, etc.).
You are also confusing the stored value with the displayed value. A stored integer value can display as an integer format or can display as a floating point format. But how it is displayed on your screen does not affect the underlying storage value. The display format depends on the value and your display settings.
For your particular problem, you need to provide us with a specific example of where you think you are feeding the sparse( ) function proper inputs (show them to us) and then copy & paste the entire error message for us to see. If sparse( ) is complaining that your index arrays are not integer values, then they aren't. And the data values for a sparse matrix need to be either double or logical ... no other sparse matrix data types are supported.
If all of your inputs are type double, and your indexing arrays have only positive integer values, then things should work for you.
  3 个评论
James Tursa
James Tursa 2020-4-30
"... I did provide both the inputs ..."
No. There is a difference between describing your inputs and actually providing your inputs. You need to show us complete code that creates the inputs and calls sparse( ) so we can run it and diagnose things on our own. It may very well be that the documentation is incomplete or incorrect as you say, but we would like to see the code that demonstrates it.
Hannah D
Hannah D 2020-4-30
left = (4+4*rand(2578,1075)).*randn(2578,1075) + (rand(2578,1075)-0.5);
left = 100*round(left,2); % bin and scale
left = left - min(min(min(min(left))))+1; % shift to be integer values
x = left(1,:);
y = left(2,:);
n = numel(x);
x = reshape(x,1,n);
y = reshape(y,1,n);
l = min(min(x),min(y));
x = x-l+1;
y = y-l+1;
k = max(max(x),max(y));
idx = 1:n;
Mx = sparse(idx,x,1,n,k,n);
I have gone and manually checked the values of the variables to confirm that they are integer valued.
What I really don't understand is why the code is giving me errors now because it has worked in the past. I have used it where I had line 2 instead say
left = 10*round(left,1);
and the code ran perfectly, without issue. That was the literal only change, and now I'm getting errors.

请先登录,再进行评论。


Steven Lord
Steven Lord 2020-4-30
Which release of MATLAB are you using? The ability to specify the subscripts (the first two inputs) in the sparse function as arrays of an integer type was added in release R2020a.
From the error message you're receiving, you have at least one element in x that is not an integer value. Find the elements that are not an integer value with:
find(x ~= round(x))
  4 个评论
Steven Lord
Steven Lord 2020-4-30
I agree with Stephen Cobeldick. Either upload the exact data you pass into the sparse function or give us a complete code that we can run with which you can reproduce the behavior. You're going to want to set the random number generator to a specific state with rng before calling rand and randn so we can reproduce exactly what you ran.
Guillaume
Guillaume 2020-4-30
Well, it's fairly easy to reproduce the problem with for example just:
left = [1.127 2; 0 0]
I've explained the problem in my answer.

请先登录,再进行评论。


Hannah D
Hannah D 2020-4-30
In the end I got it to work by calling round() basically on everything at multiple different points (which I am still not certain why it wouldn't work if I just called round on the values once?) before passing it into the function
  3 个评论
Hannah D
Hannah D 2020-4-30
@Guillaume I meant, I have tried calling round() on the variable after its creation, not just the round(left,2) in the declaration, and that still didn't fix the issue. I had to call round() on multiple things in order to get it to work
James Tursa
James Tursa 2020-5-1
@Hannah: I do not mean to be harsh, but this sounds like you don't really understand your problem and you have put a bandaid on it. If you understood the problem and what the floating point calculations are doing, you would be able to create better code as Guillaume has suggested and avoid sprinkling round( ) all over your code until things seem to work ... this time. It would be better if you took the time to understand why you are getting the problems at each step and then write better code to avoid those problems in the first place.

请先登录,再进行评论。

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by