I'm guessing you have a Quadro K120 GPU driving your graphics attached to your machine, or elsewhere on the PCI bus, is that possible?
The K80 is seen by the driver as two separate GPUs. Both have lower performance than a K40 on its own.
To use many GPUs at once you need to use a parallel pool, e.g.
parpool('local', gpuDeviceCount);
You can set each worker to use a different GPU (that is the default anyway). Then use a parallel construct such as parfor, parfeval, batch or spmd to do work on each worker. Keep communication between workers to a minimum and you should be able to get the performance improvements you are looking for.
