Ari Lemmetti
|
b78460b02c
|
Optimize another loop
|
2015-12-11 11:21:43 +02:00 |
|
Ari Lemmetti
|
d71f1b5bd0
|
Disable incompatible optimizations for 32-bit version
|
2015-10-24 15:32:27 +03:00 |
|
Ari Lemmetti
|
df995d85e8
|
Utilize AVX2 for dequantization.
|
2015-10-23 20:17:08 +03:00 |
|
Ari Lemmetti
|
cf347e33c4
|
Move dequant to strategies. Copy generic to AVX2 as well.
|
2015-10-23 19:53:50 +03:00 |
|
Ari Lemmetti
|
47082738aa
|
...and the same tricks for quantized reconstruction
|
2015-10-23 19:44:38 +03:00 |
|
Ari Lemmetti
|
7961ba80d8
|
Add functions for bigger block sizes to calculate more residual simultaneously and reduce memory accesses
|
2015-10-23 19:11:56 +03:00 |
|
Ari Lemmetti
|
15edd5060d
|
Load and store multiple elements simultaneously. Use 128-bit wide zero
test. *wip*
|
2015-10-23 17:03:16 +03:00 |
|
Ari Lemmetti
|
b37cca87c8
|
Copy generic to avx2
|
2015-10-23 17:03:15 +03:00 |
|
Ari Lemmetti
|
38106afa50
|
Add AVX2 version of quantization.
|
2015-10-02 16:18:52 +03:00 |
|