hashirama/uvg266

mirror of https://github.com/ultravideo/uvg266.git synced 2024-11-25 10:54:05 +00:00

Author	SHA1	Message	Date
Ari Koivula	0c3c93d456	Optimize intra SAD intrinsics. - Added 64x64 version for completeness. - With the exception of 16x16, these were all slightly slower than the ASM versions, as measured by "kvazaar_test -s speed -t intra_sad", but now they are on par or slightly faster. - None of these actually use any AVX2 intrinsics, and probably never will, unless someone adds an interface for doing more than one block at a time, in which case the non-destructive versions might come in handy.	2015-08-06 19:35:00 +03:00
Arttu Ylä-Outinen	f7f17a060c	Rename pixel_t to kvz_pixel.	2015-07-02 16:58:28 +03:00
Ari Koivula	ded6fd9ee8	Renamed typedef pixel to pixel_t.	2015-03-04 16:35:53 +02:00
Ari Koivula	d7383ccb25	Change license to LGPL. - Everyone who has contributed code to the project has been asked to license their contributions under LPGL and they have agreed. - COPYING file changed to say LGPLv2.1 instead of GPLv2. - GPL changed to LGPL in the header of every single file that a header and header added to the few that were missing one. - Also.. Happy new year!	2015-02-25 15:19:05 +02:00
Ari Koivula	669e99dd7f	Improve intra SAD AVX2 intrinsics. - Moved implementations for different sizes to inline functions that are defined using each other, reducing the amount of redundant code. - Performance of sad_8bit_32x32_avx2 improved by about 10% due to unrolling of the loop.	2014-07-25 15:59:55 +03:00
Ari Koivula	a8f7103797	Add AVX2 implementations for sad_8bit_ 8x8, 16x16 and 32x32.	2014-07-18 18:27:30 +03:00

6 commits