hashirama/uvg266

mirror of https://github.com/ultravideo/uvg266.git synced 2024-11-25 10:54:05 +00:00

Author	SHA1	Message	Date
Ari Lemmetti	8b57b2bb1a	Refactor SATD to inline most of the function. Replace full horizontal add with shuffle and regular packed add.	2015-10-01 21:29:25 +03:00
Ari Lemmetti	55da2a9958	Add intrinsic version of SATD for 8x8 and larger blocks	2015-10-01 19:42:22 +03:00
Arttu Ylä-Outinen	3a10e9e3e0	Prefix all non-static symbols with "kvz_".	2015-08-26 13:02:28 +03:00
Ari Lemmetti	5d96dbc6c0	Make strategy selection use bit depth given via parameter instead of excluding registration with defines	2015-08-12 13:33:38 +03:00
Ari Lemmetti	4122f36089	Prevent the registration of strategies that are incompatible when KVZ_BIT_DEPTH != 8 Remove unnecessary or misleading mentions of "8bit"	2015-08-12 11:29:53 +03:00
Ari Koivula	0c3c93d456	Optimize intra SAD intrinsics. - Added 64x64 version for completeness. - With the exception of 16x16, these were all slightly slower than the ASM versions, as measured by "kvazaar_test -s speed -t intra_sad", but now they are on par or slightly faster. - None of these actually use any AVX2 intrinsics, and probably never will, unless someone adds an interface for doing more than one block at a time, in which case the non-destructive versions might come in handy.	2015-08-06 19:35:00 +03:00
Arttu Ylä-Outinen	f7f17a060c	Rename pixel_t to kvz_pixel.	2015-07-02 16:58:28 +03:00
Ari Koivula	ded6fd9ee8	Renamed typedef pixel to pixel_t.	2015-03-04 16:35:53 +02:00
Ari Koivula	d7383ccb25	Change license to LGPL. - Everyone who has contributed code to the project has been asked to license their contributions under LPGL and they have agreed. - COPYING file changed to say LGPLv2.1 instead of GPLv2. - GPL changed to LGPL in the header of every single file that a header and header added to the few that were missing one. - Also.. Happy new year!	2015-02-25 15:19:05 +02:00
Ari Koivula	669e99dd7f	Improve intra SAD AVX2 intrinsics. - Moved implementations for different sizes to inline functions that are defined using each other, reducing the amount of redundant code. - Performance of sad_8bit_32x32_avx2 improved by about 10% due to unrolling of the loop.	2014-07-25 15:59:55 +03:00
Ari Koivula	a8f7103797	Add AVX2 implementations for sad_8bit_ 8x8, 16x16 and 32x32.	2014-07-18 18:27:30 +03:00

11 commits