hashirama/uvg266

mirror of https://github.com/ultravideo/uvg266.git synced 2024-12-03 21:44:06 +00:00

Author	SHA1	Message	Date
Ari Koivula	0c3c93d456	Optimize intra SAD intrinsics. - Added 64x64 version for completeness. - With the exception of 16x16, these were all slightly slower than the ASM versions, as measured by "kvazaar_test -s speed -t intra_sad", but now they are on par or slightly faster. - None of these actually use any AVX2 intrinsics, and probably never will, unless someone adds an interface for doing more than one block at a time, in which case the non-destructive versions might come in handy.	2015-08-06 19:35:00 +03:00
Arttu Ylä-Outinen	f7f17a060c	Rename pixel_t to kvz_pixel.	2015-07-02 16:58:28 +03:00
Arttu Ylä-Outinen	fab07d80da	Rename macro BIT_DEPTH to KVZ_BIT_DEPTH.	2015-07-02 16:55:47 +03:00
Marko Viitanen	8ed5d06ebe	Fixed compiler warnings caused by the bipred branch merge	2015-04-23 15:12:48 +03:00
Ari Lemmetti	b9ec4b0a54	AVX2 acceleration for new luma filtering.	2015-03-11 15:33:38 +02:00
Ari Koivula	ded6fd9ee8	Renamed typedef pixel to pixel_t.	2015-03-04 16:35:53 +02:00
Ari Koivula	f6147b410a	Rename struct encoder_control to encoder_control_t. Conflicts: src/encoder_state-geometry.h src/encoderstate.h	2015-03-04 14:01:14 +02:00
Ari Koivula	d7383ccb25	Change license to LGPL. - Everyone who has contributed code to the project has been asked to license their contributions under LPGL and they have agreed. - COPYING file changed to say LGPLv2.1 instead of GPLv2. - GPL changed to LGPL in the header of every single file that a header and header added to the few that were missing one. - Also.. Happy new year!	2015-02-25 15:19:05 +02:00
Ari Lemmetti	7430622038	Copy ipol-generic strategy as a base for avx2 strategy	2015-02-05 13:28:07 +02:00
Ari Lemmetti	0e56d13b5d	Use smaller bit depth for fractional pixel interpolation	2015-01-15 15:00:09 +02:00
Ari Lemmetti	cc061b4c3d	Added ipol strategy for interpolation filters. Added initial files for AVX2 and generic strategies.	2015-01-15 14:59:37 +02:00
Ari Koivula	d893a489d6	Fix mingw compilation issue. strategies/avx2/dct-avx2.c:334:25: error: pasting "g_dct_16" and "[" does not give a valid preprocessing token - The [ is not part of the token so compilation failed on mingw GCC 4.9.1. - Fixes #86.	2014-10-10 16:32:39 +03:00
Ari Lemmetti	bcf12567d0	Added some comments.	2014-10-03 17:51:58 +03:00
Ari Lemmetti	fea517c2ae	Misc code cleanup	2014-10-03 17:06:09 +03:00
Ari Lemmetti	85682c3b6a	Removed unused transpose functions.	2014-10-03 11:39:31 +03:00
Ari Koivula	f6272f06fc	Unify signature for transform functions. - Some used block, coeff and some src, dst. Now all signatures are const input and non-const output.	2014-10-03 11:21:43 +03:00
Ari Koivula	b932cf4b21	Clean up avx2 dct macros.	2014-10-03 11:16:25 +03:00
Ari Koivula	47244a15c3	Merge branch 'dct-optimizations' Conflicts: src/strategies/avx2/dct-avx2.c src/strategies/generic/dct-generic.c	2014-10-02 13:45:21 +03:00
Ari Lemmetti	61e1510480	Transform functions in dct-avx2.c are now generated with macros.	2014-10-02 13:24:30 +03:00
Ari Lemmetti	9407610555	Moved DCT / DST matrices to dct-generic.c	2014-10-02 13:24:30 +03:00
Ari Lemmetti	7255112bd8	Added transposed DCT/DST tables. Use them while calculating transforms instead of doing runtime transpose. Added separate functions for DST and IDST.	2014-10-02 13:24:30 +03:00
Ari Lemmetti	e7bcb58846	Added 32x32 IDCT	2014-10-02 13:24:30 +03:00
Ari Lemmetti	eacf173b7e	Added 32x32 DCT for AVX2	2014-10-02 13:24:30 +03:00
Ari Lemmetti	d2856a5d40	Added 32x32 transpose	2014-10-02 13:24:30 +03:00
Ari Lemmetti	7a33f08312	Added 16x16 DCT and IDCT for AVX2	2014-10-02 13:24:30 +03:00
Ari Lemmetti	d2fe2a5391	Added 16x16 transpose	2014-10-02 13:24:30 +03:00
Ari Lemmetti	d6af146a2e	Added part of the functions 16x16 DCT needs	2014-10-02 13:24:30 +03:00
Ari Lemmetti	aba3acdfff	Added AVX2 optimized transforms for 4x4 and 8x8 blocks	2014-10-02 13:24:30 +03:00
Ari Lemmetti	41b032664d	First version of 4x4 forward DCT	2014-10-02 13:24:29 +03:00
Laurent Fasnacht	f1b303a2d2	Fix compilation errors	2014-08-11 09:53:06 +02:00
Ari Lemmetti	0beb278f5b	Partial butterfly strategy is now called DCT strategy. Made changes to transform functions in preparation for optimizations. -Moved fast_forward_dst and fast_inverse_dst to DCT strategies	2014-07-31 13:25:28 +03:00
Ari Lemmetti	6bf63bd171	Added AVX2 strategy for partial butterfly (no optimizations yet)	2014-07-31 13:25:28 +03:00
Ari Koivula	669e99dd7f	Improve intra SAD AVX2 intrinsics. - Moved implementations for different sizes to inline functions that are defined using each other, reducing the amount of redundant code. - Performance of sad_8bit_32x32_avx2 improved by about 10% due to unrolling of the loop.	2014-07-25 15:59:55 +03:00
Ari Koivula	a8f7103797	Add AVX2 implementations for sad_8bit_ 8x8, 16x16 and 32x32.	2014-07-18 18:27:30 +03:00

... 4 5 6 7 8

384 commits