Commit graph

4626 commits

Author SHA1 Message Date
Joose Sainio 2caf077cff Remove avx512 intrinsics 2023-09-26 09:21:23 +03:00
Joose Sainio 254826d396 [avx2] Add comments 2023-09-26 09:21:19 +03:00
Joose Sainio f2fb641acb [avx2] Replace inefficient loop with AVX2 code 2023-09-26 09:21:19 +03:00
Joose Sainio bc24601369 [avx2] Improve avx2 version of update_common_context 2023-09-26 09:21:19 +03:00
Joose Sainio 915104cf10 [dep_quant] Change order of absLevels 2023-09-26 09:21:18 +03:00
Joose Sainio d850c346d6 [dep_quant] Change order of ctxInit 2023-09-26 09:21:18 +03:00
Joose Sainio a624988c91 [dep_quant] Separate abs levels and ctx init 2023-09-26 09:21:18 +03:00
Joose Sainio dda972c665 [avx2] Try to do lnz decision with avx2 2023-09-26 09:21:18 +03:00
Joose Sainio cf6f03b73b [avx2] This has worked but I'm pretty sure these should be unaligned 2023-09-26 09:20:56 +03:00
Joose Sainio b4c84e820c [avx2] Simplify 2023-09-26 09:20:56 +03:00
Joose Sainio 2811ce58f4 [avx2] AVX2 version of depquant now exactly matches scalar version 2023-09-26 09:20:56 +03:00
Joose Sainio 48ea4bff4d [dep_quant] Fix rate_estimator and quant_block init cases 2023-09-26 09:20:55 +03:00
Joose Sainio dfff9a8030 [avx2] Move dep quant stuff to strategies 2023-09-26 09:20:55 +03:00
Joose Sainio 0591342b3a [avx2] replace or 2023-09-26 09:20:38 +03:00
Joose Sainio 8b1d6fab59 [avx2] Replace loads and stores with non-avx512 stores 2023-09-26 09:20:37 +03:00
Joose Sainio 6d0a3fa5fc [avx2] Replace _mm_and_epi32 with _mm_and_si128 2023-09-26 09:20:37 +03:00
Joose Sainio 7fdc045690 [dep_quant] Clean up 2023-09-26 09:20:37 +03:00
Joose Sainio 8eb0f66734 [depquant] update_state_eos_avx2 working 2023-09-26 09:20:37 +03:00
Joose Sainio 00cc58bc55 [depquant] Only initialize rate_estimator when necessary 2023-09-26 09:20:37 +03:00
Joose Sainio 00f838306f [depquant] Initialize quant_block only when necessary 2023-09-26 09:20:37 +03:00
Joose Sainio 9e27b4056a [avx2] WIP update_state_eos_avx2 2023-09-26 09:20:36 +03:00
Joose Sainio c56350b8d6 [avx2] and last 2023-09-26 09:20:36 +03:00
Joose Sainio 9f69713c24 [depquant] remove an unnecessary memcpy 2023-09-26 09:20:36 +03:00
Joose Sainio aa48943c22 [avx2] Do decision cost comparison with avx2 2023-09-26 09:20:36 +03:00
Joose Sainio cd6110cfac [depquant] Pre calculate things: sig_ctx_offset gtx_ctx_offset cg_pos pos_y pos_x next_sbb_right next_sbb_below 2023-09-26 09:20:30 +03:00
Joose Sainio 8f4c3cecbf [avx2] update_states_avx2 working 2023-09-26 09:20:29 +03:00
Joose Sainio 58a66c0654 [avx2] WIP update_states_avx2 2023-09-26 09:20:29 +03:00
Joose Sainio 04be92a8ec [avx2] simplify 2023-09-26 09:20:29 +03:00
Joose Sainio 8b19c468cf [avx2] check_rd_costs_avx2 done 2023-09-26 09:20:29 +03:00
Joose Sainio c6e6f5da33 [avx2] WIP check_rd_costs_avx2, almost? 2023-09-26 09:20:29 +03:00
Joose Sainio 8caabcde1a [avx2] WIP check_rd_costs_avx2 2023-09-26 09:20:28 +03:00
Joose Sainio 2912db5fca [dep_quant.c] Small refactor 2023-09-26 09:20:28 +03:00
Joose Sainio 64d34f8559 [depquant] AoS -> SoA pre quant 2023-09-26 09:20:28 +03:00
Joose Sainio 2f1e9c4020 [depquant] AoS -> SoA all states 2023-09-26 09:20:28 +03:00
Joose Sainio 73442f1bba [depquant] AoS -> SoA for Decision 2023-09-26 09:20:28 +03:00
Marko Viitanen 26ef1dda09 [ibc] Fix chroma SAD handling and disable chroma SAD for now 2023-08-30 15:06:08 +03:00
Marko Viitanen 0239572796 [ibc] Fix some instances where CU_INTER was checked instead of !CU_INTRA 2023-08-23 15:21:45 +03:00
Marko Viitanen 312ac6731c [ibc] dual-tree rebase fixes 2023-08-15 13:24:22 +03:00
Joose Sainio 805afb1331 [fix] Minor fixes 2023-08-15 13:11:50 +03:00
Joose Sainio 8c14fa94ba [mtt] Fix small issues with luma and chroma searches 2023-08-15 13:11:49 +03:00
Joose Sainio 7a5245c5a4 [dual-tree] Fix chroma tree split model context derivation during search 2023-08-15 13:11:31 +03:00
Joose Sainio 707e11dbcf [dual-tree] Small fixes 2023-08-15 13:11:30 +03:00
Joose Sainio 91591c7e7c [dual-tree] Remove the limitation of not allowing 2 height chroma blocks in dual tree 2023-08-15 13:11:29 +03:00
Joose Sainio 146e1cb85e [dual-tree] WIP simplification 2023-08-15 13:11:28 +03:00
Joose Sainio 0f50caa2d0 [mtt] Fix various small issues and DepQuant for non-square blocks 2023-08-15 13:11:27 +03:00
Joose Sainio d222718c22 [mtt] Minor fixes 2023-08-15 13:11:26 +03:00
Joose Sainio d69bdf79f4 [mtt] Fix couple of issues with 64x32 CUs and non square tr skip rdoq 2023-08-15 13:10:13 +03:00
Joose Sainio 7d787c6b22 [ISP] Fix ISP cost calculation and DepQuant with mts 2023-08-15 13:10:13 +03:00
Joose Sainio 6e24b9a7f9 [DepQuant] Fix isp+depquant and trskip + isp 2023-08-15 13:10:12 +03:00
Joose Sainio 93c7e9c296 [DepQuant] Fix for mts and lfnst being quantized incorrectly during search 2023-08-15 13:10:11 +03:00