Pauli Oikkonen
|
fb2481b7e4
|
16x16 filtered DC implemented in AVX2
|
2019-12-10 15:54:50 +02:00 |
|
Joose Sainio
|
b78aa7b272
|
save c and k to frame
|
2019-12-06 10:52:54 +02:00 |
|
Joose Sainio
|
5b10e5fb7e
|
parameterize the clipping option
|
2019-12-06 09:51:04 +02:00 |
|
Pauli Oikkonen
|
da370ea36d
|
Implement AVX2 8x8 filtered DC algorithm
|
2019-11-28 14:10:10 +02:00 |
|
Pauli Oikkonen
|
5d9b7019ca
|
Implement a 4x4 filtered DC pred function
|
2019-11-26 17:05:54 +02:00 |
|
Joose Sainio
|
ca0060cbba
|
try the original clipping
|
2019-11-26 15:13:04 +02:00 |
|
Pauli Oikkonen
|
f1485ab087
|
Start doing an arbitrary size filtered DC pred - maybe easier to just create separate functions for fixed block sizes?
|
2019-11-25 15:20:29 +02:00 |
|
Joose Sainio
|
ce44735364
|
Update visual studio requirment
|
2019-11-21 15:04:24 +02:00 |
|
Joose Sainio
|
ab2fded8af
|
Update threadwrapper to enable pthread_rwlock_t
|
2019-11-21 13:38:40 +02:00 |
|
Joose Sainio
|
eb78aead1f
|
Fix additional potential data races
|
2019-11-21 11:03:12 +02:00 |
|
Joose Sainio
|
35d7e0d88b
|
Fix data race
|
2019-11-21 10:25:04 +02:00 |
|
Marko Viitanen
|
fc8982b8c0
|
Merge branch 'VTM7.0_sync'
|
2019-11-20 11:03:59 +02:00 |
|
Marko Viitanen
|
94d89f03c7
|
Added cfg variable intra_smoothing_disabled and some cleanup
|
2019-11-20 08:38:33 +02:00 |
|
Marko Viitanen
|
eb2caf9118
|
Fix intra angle filter, changed from gauss filter table to run-time calculated 4-tap filter
|
2019-11-19 15:15:21 +02:00 |
|
Joose Sainio
|
7a49cb60fb
|
Merge branch 'rc-intra' of gitlab.tut.fi:TIE/ultravideo/kvazaar into rc-intra
|
2019-11-19 15:09:49 +02:00 |
|
Pauli Oikkonen
|
979d66031c
|
Create a strategy out of intra_pred_filtered_dc
|
2019-11-19 14:50:31 +02:00 |
|
Joose Sainio
|
7ebe30448e
|
added tests for the new ratecontrol
|
2019-11-19 14:38:37 +02:00 |
|
Marko Viitanen
|
466d8772b0
|
Apply JVET_P0170_ZERO_POS_SIMPLIFICATION in coeff bypass coding
|
2019-11-19 14:32:38 +02:00 |
|
Joose Sainio
|
0e8815a3d8
|
test clipping qp to previous frame instead of previous ctus
|
2019-11-19 14:32:31 +02:00 |
|
Joose Sainio
|
ddb4e5a131
|
move the intra bit calculation so that it is used also with lambda rc
|
2019-11-19 14:16:48 +02:00 |
|
Joose Sainio
|
a07833f3e6
|
check that mallocs in rc initialization were successful
only call kvz_update_after_picture when using the OBA rc
|
2019-11-19 13:59:44 +02:00 |
|
Joose Sainio
|
50d410a316
|
re-enable static qp encoding and lambda rc
|
2019-11-19 13:45:58 +02:00 |
|
Pauli Oikkonen
|
fa4bb86406
|
Optimize intra_pred_planar_avx2 for 4x4 blocks
|
2019-11-19 13:39:02 +02:00 |
|
Marko Viitanen
|
3df2642b03
|
Fix qt cbf context init value
|
2019-11-19 13:27:36 +02:00 |
|
Joose Sainio
|
57e5615ece
|
Fix incorrect intra rc calculation skipping
|
2019-11-19 13:25:31 +02:00 |
|
Joose Sainio
|
6cc3bcd87e
|
Command line parameters for oba rc and implementation of the usage of the intra parameter
|
2019-11-19 09:29:06 +02:00 |
|
Joose Sainio
|
eb73548af5
|
Encode first frame completely before starting others to enable owf
|
2019-11-18 09:51:37 +02:00 |
|
Marko Viitanen
|
17a53230fd
|
Code cleanup, remove unused arrays and remove tabs
|
2019-11-18 09:01:23 +02:00 |
|
Pauli Oikkonen
|
4761d228f9
|
Start to vectorize the 4x4 loop
|
2019-11-15 17:32:40 +02:00 |
|
Pauli Oikkonen
|
8d45ab4951
|
Stupidify the 4x4 planar loop for vectorization
|
2019-11-14 17:14:04 +02:00 |
|
Marko Viitanen
|
91528f3292
|
Update contexts
|
2019-11-14 13:46:51 +02:00 |
|
Marko Viitanen
|
b309ed90be
|
Fix NAL packet and missing fields in SPS
|
2019-11-14 09:21:11 +02:00 |
|
Marko Viitanen
|
74514981a9
|
Fixed PPS, SPS and slice headers and NAL unit types
|
2019-11-13 15:59:36 +02:00 |
|
Joose Sainio
|
c759c138ed
|
Prepare the rc data structure to be shared among all frame encoders
|
2019-11-13 11:56:25 +02:00 |
|
Joose Sainio
|
cdb7c851a4
|
Fix weight calculation
|
2019-11-13 08:55:31 +02:00 |
|
Joose Sainio
|
b9b01f8036
|
WPP with threading
|
2019-11-12 12:12:57 +02:00 |
|
Joose Sainio
|
615973adca
|
should enable threading with wpp when owf is not used
|
2019-11-12 09:03:00 +02:00 |
|
Pauli Oikkonen
|
6f13f6525c
|
Merge branch 'new_prints'
|
2019-11-07 17:04:21 +02:00 |
|
Joose Sainio
|
d353f7dd1a
|
Disable debug prints, fix multiple bugs in the calculation
|
2019-11-07 15:08:57 +02:00 |
|
mercat
|
57e8c3ebc2
|
Merge branch 'ML-cplx_red_ICIP'
|
2019-11-07 13:25:47 +02:00 |
|
Pauli Oikkonen
|
558f0ec401
|
Mbps, not mbps
|
2019-11-05 18:06:00 +02:00 |
|
Pauli Oikkonen
|
2edf533925
|
Tidy the end report printing
Also fix a bug with non-integer target FPS
|
2019-11-05 17:20:00 +02:00 |
|
Joose Sainio
|
408fd4ccb6
|
Fix lambda and qp calcualtion for intra frames
also fixes a bug with selecting the clip neighbor lambda and clip neighbor qp
selection for inter frames
|
2019-11-05 10:51:39 +02:00 |
|
Pauli Oikkonen
|
c7313ce567
|
Store AVG QP information in encmain
|
2019-11-04 17:08:07 +02:00 |
|
Reima Hyvönen
|
80575c59bf
|
Some updates done to get right bitrate and avg QP
|
2019-10-31 15:56:24 +02:00 |
|
Reima Hyvönen
|
252bab8820
|
Added prints to bitrate and AVG QP
|
2019-10-31 15:56:24 +02:00 |
|
Pauli Oikkonen
|
6d7a4f555c
|
Also remove 16x16 (A * B^T)^T matrix multiply
Can be done using (B * A^T) instead, it's the exact same
|
2019-10-28 16:19:42 +02:00 |
|
Pauli Oikkonen
|
2c2deb2366
|
Tidy AVX2 32x32 matrix multiply
|
2019-10-28 16:19:42 +02:00 |
|
Pauli Oikkonen
|
98ad78b333
|
Tidy the old AVX2 32x32 matrix multiply
It was actually a very good algorithm, just looked messy!
|
2019-10-28 16:19:42 +02:00 |
|
Pauli Oikkonen
|
4a921cbdb5
|
Retain data as much in YMM registers as possible
This seems to make it a whole lot quicker
|
2019-10-28 16:19:42 +02:00 |
|