Tue, 10 Oct 2006 06:56:51 +0000 |
michael |
moving another bit&1 out, this is as fast as with it in there, but it makes more sense with it outside of the loop
libavcodec
|
Tue, 10 Oct 2006 01:17:39 +0000 |
michael |
move the &1 out of the asm so gcc can optimize it away in inlined cases (yes this is slightly faster)
libavcodec
|
Tue, 10 Oct 2006 01:08:39 +0000 |
michael |
replace a few and/sub/... by cmov
libavcodec
|
Mon, 09 Oct 2006 21:57:10 +0000 |
michael |
reading 8bit mem into a 8bit register needs 2 uops on P4, 8bit->32bit with zero extension needs just 1
libavcodec
|
Mon, 09 Oct 2006 21:39:07 +0000 |
michael |
on the P4 inc needs twice as much time a add
libavcodec
|
Mon, 09 Oct 2006 21:21:10 +0000 |
michael |
10l
libavcodec
|
Mon, 09 Oct 2006 21:14:16 +0000 |
michael |
reverse remainder of the failed attempt to optimize *state=c->mps_state[s]
libavcodec
|
Mon, 09 Oct 2006 20:51:33 +0000 |
michael |
x86 branchless cabac decoder
libavcodec
|
Mon, 09 Oct 2006 20:44:11 +0000 |
michael |
optimize branchless C CABAC decoder
libavcodec
|
Mon, 09 Oct 2006 18:29:46 +0000 |
lu_zero |
removing ALTIVEC_USE_REFERENCE_C_CODE, since has no use anymore
libavcodec
|
Mon, 09 Oct 2006 18:20:00 +0000 |
michael |
move outcommented START/STOP_TIMER to a hopefully better place for benchmarking ...
libavcodec
|
Mon, 09 Oct 2006 15:52:17 +0000 |
michael |
drop failed attempt to optimize *state= c->mps_state[s];
libavcodec
|
Mon, 09 Oct 2006 14:15:53 +0000 |
michael |
10l bugfix for some disabled code
libavcodec
|
Mon, 09 Oct 2006 14:15:14 +0000 |
michael |
first try of a handwritten get_cabac() for x86, this is 10-20% faster on P3 depening on if you try to subtract the START/STOP_TIMER overhead
libavcodec
|
Mon, 09 Oct 2006 13:37:43 +0000 |
lu_zero |
add_bytes passes tests
libavcodec
|
Mon, 09 Oct 2006 12:25:24 +0000 |
michael |
remove bytestream_end checks, seems to work fine without them and the bitstream reader doesnt check for the end either
libavcodec
|