log

age author description
Tue, 10 Oct 2006 06:56:51 +0000 michael moving another bit&1 out, this is as fast as with it in there, but it makes more sense with it outside of the loop libavcodec
Tue, 10 Oct 2006 01:17:39 +0000 michael move the &1 out of the asm so gcc can optimize it away in inlined cases (yes this is slightly faster) libavcodec
Tue, 10 Oct 2006 01:08:39 +0000 michael replace a few and/sub/... by cmov libavcodec
Mon, 09 Oct 2006 21:57:10 +0000 michael reading 8bit mem into an 8bit register needs 2 uops on P4, 8bit->32bit with zero extension needs just 1 libavcodec
Mon, 09 Oct 2006 21:39:07 +0000 michael on the P4 inc needs twice as much time as add libavcodec
Mon, 09 Oct 2006 21:21:10 +0000 michael 10l libavcodec
Mon, 09 Oct 2006 21:14:16 +0000 michael revert remainder of the failed attempt to optimize *state=c->mps_state[s] libavcodec
Mon, 09 Oct 2006 20:51:33 +0000 michael x86 branchless cabac decoder libavcodec
Mon, 09 Oct 2006 20:44:11 +0000 michael optimize branchless C CABAC decoder libavcodec
Mon, 09 Oct 2006 18:29:46 +0000 lu_zero removing ALTIVEC_USE_REFERENCE_C_CODE, since it has no use anymore libavcodec
Mon, 09 Oct 2006 18:20:00 +0000 michael move commented-out START/STOP_TIMER to a hopefully better place for benchmarking ... libavcodec
Mon, 09 Oct 2006 15:52:17 +0000 michael drop failed attempt to optimize *state= c->mps_state[s]; libavcodec
Mon, 09 Oct 2006 14:15:53 +0000 michael 10l bugfix for some disabled code libavcodec
Mon, 09 Oct 2006 14:15:14 +0000 michael first try of a handwritten get_cabac() for x86, this is 10-20% faster on P3 depending on whether you try to subtract the START/STOP_TIMER overhead libavcodec
Mon, 09 Oct 2006 13:37:43 +0000 lu_zero add_bytes passes tests libavcodec
Mon, 09 Oct 2006 12:25:24 +0000 michael remove bytestream_end checks, seems to work fine without them and the bitstream reader doesn't check for the end either libavcodec