log cabac.h @ 4062:683d458a251f libavcodec

age author description
Fri, 20 Oct 2006 21:34:02 +0000 michael decode_significance_8x8_x86() libavcodec
Fri, 20 Oct 2006 17:53:19 +0000 gpoirier Protect code that uses CMOV instructions with HAVE_CMOV, libavcodec
Fri, 20 Oct 2006 10:00:28 +0000 michael another instruction less in decode_significance_x86() -> 1% faster on P3 libavcodec
Fri, 20 Oct 2006 08:46:33 +0000 michael 1 instruction less libavcodec
Fri, 20 Oct 2006 00:51:04 +0000 michael reordering instructions a little in decode_significance_x86() -> 2 instructions less / 1% faster decode_residual on P3 libavcodec
Fri, 20 Oct 2006 00:35:54 +0000 michael factorize get_cabac asm (0.5% slower but it's much cleaner) libavcodec
Thu, 19 Oct 2006 20:24:33 +0000 diego Fix PIC compilation, some defines were under #ifdef !PIC but used libavcodec
Thu, 19 Oct 2006 18:22:03 +0000 michael unused variable libavcodec
Thu, 19 Oct 2006 01:19:03 +0000 michael optimize sign decoding code in decode_residual() libavcodec
Wed, 18 Oct 2006 09:37:25 +0000 gpoirier Kill a warning with MSVC libavcodec
Tue, 17 Oct 2006 22:18:29 +0000 michael x86 asm version of the decode significance loop (not 8x8) of decode_residual() -> 5% faster decode_residual() on P3 libavcodec
Tue, 17 Oct 2006 09:38:37 +0000 michael cosmetic (%%eax->%0) libavcodec
Mon, 16 Oct 2006 21:47:19 +0000 diego Fix crash with illegal instruction, cmov is available on 686 and later only. libavcodec
Mon, 16 Oct 2006 21:22:47 +0000 diego Expand some #endif comments. libavcodec
Mon, 16 Oct 2006 11:11:20 +0000 michael fix !CMOV_IS_FAST case (I am not really happy with the fix but I didn't come up with a better one quickly) libavcodec
Sun, 15 Oct 2006 21:04:10 +0000 michael 10l libavcodec
Sun, 15 Oct 2006 20:50:05 +0000 michael this code will not work with PIC as it needs 7 registers and gcc doesn't support that in PIC libavcodec
Sun, 15 Oct 2006 20:40:50 +0000 michael shift CABACContext.range right, this reduces the number of shifts needed in get_cabac() and is slightly faster on P3 (and should be much faster on P4, as the P4, except for the more recent variants, lacks an integer shifter and so shifts have ~10 times longer latency than simple operations like adds) libavcodec
Fri, 13 Oct 2006 14:21:25 +0000 michael dehack *ps_state indexing in the branchless decoder libavcodec
Thu, 12 Oct 2006 21:32:56 +0000 michael add "memory" to the clobber list; we change memory so we need it, this also fixes some problems with gcc svn libavcodec
Thu, 12 Oct 2006 14:49:19 +0000 michael prevent "mb level" get_cabac() calls from being inlined (3% faster decode_mb_cabac() on P3) libavcodec
Thu, 12 Oct 2006 07:51:18 +0000 gpoirier adds some useful comments after some of the #else, #elseif, libavcodec
Wed, 11 Oct 2006 23:17:58 +0000 diego Rename ABS macro to FFABS. libavcodec
Wed, 11 Oct 2006 17:59:40 +0000 michael slightly faster on P3, slightly slower on Athlon and probably faster on P4 libavcodec
Wed, 11 Oct 2006 16:39:50 +0000 michael moving lps state transition code a little up in the branched asm code (1% faster on P3) libavcodec
Wed, 11 Oct 2006 16:11:41 +0000 michael write cabac low and range variables as early as possible to prevent stalls from reading them before they were written, the P4 is said to dislike that a lot, on P3 it's 2% faster (START/STOP_TIMER over decode_residual) libavcodec
Wed, 11 Oct 2006 15:20:08 +0000 michael use ecx instead of cl (no speed change on P3 but might avoid partial register stalls on some cpus) libavcodec
Wed, 11 Oct 2006 14:44:17 +0000 michael make state transition tables global as they are constant and the code is slightly faster that way libavcodec
Wed, 11 Oct 2006 13:25:29 +0000 michael 10l libavcodec
Wed, 11 Oct 2006 13:21:42 +0000 michael make lps_range a global table, it's constant anyway (saves 1 addition for accessing it) libavcodec