Wed, 01 Nov 2006 17:18:49 +0000 |
reimar |
Reenable AMD64 optimizations for cabac accidentially disabled in r6852
libavcodec
|
Wed, 01 Nov 2006 17:01:33 +0000 |
diego |
Add ARCH_X86_32 as a new define for 32 bit x86 architectures and change
libavcodec
|
Sun, 29 Oct 2006 13:14:37 +0000 |
diego |
Fix compilation with PIC enabled, BRANCHLESS_GET_CABAC is defined under
libavcodec
|
Mon, 23 Oct 2006 16:54:21 +0000 |
reimar |
CABAC assembler optimizations ported to AMD64
libavcodec
|
Fri, 20 Oct 2006 21:34:02 +0000 |
michael |
decode_significance_8x8_x86()
libavcodec
|
Fri, 20 Oct 2006 17:53:19 +0000 |
gpoirier |
Protect code that uses CMOV instructions with HAVE_CMOV,
libavcodec
|
Fri, 20 Oct 2006 10:00:28 +0000 |
michael |
another instruction less in decode_significance_x86() -> 1% faster ion P3
libavcodec
|
Fri, 20 Oct 2006 08:46:33 +0000 |
michael |
1 instruction less
libavcodec
|
Fri, 20 Oct 2006 00:51:04 +0000 |
michael |
reordering instructions a little in decode_significance_x86() -> 2 instructions less / 1% faster decode_residual on P3
libavcodec
|
Fri, 20 Oct 2006 00:35:54 +0000 |
michael |
factorize get_cabac asm (0.5% slower but its much cleaner)
libavcodec
|
Thu, 19 Oct 2006 20:24:33 +0000 |
diego |
Fix PIC compilation, some defines were under #ifdef !PIC but used
libavcodec
|
Thu, 19 Oct 2006 18:22:03 +0000 |
michael |
unused variable
libavcodec
|
Thu, 19 Oct 2006 01:19:03 +0000 |
michael |
optimize sign decoding code in decode_residual()
libavcodec
|
Wed, 18 Oct 2006 09:37:25 +0000 |
gpoirier |
Kill a warning with MSVC
libavcodec
|
Tue, 17 Oct 2006 22:18:29 +0000 |
michael |
x86 asm version of the decode significance loop (not 8x8) of decode_residual() 5% faster decode_residual() on P3
libavcodec
|
Tue, 17 Oct 2006 09:38:37 +0000 |
michael |
cosmetic (%%eax->%0)
libavcodec
|
Mon, 16 Oct 2006 21:47:19 +0000 |
diego |
Fix crash with illegal instruction, cmov is available on 686 and later only.
libavcodec
|
Mon, 16 Oct 2006 21:22:47 +0000 |
diego |
Expand some #endif comments.
libavcodec
|
Mon, 16 Oct 2006 11:11:20 +0000 |
michael |
fix !CMOV_IS_FAST case (iam not really happy with the fix but i didnt come up with a better one quickly)
libavcodec
|
Sun, 15 Oct 2006 21:04:10 +0000 |
michael |
10l
libavcodec
|
Sun, 15 Oct 2006 20:50:05 +0000 |
michael |
this code will not work with PIC as it needs 7 registers and gcc doesnt support that in PIC
libavcodec
|
Sun, 15 Oct 2006 20:40:50 +0000 |
michael |
shift CABACContext.range right, this reduces the number of shifts needed in get_cabac() and is slightly faster on P3 (and should be much faster on P4 as the P4 except the more recent variants lacks an integer shifter and so shifts have ~10 times longer latency then simple operations like adds)
libavcodec
|
Fri, 13 Oct 2006 14:21:25 +0000 |
michael |
dehack *ps_state indexing in the branchless decoder
libavcodec
|
Thu, 12 Oct 2006 21:32:56 +0000 |
michael |
add "memory" to the clobber list we change memory so we need it, this also fixes some problems with gcc svn
libavcodec
|
Thu, 12 Oct 2006 14:49:19 +0000 |
michael |
prevent "mb level" get_cabac() calls from being inlined (3% faster decode_mb_cabac() on P3)
libavcodec
|
Thu, 12 Oct 2006 07:51:18 +0000 |
gpoirier |
adds some useful comments after some of the #else, #elseif,
libavcodec
|
Wed, 11 Oct 2006 23:17:58 +0000 |
diego |
Rename ABS macro to FFABS.
libavcodec
|
Wed, 11 Oct 2006 17:59:40 +0000 |
michael |
slightly faster on P3 slightly slower on athlon and probably faster on P4
libavcodec
|
Wed, 11 Oct 2006 16:39:50 +0000 |
michael |
moving lps state transition code a little up in the branched asm code (1% faster on P3)
libavcodec
|
Wed, 11 Oct 2006 16:11:41 +0000 |
michael |
write cabac low and range variables as early as possible to prevent stalls from reading them before they where written, the P4 is said to disslike that alot, on P3 its 2% faster (START/STOP_TIMER over decode_residual)
libavcodec
|