Mercurial > libavcodec.hg
annotate x86/idct_xvid.h @ 12492:58a960d6e34c libavcodec
Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from
h264dsp_mmx.c to h264_idct.asm (as yasm code). Because the loops are now
coded in asm instead of C, this is (depending on the function) up to 50%
faster for cases where gcc didn't do a great job at looping.
Since h264_idct_add8() is now faster than the manual loop setup in h264.c,
in-asm idct calling can now be enabled for chroma as well (see r16207). For
MMX, this is 5% faster. For SSE2 (which isn't done for chroma if h264.c does
the looping), this makes it up to 50% faster. Speed gain overall is ~0.5-1.0%.
author | rbultje |
---|---|
date | Tue, 14 Sep 2010 13:36:26 +0000 |
parents | 7dd2a45249a9 |
children |
rev | line source |
---|---|
8430 | 1 /* |
2 * XVID MPEG-4 VIDEO CODEC | |
3 * | |
4 * This file is part of FFmpeg. | |
5 * | |
6 * FFmpeg is free software; you can redistribute it and/or | |
7 * modify it under the terms of the GNU Lesser General Public | |
8 * License as published by the Free Software Foundation; either | |
9 * version 2.1 of the License, or (at your option) any later version. | |
10 * | |
11 * FFmpeg is distributed in the hope that it will be useful, | |
12 * but WITHOUT ANY WARRANTY; without even the implied warranty of | |
13 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU | |
14 * Lesser General Public License for more details. | |
15 * | |
16 * You should have received a copy of the GNU Lesser General Public | |
17 * License along with FFmpeg; if not, write to the Free Software | |
18 * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA | |
19 */ | |
20 | |
21 /*! | |
11644
7dd2a45249a9
Remove explicit filename from Doxygen @file commands.
diego
parents:
8718
diff
changeset
|
22 * @file |
8430 | 23 * header for Xvid IDCT functions |
24 */ | |
25 | |
26 #ifndef AVCODEC_X86_IDCT_XVID_H | |
27 #define AVCODEC_X86_IDCT_XVID_H | |
28 | |
29 #include <stdint.h> | |
30 | |
31 void ff_idct_xvid_mmx(short *block); | |
32 void ff_idct_xvid_mmx2(short *block); | |
33 void ff_idct_xvid_sse2(short *block); | |
34 void ff_idct_xvid_sse2_put(uint8_t *dest, int line_size, short *block); | |
35 void ff_idct_xvid_sse2_add(uint8_t *dest, int line_size, short *block); | |
36 | |
37 #endif /* AVCODEC_X86_IDCT_XVID_H */ |