annotate mp3lib/dct64_sse.s @ 9520:2860f7c9d9ca

A new nuppel video encoder. Mainly for RT encoding on slow box.
author albeu
date Mon, 03 Mar 2003 11:03:19 +0000
parents 8312f4bc8dab
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
3203
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
1 / This code is a translation of dct64_k7.s from MPlayer.
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
2 / Coded by Felix Buenemann <atmosfear at users.sourceforge.net>
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
3 /
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
4 / TODO: - fix phases 4 and 5 (sse)
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
5 / - optimize scalar FPU code? (interleave with sse code)
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
6 / - fix alignment (prohibits finishing this code)
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
7 / - then use faster insns for aligned data
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
8 /
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
9 / Note: currently code is disabled as I couldn't get input data aligned!
8312f4bc8dab Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)
atmos4
parents: 1394
diff changeset
10 /
1394
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
11
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
12 //.data
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
13 // .align 8
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
14 //x_plus_minus_3dnow: .long 0x00000000, 0x80000000
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
15 //plus_1f: .float 1.0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
16
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
17 .text
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
18
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
19 .align 16
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
20
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
21 .global dct64_MMX_sse
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
22
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
23 dct64_MMX_sse:
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
24 pushl %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
25 pushl %esi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
26 pushl %edi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
27 subl $256,%esp
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
28 movl 280(%esp),%eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
29
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
30 leal 128(%esp),%edx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
31 movl 272(%esp),%esi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
32 movl 276(%esp),%edi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
33 movl $costab_mmx,%ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
34 orl %ecx,%ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
35 movl %esp,%ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
36
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
37 /* Phase 1 (complete, worx) */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
38
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
39 // [1] Process Block A1 (16 Bytes)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
40 / movq (%eax), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
41 / movq 8(%eax), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
42 movups (%eax), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
43
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
44 // Copy A1 to another register A2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
45 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
46 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
47 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
48
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
49 // Process Block B1 (last 16 bytes)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
50 / movq 120(%eax), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
51 / movq 112(%eax), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
52 movups 112(%eax), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
53
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
54 /* The PSWAPD instruction swaps or reverses the upper and lower
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
55 * doublewords of the source operand. PSWAPD mmreg1, mmreg2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
56 * performs the following operations:
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
57 * temp = mmreg2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
58 * mmreg1[63:32] = temp[31:0 ]
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
59 * mmreg1[31:0 ] = temp[63:32]
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
60 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
61 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
62 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
63 // shufps here exchanges a,b,c,d to b,a,d,c in xmm1 (desc ia32-ref p.752)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
64 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
65 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
66
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
67 // Add B1 to A1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
68 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
69 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
70 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
71
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
72 // Save Block A1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
73 / movq %mm0, (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
74 / movq %mm4, 8(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
75 movups %xmm0, (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
76
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
77 // Sub B1 from A2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
78 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
79 / pfsub %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
80 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
81
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
82 // Mul mem with A2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
83 / pfmul (%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
84 / pfmul 8(%ebx), %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
85 movups (%ebx), %xmm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
86 mulps %xmm7, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
87
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
88 // Shuffle A2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
89 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
90 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
91 // I do a,b,c,d -> d,c,b,a to suit order when writing to mem (saves one shufps)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
92 shufps $27, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
93
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
94 // Save A2 to mem (end)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
95 / movq %mm3, 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
96 / movq %mm7, 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
97 movups %xmm2, 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
98
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
99 // [2] Process next data block
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
100 / movq 16(%eax), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
101 / movq 24(%eax), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
102 movups 16(%eax), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
103
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
104 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
105 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
106 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
107
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
108 / movq 104(%eax), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
109 / movq 96(%eax), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
110 movups 96(%eax), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
111
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
112 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
113 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
114 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
115 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
116
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
117 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
118 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
119 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
120
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
121 / movq %mm0, 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
122 / movq %mm4, 24(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
123 movups %xmm0, 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
124
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
125 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
126 / pfsub %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
127 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
128
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
129 / pfmul 16(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
130 / pfmul 24(%ebx), %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
131 movups 16(%ebx), %xmm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
132 mulps %xmm7, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
133
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
134 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
135 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
136 shufps $27, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
137
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
138 / movq %mm3, 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
139 / movq %mm7, 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
140 movups %xmm2, 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
141
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
142 // [3]
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
143 / movq 32(%eax), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
144 / movq 40(%eax), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
145 movups 32(%eax), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
146
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
147 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
148 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
149 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
150
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
151 / movq 88(%eax), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
152 / movq 80(%eax), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
153 movups 80(%eax), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
154
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
155 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
156 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
157 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
158 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
159
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
160 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
161 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
162 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
163
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
164 / movq %mm0, 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
165 / movq %mm4, 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
166 movups %xmm0, 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
167
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
168 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
169 / pfsub %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
170 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
171
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
172 / pfmul 32(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
173 / pfmul 40(%ebx), %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
174 movups 32(%ebx), %xmm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
175 mulps %xmm7, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
176
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
177 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
178 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
179 shufps $27, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
180
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
181 / movq %mm3, 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
182 / movq %mm7, 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
183 movups %xmm2, 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
184
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
185 // [4]
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
186 / movq 48(%eax), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
187 / movq 56(%eax), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
188 movups 48(%eax), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
189
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
190 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
191 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
192 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
193
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
194 / movq 72(%eax), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
195 / movq 64(%eax), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
196 movups 64(%eax), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
197
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
198 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
199 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
200 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
201 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
202
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
203 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
204 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
205 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
206
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
207 / movq %mm0, 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
208 / movq %mm4, 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
209 movups %xmm0, 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
210
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
211 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
212 / pfsub %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
213 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
214
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
215 / pfmul 48(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
216 / pfmul 56(%ebx), %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
217 movups 48(%ebx), %xmm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
218 mulps %xmm7, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
219
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
220 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
221 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
222 shufps $27, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
223
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
224 / movq %mm3, 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
225 / movq %mm7, 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
226 movups %xmm2, 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
227
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
228
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
229 // phase 1 fpu code
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
230 /* Phase 1*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
231 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
232 flds (%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
233 leal 128(%esp),%edx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
234 fadds 124(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
235 movl 272(%esp),%esi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
236 fstps (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
237 movl 276(%esp),%edi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
238
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
239 flds 4(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
240 movl $costab_mmx,%ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
241 fadds 120(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
242 orl %ecx,%ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
243 fstps 4(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
244
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
245 flds (%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
246 movl %esp,%ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
247 fsubs 124(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
248 fmuls (%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
249 fstps 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
250
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
251 flds 4(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
252 fsubs 120(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
253 fmuls 4(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
254 fstps 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
255
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
256 flds 8(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
257 fadds 116(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
258 fstps 8(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
259
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
260 flds 12(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
261 fadds 112(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
262 fstps 12(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
263
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
264 flds 8(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
265 fsubs 116(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
266 fmuls 8(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
267 fstps 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
268
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
269 flds 12(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
270 fsubs 112(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
271 fmuls 12(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
272 fstps 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
273
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
274 flds 16(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
275 fadds 108(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
276 fstps 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
277
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
278 flds 20(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
279 fadds 104(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
280 fstps 20(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
281
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
282 flds 16(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
283 fsubs 108(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
284 fmuls 16(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
285 fstps 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
286
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
287 flds 20(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
288 fsubs 104(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
289 fmuls 20(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
290 fstps 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
291
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
292 flds 24(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
293 fadds 100(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
294 fstps 24(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
295
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
296 flds 28(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
297 fadds 96(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
298 fstps 28(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
299
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
300 flds 24(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
301 fsubs 100(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
302 fmuls 24(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
303 fstps 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
304
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
305 flds 28(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
306 fsubs 96(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
307 fmuls 28(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
308 fstps 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
309
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
310 flds 32(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
311 fadds 92(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
312 fstps 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
313
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
314 flds 36(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
315 fadds 88(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
316 fstps 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
317
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
318 flds 32(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
319 fsubs 92(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
320 fmuls 32(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
321 fstps 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
322
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
323 flds 36(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
324 fsubs 88(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
325 fmuls 36(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
326 fstps 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
327
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
328 flds 40(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
329 fadds 84(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
330 fstps 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
331
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
332 flds 44(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
333 fadds 80(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
334 fstps 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
335
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
336 flds 40(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
337 fsubs 84(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
338 fmuls 40(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
339 fstps 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
340
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
341 flds 44(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
342 fsubs 80(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
343 fmuls 44(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
344 fstps 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
345
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
346 flds 48(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
347 fadds 76(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
348 fstps 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
349
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
350 flds 52(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
351 fadds 72(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
352 fstps 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
353
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
354 flds 48(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
355 fsubs 76(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
356 fmuls 48(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
357 fstps 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
358
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
359 flds 52(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
360 fsubs 72(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
361 fmuls 52(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
362 fstps 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
363
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
364 flds 56(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
365 fadds 68(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
366 fstps 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
367
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
368 flds 60(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
369 fadds 64(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
370 fstps 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
371
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
372 flds 56(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
373 fsubs 68(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
374 fmuls 56(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
375 fstps 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
376
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
377 flds 60(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
378 fsubs 64(%eax)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
379 fmuls 60(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
380 fstps 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
381 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
382 // end phase 1 fpu code
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
383
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
384 /* Phase 2 (completed, worx) */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
385
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
386 / movq (%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
387 / movq 8(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
388 movups (%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
389
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
390 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
391 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
392 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
393
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
394 / movq 56(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
395 / movq 48(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
396 movups 48(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
397
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
398 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
399 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
400 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
401 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
402
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
403 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
404 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
405 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
406
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
407 / movq %mm0, (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
408 / movq %mm4, 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
409 movups %xmm0, (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
410
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
411 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
412 / pfsub %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
413 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
414
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
415 / pfmul 64(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
416 / pfmul 72(%ebx), %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
417 movups 64(%ebx), %xmm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
418 mulps %xmm7, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
419
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
420 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
421 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
422 shufps $27, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
423
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
424 / movq %mm3, 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
425 / movq %mm7, 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
426 movups %xmm2, 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
427
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
428 / movq 16(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
429 / movq 24(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
430 movups 16(%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
431
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
432 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
433 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
434 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
435
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
436 / movq 40(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
437 / movq 32(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
438 movups 32(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
439
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
440 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
441 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
442 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
443 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
444
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
445 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
446 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
447 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
448
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
449 / movq %mm0, 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
450 / movq %mm4, 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
451 movups %xmm0, 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
452
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
453 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
454 / pfsub %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
455 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
456
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
457 / pfmul 80(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
458 / pfmul 88(%ebx), %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
459 movups 80(%ebx), %xmm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
460 mulps %xmm7, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
461
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
462 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
463 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
464 shufps $27, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
465
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
466 / movq %mm3, 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
467 / movq %mm7, 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
468 movups %xmm2, 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
469
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
470
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
471 // phase 2 fpu
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
472 /* Phase 2*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
473 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
474 flds (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
475 fadds 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
476 fstps (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
477
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
478 flds 4(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
479 fadds 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
480 fstps 4(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
481
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
482 flds (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
483 fsubs 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
484 fmuls 64(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
485 fstps 60(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
486
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
487 flds 4(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
488 fsubs 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
489 fmuls 68(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
490 fstps 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
491
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
492 flds 8(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
493 fadds 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
494 fstps 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
495
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
496 flds 12(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
497 fadds 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
498 fstps 12(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
499
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
500 flds 8(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
501 fsubs 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
502 fmuls 72(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
503 fstps 52(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
504
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
505 flds 12(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
506 fsubs 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
507 fmuls 76(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
508 fstps 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
509
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
510 flds 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
511 fadds 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
512 fstps 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
513
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
514 flds 20(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
515 fadds 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
516 fstps 20(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
517
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
518 flds 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
519 fsubs 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
520 fmuls 80(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
521 fstps 44(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
522
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
523 flds 20(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
524 fsubs 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
525 fmuls 84(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
526 fstps 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
527
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
528 flds 24(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
529 fadds 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
530 fstps 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
531
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
532 flds 28(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
533 fadds 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
534 fstps 28(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
535
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
536 flds 24(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
537 fsubs 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
538 fmuls 88(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
539 fstps 36(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
540
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
541 flds 28(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
542 fsubs 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
543 fmuls 92(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
544 fstps 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
545 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
546 // end phase 2 fpu
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
547
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
548 /* Phase 3 (completed, working) */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
549
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
550 / movq 64(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
551 / movq 72(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
552 movups 64(%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
553
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
554 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
555 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
556 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
557
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
558 / movq 120(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
559 / movq 112(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
560 movups 112(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
561
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
562 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
563 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
564 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
565 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
566
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
567 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
568 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
569 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
570
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
571 / movq %mm0, 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
572 / movq %mm4, 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
573 movups %xmm0, 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
574
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
575 / pfsubr %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
576 / pfsubr %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
577 // optimized (xmm1<->xmm2)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
578 subps %xmm2, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
579
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
580 / pfmul 64(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
581 / pfmul 72(%ebx), %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
582 movups 64(%ebx), %xmm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
583 mulps %xmm7, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
584
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
585 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
586 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
587 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
588
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
589 / movq %mm3, 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
590 / movq %mm7, 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
591 movups %xmm1, 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
592
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
593
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
594 / movq 80(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
595 / movq 88(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
596 movups 80(%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
597
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
598 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
599 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
600 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
601
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
602 / movq 104(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
603 / movq 96(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
604 movups 96(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
605
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
606 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
607 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
608 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
609 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
610
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
611 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
612 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
613 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
614
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
615 / movq %mm0, 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
616 / movq %mm4, 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
617 movups %xmm0, 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
618
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
619 / pfsubr %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
620 / pfsubr %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
621 // optimized (xmm1<->xmm2)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
622 subps %xmm2, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
623
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
624 / pfmul 80(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
625 / pfmul 88(%ebx), %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
626 movups 80(%ebx), %xmm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
627 mulps %xmm7, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
628
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
629 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
630 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
631 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
632
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
633 / movq %mm3, 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
634 / movq %mm7, 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
635 movups %xmm1, 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
636
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
637
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
638 // phase 3 fpu
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
639 /* Phase 3*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
640 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
641 flds 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
642 fadds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
643 fstps 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
644
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
645 flds 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
646 fadds 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
647 fstps 68(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
648
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
649 flds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
650 fsubs 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
651 fmuls 64(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
652 fstps 124(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
653
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
654 flds 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
655 fsubs 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
656 fmuls 68(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
657 fstps 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
658
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
659 flds 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
660 fadds 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
661 fstps 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
662
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
663 flds 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
664 fadds 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
665 fstps 76(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
666
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
667 flds 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
668 fsubs 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
669 fmuls 72(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
670 fstps 116(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
671
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
672 flds 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
673 fsubs 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
674 fmuls 76(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
675 fstps 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
676
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
677 flds 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
678 fadds 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
679 fstps 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
680
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
681 flds 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
682 fadds 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
683 fstps 84(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
684
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
685 flds 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
686 fsubs 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
687 fmuls 80(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
688 fstps 108(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
689
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
690 flds 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
691 fsubs 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
692 fmuls 84(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
693 fstps 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
694
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
695 flds 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
696 fadds 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
697 fstps 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
698
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
699 flds 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
700 fadds 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
701 fstps 92(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
702
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
703 flds 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
704 fsubs 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
705 fmuls 88(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
706 fstps 100(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
707
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
708 flds 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
709 fsubs 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
710 fmuls 92(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
711 fstps 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
712 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
713 // end phase 3 fpu
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
714
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
715
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
716 /* Phase 4 (completed, buggy) */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
717 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
718 / movq 96(%ebx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
719 / movq 104(%ebx), %mm6
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
720 movups 96(%ebx), %xmm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
721
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
722
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
723 / movq (%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
724 / movq 8(%ecx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
725 movups (%ecx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
726
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
727 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
728 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
729 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
730
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
731 / movq 24(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
732 / movq 16(%ecx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
733 movups 16(%ecx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
734
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
735 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
736 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
737 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
738 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
739
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
740 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
741 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
742 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
743
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
744 / movq %mm0, (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
745 / movq %mm4, 8(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
746 movups %xmm0, (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
747
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
748 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
749 / pfsub %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
750 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
751
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
752 / pfmul %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
753 / pfmul %mm6, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
754 mulps %xmm4, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
755
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
756 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
757 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
758 shufps $27, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
759
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
760 / movq %mm3, 24(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
761 / movq %mm7, 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
762 movups %xmm2, 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
763
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
764 / movq 32(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
765 / movq 40(%ecx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
766 movups 32(%ecx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
767
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
768 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
769 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
770 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
771
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
772 / movq 56(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
773 / movq 48(%ecx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
774 movups 48(%ecx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
775
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
776 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
777 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
778 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
779 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
780
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
781 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
782 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
783 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
784
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
785 / movq %mm0, 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
786 / movq %mm4, 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
787 movups %xmm0, 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
788
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
789 / pfsubr %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
790 / pfsubr %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
791 // Luckily we can swap this (xmm1<->xmm2)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
792 subps %xmm2, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
793
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
794 / pfmul %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
795 / pfmul %mm6, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
796 mulps %xmm4, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
797
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
798 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
799 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
800 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
801
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
802 / movq %mm3, 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
803 / movq %mm7, 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
804 movups %xmm1, 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
805
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
806
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
807 / movq 64(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
808 / movq 72(%ecx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
809 movups 64(%ecx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
810
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
811 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
812 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
813 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
814
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
815 / movq 88(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
816 / movq 80(%ecx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
817 movups 80(%ecx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
818
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
819 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
820 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
821 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
822 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
823
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
824 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
825 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
826 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
827
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
828 / movq %mm0, 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
829 / movq %mm4, 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
830 movups %xmm0, 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
831
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
832 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
833 / pfsub %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
834 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
835
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
836 / pfmul %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
837 / pfmul %mm6, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
838 mulps %xmm4, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
839
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
840 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
841 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
842 shufps $27, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
843
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
844 / movq %mm3, 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
845 / movq %mm7, 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
846 movups %xmm2, 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
847
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
848
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
849 / movq 96(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
850 / movq 104(%ecx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
851 movups 96(%ecx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
852
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
853 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
854 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
855 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
856
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
857 / movq 120(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
858 / movq 112(%ecx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
859 movups 112(%ecx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
860
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
861 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
862 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
863 //// shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
864 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
865
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
866 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
867 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
868 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
869
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
870 / movq %mm0, 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
871 / movq %mm4, 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
872 movups %xmm0, 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
873
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
874 / pfsubr %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
875 / pfsubr %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
876 // This is already optimized, so xmm2 must be swapped with xmm1 for rest of phase
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
877 subps %xmm2, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
878
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
879 / pfmul %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
880 / pfmul %mm6, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
881 mulps %xmm4, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
882
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
883 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
884 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
885 shufps $27, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
886
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
887 / movq %mm3, 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
888 / movq %mm7, 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
889 movups %xmm1, 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
890 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
891
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
892 // phase 4 fpu code
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
893 /* Phase 4*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
894
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
895 flds (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
896 fadds 28(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
897 fstps (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
898
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
899 flds (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
900 fsubs 28(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
901 fmuls 96(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
902 fstps 28(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
903
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
904 flds 4(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
905 fadds 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
906 fstps 4(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
907
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
908 flds 4(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
909 fsubs 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
910 fmuls 100(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
911 fstps 24(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
912
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
913 flds 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
914 fadds 20(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
915 fstps 8(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
916
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
917 flds 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
918 fsubs 20(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
919 fmuls 104(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
920 fstps 20(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
921
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
922 flds 12(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
923 fadds 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
924 fstps 12(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
925
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
926 flds 12(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
927 fsubs 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
928 fmuls 108(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
929 fstps 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
930
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
931 flds 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
932 fadds 60(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
933 fstps 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
934
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
935 flds 60(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
936 fsubs 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
937 fmuls 96(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
938 fstps 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
939
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
940 flds 36(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
941 fadds 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
942 fstps 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
943
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
944 flds 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
945 fsubs 36(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
946 fmuls 100(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
947 fstps 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
948
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
949 flds 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
950 fadds 52(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
951 fstps 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
952
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
953 flds 52(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
954 fsubs 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
955 fmuls 104(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
956 fstps 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
957
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
958 flds 44(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
959 fadds 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
960 fstps 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
961
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
962 flds 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
963 fsubs 44(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
964 fmuls 108(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
965 fstps 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
966
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
967 flds 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
968 fadds 92(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
969 fstps 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
970
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
971 flds 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
972 fsubs 92(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
973 fmuls 96(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
974 fstps 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
975
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
976 flds 68(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
977 fadds 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
978 fstps 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
979
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
980 flds 68(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
981 fsubs 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
982 fmuls 100(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
983 fstps 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
984
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
985 flds 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
986 fadds 84(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
987 fstps 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
988
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
989 flds 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
990 fsubs 84(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
991 fmuls 104(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
992 fstps 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
993
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
994 flds 76(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
995 fadds 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
996 fstps 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
997
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
998 flds 76(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
999 fsubs 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1000 fmuls 108(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1001 fstps 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1002
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1003 flds 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1004 fadds 124(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1005 fstps 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1006
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1007 flds 124(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1008 fsubs 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1009 fmuls 96(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1010 fstps 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1011
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1012 flds 100(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1013 fadds 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1014 fstps 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1015
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1016 flds 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1017 fsubs 100(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1018 fmuls 100(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1019 fstps 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1020
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1021 flds 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1022 fadds 116(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1023 fstps 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1024
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1025 flds 116(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1026 fsubs 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1027 fmuls 104(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1028 fstps 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1029
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1030 flds 108(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1031 fadds 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1032 fstps 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1033
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1034 flds 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1035 fsubs 108(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1036 fmuls 108(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1037 fstps 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1038
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1039 flds (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1040 fadds 12(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1041 fstps (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1042
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1043 flds (%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1044 fsubs 12(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1045 fmuls 112(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1046 fstps 12(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1047
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1048 flds 4(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1049 fadds 8(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1050 fstps 4(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1051
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1052 flds 4(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1053 fsubs 8(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1054 fmuls 116(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1055 fstps 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1056
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1057 flds 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1058 fadds 28(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1059 fstps 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1060
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1061 flds 28(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1062 fsubs 16(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1063 fmuls 112(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1064 fstps 28(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1065
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1066 flds 20(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1067 fadds 24(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1068 fstps 20(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1069
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1070 flds 24(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1071 fsubs 20(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1072 fmuls 116(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1073 fstps 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1074
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1075 flds 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1076 fadds 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1077 fstps 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1078
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1079 flds 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1080 fsubs 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1081 fmuls 112(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1082 fstps 44(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1083
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1084 flds 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1085 fadds 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1086 fstps 36(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1087
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1088 flds 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1089 fsubs 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1090 fmuls 116(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1091 fstps 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1092
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1093 flds 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1094 fadds 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1095 fstps 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1096
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1097 flds 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1098 fsubs 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1099 fmuls 112(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1100 fstps 60(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1101
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1102 flds 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1103 fadds 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1104 fstps 52(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1105
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1106 flds 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1107 fsubs 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1108 fmuls 116(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1109 fstps 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1110
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1111 flds 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1112 fadds 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1113 fstps 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1114
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1115 flds 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1116 fsubs 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1117 fmuls 112(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1118 fstps 76(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1119
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1120 flds 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1121 fadds 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1122 fstps 68(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1123
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1124 flds 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1125 fsubs 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1126 fmuls 116(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1127 fstps 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1128
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1129 flds 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1130 fadds 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1131 fstps 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1132
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1133 flds 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1134 fsubs 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1135 fmuls 112(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1136 fstps 92(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1137
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1138 flds 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1139 fadds 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1140 fstps 84(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1141
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1142 flds 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1143 fsubs 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1144 fmuls 116(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1145 fstps 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1146
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1147 flds 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1148 fadds 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1149 fstps 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1150
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1151 flds 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1152 fsubs 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1153 fmuls 112(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1154 fstps 108(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1155
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1156 flds 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1157 fadds 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1158 fstps 100(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1159
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1160 flds 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1161 fsubs 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1162 fmuls 116(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1163 fstps 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1164
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1165 flds 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1166 fadds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1167 fstps 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1168
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1169 flds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1170 fsubs 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1171 fmuls 112(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1172 fstps 124(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1173
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1174 flds 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1175 fadds 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1176 fstps 116(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1177
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1178 flds 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1179 fsubs 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1180 fmuls 116(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1181 fstps 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1182
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1183 // end of phase 4 fpu
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1184
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1185 // below stuff needs to be finished I use FPU code for first
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1186 /* Phase 5 (completed, crashing) */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1187 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1188 / movq 112(%ebx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1189 // move 8 byte data to (low)high quadword - check this! atmos
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1190 movlps 112(%ebx), %xmm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1191 // maybe I need movhlps too to get data into correct quadword
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1192 movlhps %xmm4, %xmm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1193
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1194 / movq (%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1195 / movq 16(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1196 movups (%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1197
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1198 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1199 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1200 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1201
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1202 // hmm? this is strange
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1203 / movq 8(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1204 / movq 24(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1205 movlps 8(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1206 movhps 24(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1207
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1208 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1209 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1210 pshufd $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1211
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1212 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1213 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1214 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1215
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1216 / movq %mm0, (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1217 / movq %mm4, 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1218 movlps %xmm0, (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1219 movhps %xmm0, 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1220
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1221 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1222 / pfsubr %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1223 // I need to emulate pfsubr here
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1224 movaps %xmm1, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1225 subps %xmm2, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1226 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1227 // now move correct quadword from reverse substration in xmm3 to correct
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1228 // quadword in xmm2 and leave other quadword with non-reversed substration untouched
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1229 /// shufpd $2, %xmm3, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1230 // (or $1?) (see ia32-ref p.749)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1231 // optimize
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1232 movq %xmm2, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1233 movaps %xmm3, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1234
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1235 / pfmul %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1236 / pfmul %mm2, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1237 mulps %xmm4, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1238
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1239 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1240 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1241 shufps $177, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1242
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1243 / movq %mm3, 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1244 / movq %mm7, 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1245 movlps %xmm2, 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1246 movhps %xmm2, 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1247
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1248 / movq 32(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1249 / movq 48(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1250 movlps 32(%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1251 movhps 48(%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1252
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1253 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1254 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1255 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1256
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1257 / movq 40(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1258 / movq 56(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1259 movlps 40(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1260 movhps 56(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1261
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1262 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1263 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1264 shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1265
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1266 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1267 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1268 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1269
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1270 / movq %mm0, 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1271 / movq %mm4, 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1272 movlps %xmm0, 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1273 movhps %xmm0, 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1274
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1275 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1276 / pfsubr %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1277 movaps %xmm1, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1278 subps %xmm2, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1279 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1280 /// shufpd $2, %xmm3, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1281 // (or $1?)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1282 // optimize
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1283 movq %xmm2, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1284 movaps %xmm3, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1285
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1286 / pfmul %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1287 / pfmul %mm2, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1288 mulps %xmm4, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1289
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1290 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1291 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1292 shufps $177, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1293
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1294 / movq %mm3, 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1295 / movq %mm7, 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1296 movlps %xmm2, 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1297 movhps %xmm2, 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1298
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1299
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1300 / movq 64(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1301 / movq 80(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1302 movlps 64(%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1303 movhps 80(%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1304
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1305 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1306 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1307 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1308
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1309 / movq 72(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1310 / movq 88(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1311 movlps 72(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1312 movhps 88(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1313
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1314 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1315 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1316 shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1317
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1318 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1319 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1320 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1321
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1322 / movq %mm0, 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1323 / movq %mm4, 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1324 movlps %xmm0, 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1325 movhps %xmm0, 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1326
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1327 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1328 / pfsubr %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1329 movaps %xmm1, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1330 subps %xmm2, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1331 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1332 /// shufpd $2, %xmm3, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1333 // (or $1?)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1334 // optimize
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1335 movq %xmm2, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1336 movaps %xmm3, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1337
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1338 / pfmul %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1339 / pfmul %mm2, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1340 mulps %xmm4, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1341
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1342 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1343 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1344 shufps $177, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1345
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1346 / movq %mm3, 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1347 / movq %mm7, 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1348 movlps %xmm2, 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1349 movhps %xmm2, 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1350
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1351 / movq 96(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1352 / movq 112(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1353 movups 96(%edx), %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1354
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1355 / movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1356 / movq %mm4, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1357 movaps %xmm0, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1358
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1359 / movq 104(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1360 / movq 120(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1361 movlps 104(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1362 movhps 120(%edx), %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1363
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1364 / pswapd %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1365 / pswapd %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1366 shufps $177, %xmm1, %xmm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1367
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1368 / pfadd %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1369 / pfadd %mm5, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1370 addps %xmm1, %xmm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1371
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1372 / movq %mm0, 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1373 / movq %mm4, 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1374 movups %xmm0, 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1375
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1376 / pfsub %mm1, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1377 / pfsubr %mm5, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1378 movaps %xmm1, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1379 subps %xmm2, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1380 subps %xmm1, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1381 /// shufpd $2, %xmm3, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1382 // (or $1?)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1383 // optimize
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1384 movq %xmm2, %xmm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1385 movaps %xmm3, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1386
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1387 / pfmul %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1388 / pfmul %mm2, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1389 mulps %xmm4, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1390
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1391 / pswapd %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1392 / pswapd %mm7, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1393 shufps $177, %xmm2, %xmm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1394
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1395 / movq %mm3, 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1396 / movq %mm7, 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1397 movlps %xmm2, 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1398 movhps %xmm2, 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1399 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1400
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1401
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1402 /* Phase 6. This is the end of easy road. */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1403 /* Code below is coded in scalar mode. Should be optimized */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1404 //
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1405 // movd plus_1f, %mm6
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1406 // punpckldq 120(%ebx), %mm6 /* mm6 = 1.0 | 120(%ebx)*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1407 // movq x_plus_minus_3dnow, %mm7 /* mm7 = +1 | -1 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1408 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1409 movq 32(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1410 movq 64(%ecx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1411 movq %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1412 movq %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1413 pxor %mm7, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1414 pxor %mm7, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1415 pfacc %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1416 pfacc %mm3, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1417 pfmul %mm6, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1418 pfmul %mm6, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1419 movq %mm0, 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1420 movq %mm2, 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1421
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1422 movd 44(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1423 movd 40(%ecx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1424 movd 120(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1425 punpckldq 76(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1426 punpckldq 72(%ecx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1427 punpckldq %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1428 movq %mm0, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1429 movq %mm2, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1430 pfsub %mm2, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1431 pfmul %mm3, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1432 movq %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1433 pfadd %mm5, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1434 pfadd %mm4, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1435 movq %mm0, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1436 punpckldq %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1437 punpckhdq %mm1, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1438 movq %mm0, 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1439 movq %mm2, 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1440
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1441 movd 48(%ecx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1442 movd 60(%ecx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1443 pfsub 52(%ecx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1444 pfsub 56(%ecx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1445 pfmul 120(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1446 pfmul 120(%ebx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1447 movq %mm2, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1448
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1449 pfadd 56(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1450 pfadd 60(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1451 movq %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1452
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1453 pfadd 48(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1454 pfadd 52(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1455 pfadd %mm3, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1456 punpckldq %mm2, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1457 pfadd %mm3, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1458 punpckldq %mm2, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1459 movq %mm1, 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1460 movq %mm0, 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1461 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1462 /*---*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1463 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1464 movd 92(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1465 pfsub 88(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1466 pfmul 120(%ebx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1467 movd %mm1, 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1468 pfadd 92(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1469 pfadd 88(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1470 movq %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1471
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1472 pfadd 80(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1473 pfadd 84(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1474 movd %mm0, 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1475
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1476 movd 80(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1477 pfsub 84(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1478 pfmul 120(%ebx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1479 pfadd %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1480 pfadd 92(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1481 punpckldq %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1482 movq %mm0, 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1483
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1484 movq 96(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1485 movq %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1486 pxor %mm7, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1487 pfacc %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1488 pfmul %mm6, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1489 movq %mm0, 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1490
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1491 movd 108(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1492 pfsub 104(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1493 pfmul 120(%ebx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1494 movd %mm0, 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1495 pfadd 104(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1496 pfadd 108(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1497 movd %mm0, 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1498
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1499 movd 124(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1500 pfsub 120(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1501 pfmul 120(%ebx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1502 movd %mm1, 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1503 pfadd 120(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1504 pfadd 124(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1505 movq %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1506
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1507 pfadd 112(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1508 pfadd 116(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1509 movd %mm0, 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1510
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1511 movd 112(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1512 pfsub 116(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1513 pfmul 120(%ebx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1514 pfadd %mm0,%mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1515 pfadd 124(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1516 punpckldq %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1517 movq %mm0, 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1518
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1519 jnz .L01
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1520 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1521
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1522
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1523 /* Phase 7*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1524 /* Code below is coded in scalar mode. Should be optimized */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1525 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1526 movd (%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1527 pfadd 4(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1528 movd %mm0, 1024(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1529
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1530 movd (%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1531 pfsub 4(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1532 pfmul 120(%ebx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1533 movd %mm0, (%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1534 movd %mm0, (%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1535
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1536 movd 12(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1537 pfsub 8(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1538 pfmul 120(%ebx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1539 movd %mm0, 512(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1540 pfadd 12(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1541 pfadd 8(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1542 movd %mm0, 512(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1543
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1544 movd 16(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1545 pfsub 20(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1546 pfmul 120(%ebx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1547 movq %mm0, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1548
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1549 movd 28(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1550 pfsub 24(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1551 pfmul 120(%ebx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1552 movd %mm0, 768(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1553 movq %mm0, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1554
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1555 pfadd 24(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1556 pfadd 28(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1557 movq %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1558
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1559 pfadd 16(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1560 pfadd 20(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1561 movd %mm0, 768(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1562 pfadd %mm3, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1563 movd %mm1, 256(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1564 pfadd %mm3, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1565 movd %mm2, 256(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1566 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1567
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1568
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1569 /* Phase 8*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1570 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1571 movq 32(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1572 movq 48(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1573 pfadd 48(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1574 pfadd 40(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1575 movd %mm0, 896(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1576 movd %mm1, 640(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1577 psrlq $32, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1578 psrlq $32, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1579 movd %mm0, 128(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1580 movd %mm1, 384(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1581
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1582 movd 40(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1583 pfadd 56(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1584 movd %mm0, 384(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1585
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1586 movd 56(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1587 pfadd 36(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1588 movd %mm0, 128(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1589
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1590 movd 60(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1591 movd %mm0, 896(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1592 pfadd 44(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1593 movd %mm0, 640(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1594
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1595 movq 96(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1596 movq 112(%edx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1597 movq 104(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1598 pfadd 112(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1599 pfadd 104(%edx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1600 pfadd 120(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1601 movq %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1602 movq %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1603 movq %mm4, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1604 pfadd 64(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1605 pfadd 80(%edx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1606 pfadd 72(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1607 movd %mm0, 960(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1608 movd %mm2, 704(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1609 movd %mm4, 448(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1610 psrlq $32, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1611 psrlq $32, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1612 psrlq $32, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1613 movd %mm0, 64(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1614 movd %mm2, 320(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1615 movd %mm4, 576(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1616 pfadd 80(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1617 pfadd 72(%edx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1618 pfadd 88(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1619 movd %mm1, 832(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1620 movd %mm3, 576(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1621 movd %mm5, 320(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1622 psrlq $32, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1623 psrlq $32, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1624 psrlq $32, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1625 movd %mm1, 192(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1626 movd %mm3, 448(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1627 movd %mm5, 704(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1628
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1629 movd 120(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1630 pfadd 100(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1631 movq %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1632 pfadd 88(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1633 movd %mm0, 192(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1634 pfadd 68(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1635 movd %mm1, 64(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1636
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1637 movd 124(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1638 movd %mm0, 960(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1639 pfadd 92(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1640 movd %mm0, 832(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1641
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1642 jmp .L_bye
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1643 .L01:
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1644 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1645
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1646
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1647 /* Phase 9*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1648 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1649 movq (%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1650 movq %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1651 pxor %mm7, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1652 pfacc %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1653 pfmul %mm6, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1654 pf2id %mm0, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1655 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1656 movw %ax, 512(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1657 psrlq $32, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1658 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1659 movw %ax, (%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1660
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1661 movd 12(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1662 pfsub 8(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1663 pfmul 120(%ebx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1664 pf2id %mm0, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1665 movd %mm7, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1666 movw %ax, 256(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1667 pfadd 12(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1668 pfadd 8(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1669 pf2id %mm0, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1670 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1671 movw %ax, 256(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1672
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1673 movd 16(%ecx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1674 pfsub 20(%ecx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1675 pfmul 120(%ebx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1676 movq %mm3, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1677
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1678 movd 28(%ecx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1679 pfsub 24(%ecx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1680 pfmul 120(%ebx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1681 movq %mm2, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1682
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1683 pf2id %mm2, %mm7
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1684 movd %mm7, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1685 movw %ax, 384(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1686
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1687 pfadd 24(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1688 pfadd 28(%ecx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1689 movq %mm1, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1690
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1691 pfadd 16(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1692 pfadd 20(%ecx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1693 pf2id %mm0, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1694 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1695 movw %ax, 384(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1696 pfadd %mm3, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1697 pf2id %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1698 movd %mm1, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1699 movw %ax, 128(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1700 pfadd %mm3, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1701 pf2id %mm2, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1702 movd %mm2, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1703 movw %ax, 128(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1704 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1705
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1706
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1707 /* Phase 10*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1708 /*
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1709 movq 32(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1710 movq 48(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1711 pfadd 48(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1712 pfadd 40(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1713 pf2id %mm0, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1714 pf2id %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1715 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1716 movd %mm1, %ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1717 movw %ax, 448(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1718 movw %cx, 320(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1719 psrlq $32, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1720 psrlq $32, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1721 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1722 movd %mm1, %ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1723 movw %ax, 64(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1724 movw %cx, 192(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1725
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1726 movd 40(%edx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1727 movd 56(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1728 movd 60(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1729 movd 44(%edx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1730 movd 120(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1731 punpckldq %mm4, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1732 punpckldq 124(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1733 pfadd 100(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1734 punpckldq 36(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1735 punpckldq 92(%edx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1736 movq %mm5, %mm6
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1737 pfadd %mm4, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1738 pf2id %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1739 pf2id %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1740 pfadd 88(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1741 movd %mm1, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1742 movd %mm3, %ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1743 movw %ax, 448(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1744 movw %cx, 192(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1745 pf2id %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1746 psrlq $32, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1747 psrlq $32, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1748 movd %mm5, %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1749 movd %mm1, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1750 movd %mm3, %ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1751 movw %bx, 96(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1752 movw %ax, 480(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1753 movw %cx, 64(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1754 pfadd %mm2, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1755 pf2id %mm0, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1756 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1757 pfadd 68(%edx), %mm6
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1758 movw %ax, 320(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1759 psrlq $32, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1760 pf2id %mm6, %mm6
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1761 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1762 movd %mm6, %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1763 movw %ax, 416(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1764 movw %bx, 32(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1765
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1766 movq 96(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1767 movq 112(%edx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1768 movq 104(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1769 pfadd %mm2, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1770 pfadd %mm4, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1771 pfadd 120(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1772 movq %mm0, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1773 movq %mm2, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1774 movq %mm4, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1775 pfadd 64(%edx), %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1776 pfadd 80(%edx), %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1777 pfadd 72(%edx), %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1778 pf2id %mm0, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1779 pf2id %mm2, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1780 pf2id %mm4, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1781 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1782 movd %mm2, %ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1783 movd %mm4, %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1784 movw %ax, 480(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1785 movw %cx, 352(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1786 movw %bx, 224(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1787 psrlq $32, %mm0
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1788 psrlq $32, %mm2
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1789 psrlq $32, %mm4
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1790 movd %mm0, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1791 movd %mm2, %ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1792 movd %mm4, %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1793 movw %ax, 32(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1794 movw %cx, 160(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1795 movw %bx, 288(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1796 pfadd 80(%edx), %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1797 pfadd 72(%edx), %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1798 pfadd 88(%edx), %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1799 pf2id %mm1, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1800 pf2id %mm3, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1801 pf2id %mm5, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1802 movd %mm1, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1803 movd %mm3, %ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1804 movd %mm5, %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1805 movw %ax, 416(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1806 movw %cx, 288(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1807 movw %bx, 160(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1808 psrlq $32, %mm1
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1809 psrlq $32, %mm3
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1810 psrlq $32, %mm5
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1811 movd %mm1, %eax
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1812 movd %mm3, %ecx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1813 movd %mm5, %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1814 movw %ax, 96(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1815 movw %cx, 224(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1816 movw %bx, 352(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1817
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1818 movsw
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1819
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1820 .L_bye:
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1821 addl $256,%esp
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1822 / femms
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1823 emms
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1824 popl %edi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1825 popl %esi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1826 popl %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1827 ret $12
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1828 */
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1829
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1830 // here comes old fashioned FPU code for the tough parts
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1831
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1832 /* Phase 5*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1833
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1834 flds 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1835 fadds 36(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1836 fstps 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1837
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1838 flds 32(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1839 fsubs 36(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1840 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1841 fstps 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1842
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1843 flds 44(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1844 fsubs 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1845 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1846 fsts 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1847 fadds 40(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1848 fadds 44(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1849 fstps 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1850
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1851 flds 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1852 fsubs 52(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1853 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1854
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1855 flds 60(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1856 fsubs 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1857 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1858 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1859 fadds 56(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1860 fadds 60(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1861 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1862 fadds 48(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1863 fadds 52(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1864 fstps 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1865 fadd %st(2)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1866 fstps 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1867 fsts 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1868 faddp %st(1)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1869 fstps 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1870
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1871 flds 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1872 fadds 68(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1873 fstps 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1874
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1875 flds 64(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1876 fsubs 68(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1877 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1878 fstps 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1879
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1880 flds 76(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1881 fsubs 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1882 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1883 fsts 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1884 fadds 72(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1885 fadds 76(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1886 fstps 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1887
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1888 flds 92(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1889 fsubs 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1890 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1891 fsts 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1892 fadds 92(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1893 fadds 88(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1894 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1895 fadds 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1896 fadds 84(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1897 fstps 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1898
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1899 flds 80(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1900 fsubs 84(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1901 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1902 fadd %st(0), %st(1)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1903 fadds 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1904 fstps 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1905 fstps 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1906
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1907 flds 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1908 fadds 100(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1909 fstps 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1910
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1911 flds 96(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1912 fsubs 100(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1913 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1914 fstps 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1915
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1916 flds 108(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1917 fsubs 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1918 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1919 fsts 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1920 fadds 104(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1921 fadds 108(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1922 fstps 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1923
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1924 flds 124(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1925 fsubs 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1926 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1927 fsts 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1928 fadds 120(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1929 fadds 124(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1930 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1931 fadds 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1932 fadds 116(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1933 fstps 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1934
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1935 flds 112(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1936 fsubs 116(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1937 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1938 fadd %st(0),%st(1)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1939 fadds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1940 fstps 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1941 fstps 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1942 jnz .L01
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1943
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1944
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1945 /* Phase 6*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1946
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1947 flds (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1948 fadds 4(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1949 fstps 1024(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1950
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1951 flds (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1952 fsubs 4(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1953 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1954 fsts (%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1955 fstps (%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1956
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1957 flds 12(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1958 fsubs 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1959 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1960 fsts 512(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1961 fadds 12(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1962 fadds 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1963 fstps 512(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1964
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1965 flds 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1966 fsubs 20(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1967 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1968
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1969 flds 28(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1970 fsubs 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1971 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1972 fsts 768(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1973 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1974 fadds 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1975 fadds 28(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1976 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1977 fadds 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1978 fadds 20(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1979 fstps 768(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1980 fadd %st(2)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1981 fstps 256(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1982 faddp %st(1)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1983 fstps 256(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1984
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1985 /* Phase 7*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1986
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1987 flds 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1988 fadds 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1989 fstps 896(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1990
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1991 flds 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1992 fadds 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1993 fstps 640(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1994
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1995 flds 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1996 fadds 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1997 fstps 384(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1998
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
1999 flds 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2000 fadds 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2001 fstps 128(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2002
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2003 flds 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2004 fadds 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2005 fstps 128(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2006
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2007 flds 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2008 fadds 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2009 fstps 384(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2010
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2011 flds 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2012 fsts 896(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2013 fadds 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2014 fstps 640(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2015
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2016 flds 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2017 fadds 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2018 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2019 fadds 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2020 fstps 960(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2021 fadds 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2022 fstps 832(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2023
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2024 flds 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2025 fadds 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2026 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2027 fadds 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2028 fstps 704(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2029 fadds 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2030 fstps 576(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2031
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2032 flds 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2033 fadds 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2034 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2035 fadds 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2036 fstps 448(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2037 fadds 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2038 fstps 320(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2039
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2040 flds 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2041 fadds 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2042 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2043 fadds 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2044 fstps 192(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2045 fadds 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2046 fstps 64(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2047
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2048 flds 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2049 fadds 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2050 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2051 fadds 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2052 fstps 64(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2053 fadds 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2054 fstps 192(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2055
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2056 flds 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2057 fadds 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2058 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2059 fadds 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2060 fstps 320(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2061 fadds 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2062 fstps 448(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2063
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2064 flds 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2065 fadds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2066 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2067 fadds 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2068 fstps 576(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2069 fadds 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2070 fstps 704(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2071
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2072 flds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2073 fsts 960(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2074 fadds 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2075 fstps 832(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2076 addl $256,%esp
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2077 popl %edi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2078 popl %esi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2079 popl %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2080 ret
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2081 .L01:
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2082 /* Phase 8*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2083
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2084 flds (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2085 fadds 4(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2086 fistp 512(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2087
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2088 flds (%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2089 fsubs 4(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2090 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2091
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2092 fistp (%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2093
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2094
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2095 flds 12(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2096 fsubs 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2097 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2098 fist 256(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2099 fadds 12(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2100 fadds 8(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2101 fistp 256(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2102
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2103 flds 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2104 fsubs 20(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2105 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2106
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2107 flds 28(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2108 fsubs 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2109 fmuls 120(%ebx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2110 fist 384(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2111 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2112 fadds 24(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2113 fadds 28(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2114 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2115 fadds 16(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2116 fadds 20(%ecx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2117 fistp 384(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2118 fadd %st(2)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2119 fistp 128(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2120 faddp %st(1)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2121 fistp 128(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2122
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2123 /* Phase 9*/
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2124
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2125 flds 32(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2126 fadds 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2127 fistp 448(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2128
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2129 flds 48(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2130 fadds 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2131 fistp 320(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2132
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2133 flds 40(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2134 fadds 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2135 fistp 192(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2136
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2137 flds 56(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2138 fadds 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2139 fistp 64(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2140
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2141 flds 36(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2142 fadds 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2143 fistp 64(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2144
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2145 flds 52(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2146 fadds 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2147 fistp 192(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2148
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2149 flds 60(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2150 fist 448(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2151 fadds 44(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2152 fistp 320(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2153
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2154 flds 96(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2155 fadds 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2156 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2157 fadds 64(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2158 fistp 480(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2159 fadds 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2160 fistp 416(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2161
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2162 flds 112(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2163 fadds 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2164 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2165 fadds 80(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2166 fistp 352(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2167 fadds 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2168 fistp 288(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2169
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2170 flds 104(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2171 fadds 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2172 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2173 fadds 72(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2174 fistp 224(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2175 fadds 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2176 fistp 160(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2177
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2178 flds 120(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2179 fadds 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2180 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2181 fadds 88(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2182 fistp 96(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2183 fadds 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2184 fistp 32(%esi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2185
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2186 flds 100(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2187 fadds 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2188 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2189 fadds 68(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2190 fistp 32(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2191 fadds 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2192 fistp 96(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2193
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2194 flds 116(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2195 fadds 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2196 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2197 fadds 84(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2198 fistp 160(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2199 fadds 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2200 fistp 224(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2201
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2202 flds 108(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2203 fadds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2204 fld %st(0)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2205 fadds 76(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2206 fistp 288(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2207 fadds 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2208 fistp 352(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2209
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2210 flds 124(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2211 fist 480(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2212 fadds 92(%edx)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2213 fistp 416(%edi)
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2214 movsw
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2215 addl $256,%esp
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2216 popl %edi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2217 popl %esi
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2218 popl %ebx
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2219 ret $12
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2220
d9e3f91d6da9 First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff changeset
2221 // end of FPU stuff