annotate mp3lib/dct64_sse.s @ 11025:ee86d710b030
New spanish doc file. Only faq.xml remaining!
author | nauj27 |
---|---|
date | Mon, 06 Oct 2003 11:06:05 +0000 |
parents | 8312f4bc8dab |

rev 3203, changeset 8312f4bc8dab (atmos4, parent 1394): Disable SSE code and reenable FPU dct for SSE cpus (fpu code is 0.3% faster and I don't get data aligned in dct64_sse.s, so I can't finish optimizing it)

/ This code is a translation of dct64_k7.s from MPlayer.
/ Coded by Felix Buenemann <atmosfear at users.sourceforge.net>
/
/ TODO: - fix phases 4 and 5 (sse)
/       - optimize scalar FPU code? (interleave with sse code)
/       - fix alignment (prohibits finishing this code)
/       - then use faster insns for aligned data
/
/ Note: currently code is disabled as I couldn't get input data aligned!
/
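
The TODO and the note above both hinge on alignment: `movaps` raises a fault unless its memory operand sits on a 16-byte boundary, which is presumably why the code below sticks to the slower `movups`. As a rough illustration only (none of this is MPlayer code; `is_aligned16` and `align_up16` are hypothetical helper names), a C caller could check or force that alignment along these lines:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: nonzero if p sits on a 16-byte boundary,
 * which is what an aligned SSE load or store (movaps) requires. */
static int is_aligned16(const void *p)
{
    return ((uintptr_t)p & 15u) == 0;
}

/* Hypothetical helper: round p up to the next 16-byte boundary.
 * The caller must have reserved at least 15 spare bytes. */
static float *align_up16(void *p)
{
    uintptr_t addr = ((uintptr_t)p + 15u) & ~(uintptr_t)15u;
    assert((addr & 15u) == 0);
    return (float *)addr;
}
```

A buffer of 32 floats would be declared with 15 extra bytes and passed through `align_up16` before being handed to a routine that uses aligned loads.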

rev 1394, changeset d9e3f91d6da9 (atmos4): First development version of dct64, mixed with 3dnow/k7 and fpu code.

//.data
//      .align 8
//x_plus_minus_3dnow: .long 0x00000000, 0x80000000
//plus_1f: .float 1.0

.text

        .align 16

.global dct64_MMX_sse

dct64_MMX_sse:
        pushl %ebx
        pushl %esi
        pushl %edi
        subl $256,%esp
        movl 280(%esp),%eax

        leal 128(%esp),%edx
        movl 272(%esp),%esi
        movl 276(%esp),%edi
        movl $costab_mmx,%ebx
        orl %ecx,%ecx
        movl %esp,%ecx

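
The offsets 272, 276 and 280 follow from the prologue: three 4-byte pushes plus the 256-byte scratch area put the return address at `268(%esp)`, so the three 32-bit stack arguments sit just above it. A small C sketch of that arithmetic, assuming the usual 32-bit cdecl stack layout (`arg_offset` and the enum names are hypothetical, not part of the source):

```c
#include <assert.h>

enum {
    SAVED_REGS   = 3,    /* pushl %ebx, %esi, %edi */
    SCRATCH_SIZE = 256,  /* subl $256,%esp */
    WORD         = 4     /* one 32-bit stack slot */
};

/* Offset from %esp (after the prologue) of the n-th stack argument,
 * counting from 1. Assumes cdecl: the return address lies directly
 * below the first argument. */
static int arg_offset(int n)
{
    assert(n >= 1);
    int return_addr = SCRATCH_SIZE + SAVED_REGS * WORD;  /* 268 */
    return return_addr + WORD * n;
}
```

Under that assumption the first, second and third arguments end up in `%esi`, `%edi` and `%eax`, while `%edx` points at the upper half of the scratch area (`128(%esp)`) and `%ecx` at its lower half.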
        /* Phase 1 (complete, worx) */

        // [1] Process Block A1 (16 Bytes)
/       movq (%eax), %mm0
/       movq 8(%eax), %mm4
        movups (%eax), %xmm0

        // Copy A1 to another register A2
/       movq %mm0, %mm3
/       movq %mm4, %mm7
        movaps %xmm0, %xmm2

        // Process Block B1 (last 16 bytes)
/       movq 120(%eax), %mm1
/       movq 112(%eax), %mm5
        movups 112(%eax), %xmm1

        /* The PSWAPD instruction swaps or reverses the upper and lower
         * doublewords of the source operand. PSWAPD mmreg1, mmreg2
         * performs the following operations:
         * temp = mmreg2
         * mmreg1[63:32] = temp[31:0 ]
         * mmreg1[31:0 ] = temp[63:32]
         */
/       pswapd %mm1, %mm1
/       pswapd %mm5, %mm5
        // shufps here exchanges a,b,c,d to b,a,d,c in xmm1 (desc ia32-ref p.752)
////    shufps $177, %xmm1, %xmm1
        shufps $27, %xmm1, %xmm1

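
A note on the shuffle constants: the immediate of `shufps` packs four 2-bit lane selectors. When source and destination are the same register, `$27` (binary 00 01 10 11) reverses all four floats, whereas the commented-out `$177` (binary 10 11 00 01) only swaps neighbours within each pair, which is the `b,a,d,c` order the comment describes. One full reversal stands in for the two `pswapd` swaps of the 3DNow! version, because the two `movq` loads there already fetch the 64-bit halves crosswise. The plain-C model below is only a sketch of the lane selection (`shufps_same` is a hypothetical name):

```c
#include <assert.h>

/* Model of `shufps $imm, %xmmN, %xmmN` (source == destination).
 * Each 2-bit field of imm picks the input lane for one output lane,
 * starting with the lowest lane. */
static void shufps_same(float v[4], unsigned imm)
{
    assert(imm <= 255u);
    float in[4] = { v[0], v[1], v[2], v[3] };
    v[0] = in[imm & 3u];
    v[1] = in[(imm >> 2) & 3u];
    v[2] = in[(imm >> 4) & 3u];
    v[3] = in[(imm >> 6) & 3u];
}
```

Calling it with 27 on `{a, b, c, d}` yields `{d, c, b, a}`; calling it with 177 yields `{b, a, d, c}`.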
        // Add B1 to A1
/       pfadd %mm1, %mm0
/       pfadd %mm5, %mm4
        addps %xmm1, %xmm0

        // Save Block A1
/       movq %mm0, (%edx)
/       movq %mm4, 8(%edx)
        movups %xmm0, (%edx)

        // Sub B1 from A2
/       pfsub %mm1, %mm3
/       pfsub %mm5, %mm7
        subps %xmm1, %xmm2

        // Mul mem with A2
/       pfmul (%ebx), %mm3
/       pfmul 8(%ebx), %mm7
        movups (%ebx), %xmm7
        mulps %xmm7, %xmm2

        // Shuffle A2
/       pswapd %mm3, %mm3
/       pswapd %mm7, %mm7
        // I do a,b,c,d -> d,c,b,a to suit order when writing to mem (saves one shufps)
        shufps $27, %xmm2, %xmm2

        // Save A2 to mem (end)
/       movq %mm3, 120(%edx)
/       movq %mm7, 112(%edx)
        movups %xmm2, 112(%edx)

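
Block [1] is one four-wide butterfly: the first four inputs are paired with the last four in mirrored order, the sums go to the front of the output buffer, and the differences, scaled by the table, go to the back, again mirrored. The register-level C model below is a sketch under the assumption that `in`, `out` and `cos_tab` stand for the buffers behind `%eax`, `%edx` and `%ebx` (those names and `phase1_block` are illustrative; the contents of `costab_mmx` are not shown in this file). The `block` parameter generalises the byte offsets so the same model covers the later blocks too.

```c
#include <assert.h>

/* Register-level model of one phase-1 block. `block` is 0..3 and
 * selects four floats from the front of the 32-float input, paired
 * with the mirrored group counted from the end. */
static void phase1_block(const float in[32], float out[32],
                         const float cos_tab[16], int block)
{
    assert(block >= 0 && block < 4);
    int lo = 4 * block;        /* byte offset 16*block in the asm  */
    int hi = 28 - 4 * block;   /* byte offset 112 - 16*block       */
    float xmm0[4], xmm1[4], xmm2[4];

    for (int l = 0; l < 4; l++) {
        xmm0[l] = in[lo + l];        /* movups lo, %xmm0            */
        xmm2[l] = xmm0[l];           /* movaps %xmm0, %xmm2         */
        xmm1[l] = in[hi + 3 - l];    /* movups hi, then shufps $27  */
    }
    for (int l = 0; l < 4; l++) {
        out[lo + l] = xmm0[l] + xmm1[l];                  /* addps, movups */
        xmm2[l] = (xmm2[l] - xmm1[l]) * cos_tab[lo + l];  /* subps, mulps  */
    }
    for (int l = 0; l < 4; l++)
        out[hi + l] = xmm2[3 - l];   /* shufps $27, then movups     */
}
```

With `block` set to 0 this mirrors the instructions above one for one; the second `shufps $27` is what puts the scaled differences back into memory order.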
        // [2] Process next data block
/       movq 16(%eax), %mm0
/       movq 24(%eax), %mm4
        movups 16(%eax), %xmm0

/       movq %mm0, %mm3
/       movq %mm4, %mm7
        movaps %xmm0, %xmm2

/       movq 104(%eax), %mm1
/       movq 96(%eax), %mm5
        movups 96(%eax), %xmm1

/       pswapd %mm1, %mm1
/       pswapd %mm5, %mm5
////    shufps $177, %xmm1, %xmm1
        shufps $27, %xmm1, %xmm1

/       pfadd %mm1, %mm0
/       pfadd %mm5, %mm4
        addps %xmm1, %xmm0

/       movq %mm0, 16(%edx)
/       movq %mm4, 24(%edx)
        movups %xmm0, 16(%edx)

/       pfsub %mm1, %mm3
/       pfsub %mm5, %mm7
        subps %xmm1, %xmm2

/       pfmul 16(%ebx), %mm3
/       pfmul 24(%ebx), %mm7
        movups 16(%ebx), %xmm7
        mulps %xmm7, %xmm2

/       pswapd %mm3, %mm3
/       pswapd %mm7, %mm7
        shufps $27, %xmm2, %xmm2

/       movq %mm3, 104(%edx)
/       movq %mm7, 96(%edx)
        movups %xmm2, 96(%edx)

        // [3]
/       movq 32(%eax), %mm0
/       movq 40(%eax), %mm4
        movups 32(%eax), %xmm0

/       movq %mm0, %mm3
/       movq %mm4, %mm7
        movaps %xmm0, %xmm2

/       movq 88(%eax), %mm1
/       movq 80(%eax), %mm5
        movups 80(%eax), %xmm1

/       pswapd %mm1, %mm1
/       pswapd %mm5, %mm5
////    shufps $177, %xmm1, %xmm1
        shufps $27, %xmm1, %xmm1

/       pfadd %mm1, %mm0
/       pfadd %mm5, %mm4
        addps %xmm1, %xmm0

/       movq %mm0, 32(%edx)
/       movq %mm4, 40(%edx)
        movups %xmm0, 32(%edx)

/       pfsub %mm1, %mm3
/       pfsub %mm5, %mm7
        subps %xmm1, %xmm2

/       pfmul 32(%ebx), %mm3
/       pfmul 40(%ebx), %mm7
        movups 32(%ebx), %xmm7
        mulps %xmm7, %xmm2

/       pswapd %mm3, %mm3
/       pswapd %mm7, %mm7
        shufps $27, %xmm2, %xmm2

/       movq %mm3, 88(%edx)
/       movq %mm7, 80(%edx)
        movups %xmm2, 80(%edx)

        // [4]
/       movq 48(%eax), %mm0
/       movq 56(%eax), %mm4
        movups 48(%eax), %xmm0

/       movq %mm0, %mm3
/       movq %mm4, %mm7
        movaps %xmm0, %xmm2

/       movq 72(%eax), %mm1
/       movq 64(%eax), %mm5
        movups 64(%eax), %xmm1

/       pswapd %mm1, %mm1
/       pswapd %mm5, %mm5
////    shufps $177, %xmm1, %xmm1
        shufps $27, %xmm1, %xmm1

/       pfadd %mm1, %mm0
/       pfadd %mm5, %mm4
        addps %xmm1, %xmm0

/       movq %mm0, 48(%edx)
/       movq %mm4, 56(%edx)
        movups %xmm0, 48(%edx)

/       pfsub %mm1, %mm3
/       pfsub %mm5, %mm7
        subps %xmm1, %xmm2

/       pfmul 48(%ebx), %mm3
/       pfmul 56(%ebx), %mm7
        movups 48(%ebx), %xmm7
        mulps %xmm7, %xmm2

/       pswapd %mm3, %mm3
/       pswapd %mm7, %mm7
        shufps $27, %xmm2, %xmm2

/       movq %mm3, 72(%edx)
/       movq %mm7, 64(%edx)
        movups %xmm2, 64(%edx)


// phase 1 fpu code
/* Phase 1*/
/*
    flds (%eax)
    leal 128(%esp),%edx
    fadds 124(%eax)
    movl 272(%esp),%esi
    fstps (%edx)
    movl 276(%esp),%edi

    flds 4(%eax)
    movl $costab_mmx,%ebx
    fadds 120(%eax)
    orl %ecx,%ecx
    fstps 4(%edx)

    flds (%eax)
    movl %esp,%ecx
    fsubs 124(%eax)
    fmuls (%ebx)
    fstps 124(%edx)

    flds 4(%eax)
    fsubs 120(%eax)
    fmuls 4(%ebx)
    fstps 120(%edx)

    flds 8(%eax)
    fadds 116(%eax)
    fstps 8(%edx)

    flds 12(%eax)
    fadds 112(%eax)
    fstps 12(%edx)

    flds 8(%eax)
    fsubs 116(%eax)
    fmuls 8(%ebx)
    fstps 116(%edx)

    flds 12(%eax)
    fsubs 112(%eax)
    fmuls 12(%ebx)
    fstps 112(%edx)

    flds 16(%eax)
    fadds 108(%eax)
    fstps 16(%edx)

    flds 20(%eax)
    fadds 104(%eax)
    fstps 20(%edx)

    flds 16(%eax)
    fsubs 108(%eax)
    fmuls 16(%ebx)
    fstps 108(%edx)

    flds 20(%eax)
    fsubs 104(%eax)
    fmuls 20(%ebx)
    fstps 104(%edx)

    flds 24(%eax)
    fadds 100(%eax)
    fstps 24(%edx)

    flds 28(%eax)
    fadds 96(%eax)
    fstps 28(%edx)

    flds 24(%eax)
    fsubs 100(%eax)
    fmuls 24(%ebx)
    fstps 100(%edx)

    flds 28(%eax)
    fsubs 96(%eax)
    fmuls 28(%ebx)
    fstps 96(%edx)

    flds 32(%eax)
    fadds 92(%eax)
    fstps 32(%edx)

    flds 36(%eax)
    fadds 88(%eax)
    fstps 36(%edx)

    flds 32(%eax)
    fsubs 92(%eax)
    fmuls 32(%ebx)
    fstps 92(%edx)

    flds 36(%eax)
    fsubs 88(%eax)
    fmuls 36(%ebx)
    fstps 88(%edx)

    flds 40(%eax)
    fadds 84(%eax)
    fstps 40(%edx)

    flds 44(%eax)
    fadds 80(%eax)
    fstps 44(%edx)

    flds 40(%eax)
    fsubs 84(%eax)
    fmuls 40(%ebx)
    fstps 84(%edx)

    flds 44(%eax)
    fsubs 80(%eax)
    fmuls 44(%ebx)
    fstps 80(%edx)

    flds 48(%eax)
    fadds 76(%eax)
    fstps 48(%edx)

    flds 52(%eax)
    fadds 72(%eax)
    fstps 52(%edx)

    flds 48(%eax)
    fsubs 76(%eax)
    fmuls 48(%ebx)
    fstps 76(%edx)

    flds 52(%eax)
    fsubs 72(%eax)
    fmuls 52(%ebx)
    fstps 72(%edx)

    flds 56(%eax)
    fadds 68(%eax)
    fstps 56(%edx)

    flds 60(%eax)
    fadds 64(%eax)
    fstps 60(%edx)

    flds 56(%eax)
    fsubs 68(%eax)
    fmuls 56(%ebx)
    fstps 68(%edx)

    flds 60(%eax)
    fsubs 64(%eax)
    fmuls 60(%ebx)
    fstps 64(%edx)
*/
// end phase 1 fpu code
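
The commented-out FPU block can be read as one 32-point butterfly: each low element is paired with its mirror element, the sum goes to the low slot, and the scaled difference goes to the mirrored slot. Below is a minimal C sketch of that reading. It assumes `in` stands for the buffer at `%eax`, `out` for the one at `%edx`, and `costab` for the table at `%ebx`. The function and parameter names are hypothetical and do not appear in the source, and the sketch ignores x87 extended-precision intermediates.

```c
#include <stddef.h>

/* Hypothetical scalar model of the phase 1 FPU code (names are illustrative).
 * For each i in the lower half of an n-element buffer:
 *   out[i]         = in[i] + in[n-1-i]
 *   out[n-1-i]     = (in[i] - in[n-1-i]) * costab[i]
 * in and out are assumed not to overlap. */
void butterfly(const float *in, float *out, const float *costab, size_t n)
{
    for (size_t i = 0; i < n / 2; i++) {
        float lo = in[i];          /* e.g. flds  (%eax)    */
        float hi = in[n - 1 - i];  /* e.g. the 124(%eax) operand */
        out[i] = lo + hi;
        out[n - 1 - i] = (lo - hi) * costab[i];
    }
}
```

Under these assumptions, `butterfly(in, out, costab, 32)` would cover the 16 add/subtract pairs in the listing.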

/* Phase 2 (completed, worx) */

/ movq (%edx), %mm0
/ movq 8(%edx), %mm4
    movups (%edx), %xmm0

/ movq %mm0, %mm3
/ movq %mm4, %mm7
    movaps %xmm0, %xmm2

/ movq 56(%edx), %mm1
/ movq 48(%edx), %mm5
    movups 48(%edx), %xmm1

/ pswapd %mm1, %mm1
/ pswapd %mm5, %mm5
//// shufps $177, %xmm1, %xmm1
    shufps $27, %xmm1, %xmm1

/ pfadd %mm1, %mm0
/ pfadd %mm5, %mm4
    addps %xmm1, %xmm0

/ movq %mm0, (%ecx)
/ movq %mm4, 8(%ecx)
    movups %xmm0, (%ecx)

/ pfsub %mm1, %mm3
/ pfsub %mm5, %mm7
    subps %xmm1, %xmm2

/ pfmul 64(%ebx), %mm3
/ pfmul 72(%ebx), %mm7
    movups 64(%ebx), %xmm7
    mulps %xmm7, %xmm2

/ pswapd %mm3, %mm3
/ pswapd %mm7, %mm7
    shufps $27, %xmm2, %xmm2

/ movq %mm3, 56(%ecx)
/ movq %mm7, 48(%ecx)
    movups %xmm2, 48(%ecx)

/ movq 16(%edx), %mm0
/ movq 24(%edx), %mm4
    movups 16(%edx), %xmm0

/ movq %mm0, %mm3
/ movq %mm4, %mm7
    movaps %xmm0, %xmm2

/ movq 40(%edx), %mm1
/ movq 32(%edx), %mm5
    movups 32(%edx), %xmm1

/ pswapd %mm1, %mm1
/ pswapd %mm5, %mm5
//// shufps $177, %xmm1, %xmm1
    shufps $27, %xmm1, %xmm1

/ pfadd %mm1, %mm0
/ pfadd %mm5, %mm4
    addps %xmm1, %xmm0

/ movq %mm0, 16(%ecx)
/ movq %mm4, 24(%ecx)
    movups %xmm0, 16(%ecx)

/ pfsub %mm1, %mm3
/ pfsub %mm5, %mm7
    subps %xmm1, %xmm2

/ pfmul 80(%ebx), %mm3
/ pfmul 88(%ebx), %mm7
    movups 80(%ebx), %xmm7
    mulps %xmm7, %xmm2

/ pswapd %mm3, %mm3
/ pswapd %mm7, %mm7
    shufps $27, %xmm2, %xmm2

/ movq %mm3, 40(%ecx)
/ movq %mm7, 32(%ecx)
    movups %xmm2, 32(%ecx)
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
469 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
470 |
// phase 2 fpu
/* Phase 2*/
/*
        flds (%edx)
        fadds 60(%edx)
        fstps (%ecx)

        flds 4(%edx)
        fadds 56(%edx)
        fstps 4(%ecx)

        flds (%edx)
        fsubs 60(%edx)
        fmuls 64(%ebx)
        fstps 60(%ecx)

        flds 4(%edx)
        fsubs 56(%edx)
        fmuls 68(%ebx)
        fstps 56(%ecx)

        flds 8(%edx)
        fadds 52(%edx)
        fstps 8(%ecx)

        flds 12(%edx)
        fadds 48(%edx)
        fstps 12(%ecx)

        flds 8(%edx)
        fsubs 52(%edx)
        fmuls 72(%ebx)
        fstps 52(%ecx)

        flds 12(%edx)
        fsubs 48(%edx)
        fmuls 76(%ebx)
        fstps 48(%ecx)

        flds 16(%edx)
        fadds 44(%edx)
        fstps 16(%ecx)

        flds 20(%edx)
        fadds 40(%edx)
        fstps 20(%ecx)

        flds 16(%edx)
        fsubs 44(%edx)
        fmuls 80(%ebx)
        fstps 44(%ecx)

        flds 20(%edx)
        fsubs 40(%edx)
        fmuls 84(%ebx)
        fstps 40(%ecx)

        flds 24(%edx)
        fadds 36(%edx)
        fstps 24(%ecx)

        flds 28(%edx)
        fadds 32(%edx)
        fstps 28(%ecx)

        flds 24(%edx)
        fsubs 36(%edx)
        fmuls 88(%ebx)
        fstps 36(%ecx)

        flds 28(%edx)
        fsubs 32(%edx)
        fmuls 92(%ebx)
        fstps 32(%ecx)
*/
// end phase 2 fpu
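The commented-out FPU reference and the SSE sequence for Phase 2 should compute the same butterfly. The following C sketch models one four-float group both ways so the two can be compared. It is only an illustration: the function names are hypothetical, the coefficient values are placeholders rather than the real cosine table, and it assumes the low group was loaded into xmm0/xmm2 and the high group into xmm1 earlier in the routine, as the Phase 3 code does.

```c
/* Model of `shufps $27, %xmmN, %xmmN`: reverse the four float lanes. */
void reverse4(float v[4])
{
    float t;
    t = v[0]; v[0] = v[3]; v[3] = t;
    t = v[1]; v[1] = v[2]; v[2] = t;
}

/* Four-lane model, in the order shufps/addps/subps/mulps/shufps. */
void phase2_group_simd(const float lo[4], const float hi[4],
                       const float cos4[4],
                       float out_lo[4], float out_hi[4])
{
    float x0[4], x1[4], x2[4];
    int i;

    for (i = 0; i < 4; i++) { x0[i] = lo[i]; x2[i] = lo[i]; x1[i] = hi[i]; }
    reverse4(x1);                               /* shufps $27, xmm1 */
    for (i = 0; i < 4; i++) x0[i] += x1[i];     /* addps xmm1, xmm0 */
    for (i = 0; i < 4; i++) out_lo[i] = x0[i];  /* movups xmm0, out */
    for (i = 0; i < 4; i++) x2[i] -= x1[i];     /* subps xmm1, xmm2 */
    for (i = 0; i < 4; i++) x2[i] *= cos4[i];   /* mulps xmm7, xmm2 */
    reverse4(x2);                               /* shufps $27, xmm2 */
    for (i = 0; i < 4; i++) out_hi[i] = x2[i];  /* movups xmm2, out */
}

/* Scalar model, following the flds/fadds/fsubs/fmuls/fstps reference. */
void phase2_group_scalar(const float lo[4], const float hi[4],
                         const float cos4[4],
                         float out_lo[4], float out_hi[4])
{
    int i;
    for (i = 0; i < 4; i++) {
        out_lo[i]     = lo[i] + hi[3 - i];
        out_hi[3 - i] = (lo[i] - hi[3 - i]) * cos4[i];
    }
}

/* Returns 1 when both models produce the same eight outputs. */
int phase2_models_agree(void)
{
    const float lo[4]   = { 1.0f, 2.0f, 3.0f, 4.0f };
    const float hi[4]   = { 8.0f, 7.0f, 5.0f, 2.0f };
    const float cos4[4] = { 0.5f, 0.25f, 2.0f, 4.0f }; /* placeholders */
    float a_lo[4], a_hi[4], b_lo[4], b_hi[4];
    int i;

    phase2_group_simd(lo, hi, cos4, a_lo, a_hi);
    phase2_group_scalar(lo, hi, cos4, b_lo, b_hi);
    for (i = 0; i < 4; i++)
        if (a_lo[i] != b_lo[i] || a_hi[i] != b_hi[i])
            return 0;
    return 1;
}
```

For the group at byte offset 16, `lo` corresponds to 16(%edx), `hi` to 32(%edx), and `cos4` to 80(%ebx). The second lane reversal is what places the product for lane 0 at the highest address of the output group, matching `fstps 44(%ecx)` in the reference.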

/* Phase 3 (completed, working) */

/       movq 64(%edx), %mm0
/       movq 72(%edx), %mm4
        movups 64(%edx), %xmm0

/       movq %mm0, %mm3
/       movq %mm4, %mm7
        movaps %xmm0, %xmm2

/       movq 120(%edx), %mm1
/       movq 112(%edx), %mm5
        movups 112(%edx), %xmm1

/       pswapd %mm1, %mm1
/       pswapd %mm5, %mm5
////    shufps $177, %xmm1, %xmm1
        shufps $27, %xmm1, %xmm1

/       pfadd %mm1, %mm0
/       pfadd %mm5, %mm4
        addps %xmm1, %xmm0

/       movq %mm0, 64(%ecx)
/       movq %mm4, 72(%ecx)
        movups %xmm0, 64(%ecx)

/       pfsubr %mm1, %mm3
/       pfsubr %mm5, %mm7
// optimized (xmm1<->xmm2)
        subps %xmm2, %xmm1

/       pfmul 64(%ebx), %mm3
/       pfmul 72(%ebx), %mm7
        movups 64(%ebx), %xmm7
        mulps %xmm7, %xmm1

/       pswapd %mm3, %mm3
/       pswapd %mm7, %mm7
        shufps $27, %xmm1, %xmm1

/       movq %mm3, 120(%ecx)
/       movq %mm7, 112(%ecx)
        movups %xmm1, 112(%ecx)


/       movq 80(%edx), %mm0
/       movq 88(%edx), %mm4
        movups 80(%edx), %xmm0

/       movq %mm0, %mm3
/       movq %mm4, %mm7
        movaps %xmm0, %xmm2

/       movq 104(%edx), %mm1
/       movq 96(%edx), %mm5
        movups 96(%edx), %xmm1

/       pswapd %mm1, %mm1
/       pswapd %mm5, %mm5
////    shufps $177, %xmm1, %xmm1
        shufps $27, %xmm1, %xmm1

/       pfadd %mm1, %mm0
/       pfadd %mm5, %mm4
        addps %xmm1, %xmm0

/       movq %mm0, 80(%ecx)
/       movq %mm4, 88(%ecx)
        movups %xmm0, 80(%ecx)

/       pfsubr %mm1, %mm3
/       pfsubr %mm5, %mm7
// optimized (xmm1<->xmm2)
        subps %xmm2, %xmm1

/       pfmul 80(%ebx), %mm3
/       pfmul 88(%ebx), %mm7
        movups 80(%ebx), %xmm7
        mulps %xmm7, %xmm1

/       pswapd %mm3, %mm3
/       pswapd %mm7, %mm7
        shufps $27, %xmm1, %xmm1

/       movq %mm3, 104(%ecx)
/       movq %mm7, 96(%ecx)
        movups %xmm1, 96(%ecx)


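Two details of the Phase 3 SSE code above are easy to misread: why the shuffle immediate is $27 rather than the commented-out $177, and why `subps` has its operands swapped relative to the Phase 2 code. The C sketch below models both. The names are hypothetical and the coefficients are placeholders, not the real cosine table. It relies on the documented behaviour of SHUFPS, where each two-bit field of the immediate selects a source lane, and models only the case where source and destination are the same register.

```c
/* Model of `shufps $imm, %xmmA, %xmmA` (same register as source and
 * destination): result lane i is old lane ((imm >> 2*i) & 3). */
void shufps_same(unsigned imm, float v[4])
{
    float old[4];
    int i;
    for (i = 0; i < 4; i++) old[i] = v[i];
    for (i = 0; i < 4; i++) v[i] = old[(imm >> (2 * i)) & 3];
}

/* One Phase 3 group. The difference is (reversed hi) - lo, which is the
 * order pfsubr gives; SSE has no reverse subtract, so the code subtracts
 * into xmm1 instead of xmm2 ("optimized (xmm1<->xmm2)"). */
void phase3_group(const float lo[4], const float hi[4],
                  const float cos4[4],
                  float out_lo[4], float out_hi[4])
{
    float x0[4], x1[4], x2[4];
    int i;

    for (i = 0; i < 4; i++) { x0[i] = lo[i]; x2[i] = lo[i]; x1[i] = hi[i]; }
    shufps_same(27, x1);                        /* shufps $27, xmm1 */
    for (i = 0; i < 4; i++) x0[i] += x1[i];     /* addps xmm1, xmm0 */
    for (i = 0; i < 4; i++) out_lo[i] = x0[i];  /* movups xmm0, out */
    for (i = 0; i < 4; i++) x1[i] -= x2[i];     /* subps xmm2, xmm1 */
    for (i = 0; i < 4; i++) x1[i] *= cos4[i];   /* mulps xmm7, xmm1 */
    shufps_same(27, x1);                        /* shufps $27, xmm1 */
    for (i = 0; i < 4; i++) out_hi[i] = x1[i];  /* movups xmm1, out */
}
```

With immediate 27 (binary 00 01 10 11) all four lanes are reversed, which is what a single 16-byte load of the high group needs. Immediate 177 (binary 10 11 00 01) only swaps lanes within each pair; that mirrors one `pswapd` per MMX register but does not reverse the whole group.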
// phase 3 fpu
/* Phase 3*/
/*
        flds 64(%edx)
        fadds 124(%edx)
        fstps 64(%ecx)

        flds 68(%edx)
        fadds 120(%edx)
        fstps 68(%ecx)

        flds 124(%edx)
        fsubs 64(%edx)
        fmuls 64(%ebx)
        fstps 124(%ecx)

        flds 120(%edx)
        fsubs 68(%edx)
        fmuls 68(%ebx)
        fstps 120(%ecx)

        flds 72(%edx)
        fadds 116(%edx)
        fstps 72(%ecx)

        flds 76(%edx)
        fadds 112(%edx)
        fstps 76(%ecx)

        flds 116(%edx)
        fsubs 72(%edx)
        fmuls 72(%ebx)
        fstps 116(%ecx)

        flds 112(%edx)
        fsubs 76(%edx)
        fmuls 76(%ebx)
        fstps 112(%ecx)

        flds 80(%edx)
        fadds 108(%edx)
        fstps 80(%ecx)

        flds 84(%edx)
        fadds 104(%edx)
        fstps 84(%ecx)

        flds 108(%edx)
        fsubs 80(%edx)
        fmuls 80(%ebx)
        fstps 108(%ecx)

        flds 104(%edx)
        fsubs 84(%edx)
        fmuls 84(%ebx)
        fstps 104(%ecx)

        flds 88(%edx)
        fadds 100(%edx)
        fstps 88(%ecx)

        flds 92(%edx)
        fadds 96(%edx)
        fstps 92(%ecx)

        flds 100(%edx)
        fsubs 88(%edx)
        fmuls 88(%ebx)
        fstps 100(%ecx)

        flds 96(%edx)
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
709 fsubs 92(%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
710 fmuls 92(%ebx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
711 fstps 96(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
712 */ |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
713 // end phase 3 fpu |


/* Phase 4 (completed, buggy) */
/*
/ movq 96(%ebx), %mm2
/ movq 104(%ebx), %mm6
	movups 96(%ebx), %xmm4


/ movq (%ecx), %mm0
/ movq 8(%ecx), %mm4
	movups (%ecx), %xmm0

/ movq %mm0, %mm3
/ movq %mm4, %mm7
	movaps %xmm0, %xmm2

/ movq 24(%ecx), %mm1
/ movq 16(%ecx), %mm5
	movups 16(%ecx), %xmm1

/ pswapd %mm1, %mm1
/ pswapd %mm5, %mm5
//// shufps $177, %xmm1, %xmm1
	shufps $27, %xmm1, %xmm1

/ pfadd %mm1, %mm0
/ pfadd %mm5, %mm4
	addps %xmm1, %xmm0

/ movq %mm0, (%edx)
/ movq %mm4, 8(%edx)
	movups %xmm0, (%edx)

/ pfsub %mm1, %mm3
/ pfsub %mm5, %mm7
	subps %xmm1, %xmm2

/ pfmul %mm2, %mm3
/ pfmul %mm6, %mm7
	mulps %xmm4, %xmm2

/ pswapd %mm3, %mm3
/ pswapd %mm7, %mm7
	shufps $27, %xmm2, %xmm2

/ movq %mm3, 24(%edx)
/ movq %mm7, 16(%edx)
	movups %xmm2, 16(%edx)

/ movq 32(%ecx), %mm0
/ movq 40(%ecx), %mm4
	movups 32(%ecx), %xmm0

/ movq %mm0, %mm3
/ movq %mm4, %mm7
	movaps %xmm0, %xmm2

/ movq 56(%ecx), %mm1
/ movq 48(%ecx), %mm5
	movups 48(%ecx), %xmm1

/ pswapd %mm1, %mm1
/ pswapd %mm5, %mm5
//// shufps $177, %xmm1, %xmm1
	shufps $27, %xmm1, %xmm1

/ pfadd %mm1, %mm0
/ pfadd %mm5, %mm4
	addps %xmm1, %xmm0

/ movq %mm0, 32(%edx)
/ movq %mm4, 40(%edx)
	movups %xmm0, 32(%edx)

/ pfsubr %mm1, %mm3
/ pfsubr %mm5, %mm7
// Luckily we can swap this (xmm1<->xmm2)
	subps %xmm2, %xmm1

/ pfmul %mm2, %mm3
/ pfmul %mm6, %mm7
	mulps %xmm4, %xmm1

/ pswapd %mm3, %mm3
/ pswapd %mm7, %mm7
	shufps $27, %xmm1, %xmm1

/ movq %mm3, 56(%edx)
/ movq %mm7, 48(%edx)
	movups %xmm1, 48(%edx)


/ movq 64(%ecx), %mm0
/ movq 72(%ecx), %mm4
	movups 64(%ecx), %xmm0

/ movq %mm0, %mm3
/ movq %mm4, %mm7
	movaps %xmm0, %xmm2

/ movq 88(%ecx), %mm1
/ movq 80(%ecx), %mm5
	movups 80(%ecx), %xmm1

/ pswapd %mm1, %mm1
/ pswapd %mm5, %mm5
//// shufps $177, %xmm1, %xmm1
	shufps $27, %xmm1, %xmm1

/ pfadd %mm1, %mm0
/ pfadd %mm5, %mm4
	addps %xmm1, %xmm0

/ movq %mm0, 64(%edx)
/ movq %mm4, 72(%edx)
	movups %xmm0, 64(%edx)

/ pfsub %mm1, %mm3
/ pfsub %mm5, %mm7
	subps %xmm1, %xmm2

/ pfmul %mm2, %mm3
/ pfmul %mm6, %mm7
	mulps %xmm4, %xmm2

/ pswapd %mm3, %mm3
/ pswapd %mm7, %mm7
	shufps $27, %xmm2, %xmm2

/ movq %mm3, 88(%edx)
/ movq %mm7, 80(%edx)
	movups %xmm2, 80(%edx)


/ movq 96(%ecx), %mm0
/ movq 104(%ecx), %mm4
	movups 96(%ecx), %xmm0

/ movq %mm0, %mm3
/ movq %mm4, %mm7
	movaps %xmm0, %xmm2

/ movq 120(%ecx), %mm1
/ movq 112(%ecx), %mm5
	movups 112(%ecx), %xmm1

/ pswapd %mm1, %mm1
/ pswapd %mm5, %mm5
//// shufps $177, %xmm1, %xmm1
	shufps $27, %xmm1, %xmm1

/ pfadd %mm1, %mm0
/ pfadd %mm5, %mm4
	addps %xmm1, %xmm0

/ movq %mm0, 96(%edx)
/ movq %mm4, 104(%edx)
	movups %xmm0, 96(%edx)

/ pfsubr %mm1, %mm3
/ pfsubr %mm5, %mm7
// This is already optimized, so xmm2 must be swapped with xmm1 for rest of phase
	subps %xmm2, %xmm1

/ pfmul %mm2, %mm3
/ pfmul %mm6, %mm7
	mulps %xmm4, %xmm1

/ pswapd %mm3, %mm3
/ pswapd %mm7, %mm7
	shufps $27, %xmm1, %xmm1

/ movq %mm3, 120(%edx)
/ movq %mm7, 112(%edx)
	movups %xmm1, 112(%edx)
*/
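
The commented-out SSE block above and the FPU block that follows appear to compute the same phase 4 butterfly on groups of eight floats. As a rough cross-check, here is a scalar C model of one such group. It is not part of dct64_sse.s: every function name is made up for illustration, `shufps $27` with identical source and destination is modelled as a full four-lane reversal, and the `reversed` flag stands in for the groups that use `pfsubr` (or the swapped `subps`). The model says nothing about why the SSE block is marked buggy.

```c
/* Hypothetical reference model; none of these names exist in dct64_sse.s. */

/* Scalar path, mirroring the flds/fadds/fsubs/fmuls/fstps groups.
 * reversed == 0: difference is in[i] - in[7-i]
 * reversed != 0: difference is in[7-i] - in[i] (the pfsubr groups) */
void butterfly8_scalar(const float in[8], const float cos4[4],
                       float out[8], int reversed)
{
    for (int i = 0; i < 4; i++) {
        float a = in[i];
        float b = in[7 - i];
        out[i] = a + b;
        out[7 - i] = (reversed ? b - a : a - b) * cos4[i];
    }
}

/* Models "shufps $27, %xmmN, %xmmN": reverse the four lanes in place. */
static void reverse4(float v[4])
{
    float t;
    t = v[0]; v[0] = v[3]; v[3] = t;
    t = v[1]; v[1] = v[2]; v[2] = t;
}

/* Packed path, one 4-float array per XMM register.
 * For simplicity the difference always lands in x2 here, whereas the
 * assembly keeps it in xmm1 for the reversed groups. */
void butterfly8_packed(const float in[8], const float cos4[4],
                       float out[8], int reversed)
{
    float x0[4], x1[4], x2[4];

    for (int i = 0; i < 4; i++) {
        x0[i] = in[i];       /* movups  0(in), xmm0 */
        x2[i] = in[i];       /* movaps  xmm0, xmm2  */
        x1[i] = in[4 + i];   /* movups 16(in), xmm1 */
    }
    reverse4(x1);            /* shufps $27, xmm1, xmm1 */

    for (int i = 0; i < 4; i++) {
        float d = reversed ? x1[i] - x2[i] : x2[i] - x1[i]; /* subps */
        out[i] = x0[i] + x1[i];                             /* addps + store */
        x2[i] = d * cos4[i];                                /* mulps */
    }
    reverse4(x2);            /* shufps $27 before the high-half store */

    for (int i = 0; i < 4; i++)
        out[4 + i] = x2[i];  /* movups xmm2, 16(out) */
}

/* Returns 1 when both paths produce identical output, 0 otherwise. */
int butterfly8_paths_agree(const float in[8], const float cos4[4], int reversed)
{
    float a[8], b[8];
    butterfly8_scalar(in, cos4, a, reversed);
    butterfly8_packed(in, cos4, b, reversed);
    for (int i = 0; i < 8; i++)
        if (a[i] != b[i])
            return 0;
    return 1;
}
```

Calling `butterfly8_paths_agree` with `reversed` set to 0 and then 1 covers both the `pfsub` and the `pfsubr` groups; the coefficient array plays the role of the four floats loaded from `96(%ebx)`.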
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
891 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
892 // phase 4 fpu code |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
893 /* Phase 4*/ |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
894 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
895 flds (%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
896 fadds 28(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
897 fstps (%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
898 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
899 flds (%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
900 fsubs 28(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
901 fmuls 96(%ebx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
902 fstps 28(%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
903 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
904 flds 4(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
905 fadds 24(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
906 fstps 4(%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
907 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
908 flds 4(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
909 fsubs 24(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
910 fmuls 100(%ebx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
911 fstps 24(%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
912 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
/ Pairs at bytes 8/20 and 12/16 of (%ecx): the sum goes to the lower slot of (%edx),
/ (low - high) * coefficient at 104/108(%ebx) goes to the upper slot.
	flds 8(%ecx)
	fadds 20(%ecx)
	fstps 8(%edx)

	flds 8(%ecx)
	fsubs 20(%ecx)
	fmuls 104(%ebx)
	fstps 20(%edx)

	flds 12(%ecx)
	fadds 16(%ecx)
	fstps 12(%edx)

	flds 12(%ecx)
	fsubs 16(%ecx)
	fmuls 108(%ebx)
	fstps 16(%edx)

/ Bytes 32..60 of (%ecx): sums go to the lower half of (%edx),
/ (high - low) * coefficient at 96..108(%ebx) goes to the upper half.
	flds 32(%ecx)
	fadds 60(%ecx)
	fstps 32(%edx)

	flds 60(%ecx)
	fsubs 32(%ecx)
	fmuls 96(%ebx)
	fstps 60(%edx)

	flds 36(%ecx)
	fadds 56(%ecx)
	fstps 36(%edx)

	flds 56(%ecx)
	fsubs 36(%ecx)
	fmuls 100(%ebx)
	fstps 56(%edx)

	flds 40(%ecx)
	fadds 52(%ecx)
	fstps 40(%edx)

	flds 52(%ecx)
	fsubs 40(%ecx)
	fmuls 104(%ebx)
	fstps 52(%edx)

	flds 44(%ecx)
	fadds 48(%ecx)
	fstps 44(%edx)

	flds 48(%ecx)
	fsubs 44(%ecx)
	fmuls 108(%ebx)
	fstps 48(%edx)

/ Bytes 64..92 of (%ecx): sums go to the lower half of (%edx),
/ (low - high) * coefficient at 96..108(%ebx) goes to the upper half.
	flds 64(%ecx)
	fadds 92(%ecx)
	fstps 64(%edx)

	flds 64(%ecx)
	fsubs 92(%ecx)
	fmuls 96(%ebx)
	fstps 92(%edx)

	flds 68(%ecx)
	fadds 88(%ecx)
	fstps 68(%edx)

	flds 68(%ecx)
	fsubs 88(%ecx)
	fmuls 100(%ebx)
	fstps 88(%edx)

	flds 72(%ecx)
	fadds 84(%ecx)
	fstps 72(%edx)

	flds 72(%ecx)
	fsubs 84(%ecx)
	fmuls 104(%ebx)
	fstps 84(%edx)

	flds 76(%ecx)
	fadds 80(%ecx)
	fstps 76(%edx)

	flds 76(%ecx)
	fsubs 80(%ecx)
	fmuls 108(%ebx)
	fstps 80(%edx)

/ Bytes 96..124 of (%ecx): sums go to the lower half of (%edx),
/ (high - low) * coefficient at 96..108(%ebx) goes to the upper half.
	flds 96(%ecx)
	fadds 124(%ecx)
	fstps 96(%edx)

	flds 124(%ecx)
	fsubs 96(%ecx)
	fmuls 96(%ebx)
	fstps 124(%edx)

	flds 100(%ecx)
	fadds 120(%ecx)
	fstps 100(%edx)

	flds 120(%ecx)
	fsubs 100(%ecx)
	fmuls 100(%ebx)
	fstps 120(%edx)

	flds 104(%ecx)
	fadds 116(%ecx)
	fstps 104(%edx)

	flds 116(%ecx)
	fsubs 104(%ecx)
	fmuls 104(%ebx)
	fstps 116(%edx)

	flds 108(%ecx)
	fadds 112(%ecx)
	fstps 108(%edx)

	flds 112(%ecx)
	fsubs 108(%ecx)
	fmuls 108(%ebx)
	fstps 112(%edx)

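As a reading aid, the scalar x87 butterflies on either side of this point can be sketched in C. This is a minimal sketch under stated assumptions, not code from MPlayer: the function name `butterfly_pass` and all of its parameters are hypothetical, and it assumes the buffers hold 32-bit floats, so that byte offset 96 corresponds to element 24. The alternation between (low - high) and (high - low) is read off the operand order of the `flds`/`fsubs` pairs in the listing.

```c
#include <stddef.h>

/* Hypothetical helper, not part of dct64_sse.s.
 * Runs one butterfly pass over n floats, split into blocks of `span` floats.
 * Inside a block, element k from the bottom is paired with element k from the top:
 *   out[lo] = in[lo] + in[hi]
 *   out[hi] = (difference) * coef[k]
 * Even-numbered blocks use (in[lo] - in[hi]), odd-numbered blocks (in[hi] - in[lo]). */
static void butterfly_pass(const float *in, float *out,
                           const float *coef, size_t n, size_t span)
{
    size_t block = 0;
    for (size_t base = 0; base + span <= n; base += span, ++block) {
        for (size_t k = 0; k < span / 2; ++k) {
            size_t lo = base + k;
            size_t hi = base + span - 1 - k;
            float sum  = in[lo] + in[hi];
            float diff = (block % 2 == 0) ? in[lo] - in[hi]
                                          : in[hi] - in[lo];
            out[lo] = sum;
            out[hi] = diff * coef[k];
        }
    }
}
```

Under these assumptions, the pass that reads (%ecx) and writes (%edx) would correspond to `butterfly_pass(src, dst, table + 24, 32, 8)` (coefficients at bytes 96..108), and the pass that reads (%edx) and writes (%ecx) to `butterfly_pass(dst, src, table + 28, 32, 4)` (coefficients at bytes 112 and 116), where `src`, `dst`, and `table` are placeholder names for the buffers addressed by %ecx, %edx, and %ebx.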
/ From here the roles swap: input is read from (%edx), output is written to (%ecx).
/ Groups are 16 bytes wide and use the coefficients at 112 and 116(%ebx).
/ Bytes 0..12: (low - high) * coefficient.
	flds (%edx)
	fadds 12(%edx)
	fstps (%ecx)

	flds (%edx)
	fsubs 12(%edx)
	fmuls 112(%ebx)
	fstps 12(%ecx)

	flds 4(%edx)
	fadds 8(%edx)
	fstps 4(%ecx)

	flds 4(%edx)
	fsubs 8(%edx)
	fmuls 116(%ebx)
	fstps 8(%ecx)

/ Bytes 16..28: (high - low) * coefficient.
	flds 16(%edx)
	fadds 28(%edx)
	fstps 16(%ecx)

	flds 28(%edx)
	fsubs 16(%edx)
	fmuls 112(%ebx)
	fstps 28(%ecx)

	flds 20(%edx)
	fadds 24(%edx)
	fstps 20(%ecx)

	flds 24(%edx)
	fsubs 20(%edx)
	fmuls 116(%ebx)
	fstps 24(%ecx)

/ Bytes 32..44 of (%edx): (low - high) * coefficient at 112/116(%ebx).
	flds 32(%edx)
	fadds 44(%edx)
	fstps 32(%ecx)

	flds 32(%edx)
	fsubs 44(%edx)
	fmuls 112(%ebx)
	fstps 44(%ecx)

	flds 36(%edx)
	fadds 40(%edx)
	fstps 36(%ecx)

	flds 36(%edx)
	fsubs 40(%edx)
	fmuls 116(%ebx)
	fstps 40(%ecx)

/ Bytes 48..60 of (%edx): (high - low) * coefficient at 112/116(%ebx).
	flds 48(%edx)
	fadds 60(%edx)
	fstps 48(%ecx)

	flds 60(%edx)
	fsubs 48(%edx)
	fmuls 112(%ebx)
	fstps 60(%ecx)

	flds 52(%edx)
	fadds 56(%edx)
	fstps 52(%ecx)

	flds 56(%edx)
	fsubs 52(%edx)
	fmuls 116(%ebx)
	fstps 56(%ecx)

/ Bytes 64..76 of (%edx): (low - high) * coefficient at 112/116(%ebx).
	flds 64(%edx)
	fadds 76(%edx)
	fstps 64(%ecx)

	flds 64(%edx)
	fsubs 76(%edx)
	fmuls 112(%ebx)
	fstps 76(%ecx)

	flds 68(%edx)
	fadds 72(%edx)
	fstps 68(%ecx)

	flds 68(%edx)
	fsubs 72(%edx)
	fmuls 116(%ebx)
	fstps 72(%ecx)

/ Bytes 80..92 of (%edx): (high - low) * coefficient at 112/116(%ebx).
	flds 80(%edx)
	fadds 92(%edx)
	fstps 80(%ecx)

	flds 92(%edx)
	fsubs 80(%edx)
	fmuls 112(%ebx)
	fstps 92(%ecx)

	flds 84(%edx)
	fadds 88(%edx)
	fstps 84(%ecx)

	flds 88(%edx)
	fsubs 84(%edx)
	fmuls 116(%ebx)
	fstps 88(%ecx)

/ Bytes 96..108 of (%edx): (low - high) * coefficient at 112/116(%ebx).
	flds 96(%edx)
	fadds 108(%edx)
	fstps 96(%ecx)

	flds 96(%edx)
	fsubs 108(%edx)
	fmuls 112(%ebx)
	fstps 108(%ecx)

	flds 100(%edx)
	fadds 104(%edx)
	fstps 100(%ecx)

	flds 100(%edx)
	fsubs 104(%edx)
	fmuls 116(%ebx)
	fstps 104(%ecx)

/ Bytes 112..124 of (%edx): (high - low) * coefficient at 112/116(%ebx).
	flds 112(%edx)
	fadds 124(%edx)
	fstps 112(%ecx)

	flds 124(%edx)
	fsubs 112(%edx)
	fmuls 112(%ebx)
	fstps 124(%ecx)

	flds 116(%edx)
	fadds 120(%edx)
	fstps 116(%ecx)

	flds 120(%edx)
	fsubs 116(%edx)
	fmuls 116(%ebx)
atmos4
parents:
diff
changeset
|
1181 fstps 120(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1182 |
// end of phase 4 fpu

// the code below still needs to be finished; the FPU code is used for now
/* Phase 5 (completed, crashing) */
/*
/	movq 112(%ebx), %mm2
	// move 8 byte data to (low)high quadword - check this! atmos
	movlps 112(%ebx), %xmm4
	// maybe I need movhlps too to get data into correct quadword
	movlhps %xmm4, %xmm4

/	movq (%edx), %mm0
/	movq 16(%edx), %mm4
	movups (%edx), %xmm0

/	movq %mm0, %mm3
/	movq %mm4, %mm7
	movaps %xmm0, %xmm2

	// hmm? this is strange
/	movq 8(%edx), %mm1
/	movq 24(%edx), %mm5
	movlps 8(%edx), %xmm1
	movhps 24(%edx), %xmm1

/	pswapd %mm1, %mm1
/	pswapd %mm5, %mm5
	pshufd $177, %xmm1, %xmm1

/	pfadd %mm1, %mm0
/	pfadd %mm5, %mm4
	addps %xmm1, %xmm0

/	movq %mm0, (%ecx)
/	movq %mm4, 16(%ecx)
	movlps %xmm0, (%ecx)
	movhps %xmm0, 16(%ecx)

/	pfsub %mm1, %mm3
/	pfsubr %mm5, %mm7
	// I need to emulate pfsubr here
	movaps %xmm1, %xmm3
	subps %xmm2, %xmm3
	subps %xmm1, %xmm2
	// now move the correct quadword from the reversed subtraction in xmm3 to the
	// matching quadword in xmm2, leaving the other quadword (non-reversed subtraction) untouched
///	shufpd $2, %xmm3, %xmm2
	// (or $1?) (see ia32-ref p.749)
	// optimize
	movq %xmm2, %xmm3
	movaps %xmm3, %xmm2

/	pfmul %mm2, %mm3
/	pfmul %mm2, %mm7
	mulps %xmm4, %xmm2

/	pswapd %mm3, %mm3
/	pswapd %mm7, %mm7
	shufps $177, %xmm2, %xmm2

/	movq %mm3, 8(%ecx)
/	movq %mm7, 24(%ecx)
	movlps %xmm2, 8(%ecx)
	movhps %xmm2, 24(%ecx)

/	movq 32(%edx), %mm0
/	movq 48(%edx), %mm4
	movlps 32(%edx), %xmm0
	movhps 48(%edx), %xmm0

/	movq %mm0, %mm3
/	movq %mm4, %mm7
	movaps %xmm0, %xmm2

/	movq 40(%edx), %mm1
/	movq 56(%edx), %mm5
	movlps 40(%edx), %xmm1
	movhps 56(%edx), %xmm1

/	pswapd %mm1, %mm1
/	pswapd %mm5, %mm5
	shufps $177, %xmm1, %xmm1

/	pfadd %mm1, %mm0
/	pfadd %mm5, %mm4
	addps %xmm1, %xmm0

/	movq %mm0, 32(%ecx)
/	movq %mm4, 48(%ecx)
	movlps %xmm0, 32(%ecx)
	movhps %xmm0, 48(%ecx)

/	pfsub %mm1, %mm3
/	pfsubr %mm5, %mm7
	movaps %xmm1, %xmm3
	subps %xmm2, %xmm3
	subps %xmm1, %xmm2
///	shufpd $2, %xmm3, %xmm2
	// (or $1?)
	// optimize
	movq %xmm2, %xmm3
	movaps %xmm3, %xmm2

/	pfmul %mm2, %mm3
/	pfmul %mm2, %mm7
	mulps %xmm4, %xmm2

/	pswapd %mm3, %mm3
/	pswapd %mm7, %mm7
	shufps $177, %xmm2, %xmm2

/	movq %mm3, 40(%ecx)
/	movq %mm7, 56(%ecx)
	movlps %xmm2, 40(%ecx)
	movhps %xmm2, 56(%ecx)


/	movq 64(%edx), %mm0
/	movq 80(%edx), %mm4
	movlps 64(%edx), %xmm0
	movhps 80(%edx), %xmm0

/	movq %mm0, %mm3
/	movq %mm4, %mm7
	movaps %xmm0, %xmm2

/	movq 72(%edx), %mm1
/	movq 88(%edx), %mm5
	movlps 72(%edx), %xmm1
	movhps 88(%edx), %xmm1

/	pswapd %mm1, %mm1
/	pswapd %mm5, %mm5
	shufps $177, %xmm1, %xmm1

/	pfadd %mm1, %mm0
/	pfadd %mm5, %mm4
	addps %xmm1, %xmm0

/	movq %mm0, 64(%ecx)
/	movq %mm4, 80(%ecx)
	movlps %xmm0, 64(%ecx)
	movhps %xmm0, 80(%ecx)

/	pfsub %mm1, %mm3
/	pfsubr %mm5, %mm7
	movaps %xmm1, %xmm3
	subps %xmm2, %xmm3
	subps %xmm1, %xmm2
///	shufpd $2, %xmm3, %xmm2
	// (or $1?)
	// optimize
	movq %xmm2, %xmm3
	movaps %xmm3, %xmm2

/	pfmul %mm2, %mm3
/	pfmul %mm2, %mm7
	mulps %xmm4, %xmm2

/	pswapd %mm3, %mm3
/	pswapd %mm7, %mm7
	shufps $177, %xmm2, %xmm2

/	movq %mm3, 72(%ecx)
/	movq %mm7, 88(%ecx)
	movlps %xmm2, 72(%ecx)
	movhps %xmm2, 88(%ecx)

/	movq 96(%edx), %mm0
/	movq 112(%edx), %mm4
	movups 96(%edx), %xmm0

/	movq %mm0, %mm3
/	movq %mm4, %mm7
	movaps %xmm0, %xmm2

/	movq 104(%edx), %mm1
/	movq 120(%edx), %mm5
	movlps 104(%edx), %xmm1
	movhps 120(%edx), %xmm1

/	pswapd %mm1, %mm1
/	pswapd %mm5, %mm5
	shufps $177, %xmm1, %xmm1

/	pfadd %mm1, %mm0
/	pfadd %mm5, %mm4
	addps %xmm1, %xmm0

/	movq %mm0, 96(%ecx)
/	movq %mm4, 112(%ecx)
	movups %xmm0, 96(%ecx)

/	pfsub %mm1, %mm3
/	pfsubr %mm5, %mm7
	movaps %xmm1, %xmm3
	subps %xmm2, %xmm3
	subps %xmm1, %xmm2
///	shufpd $2, %xmm3, %xmm2
	// (or $1?)
	// optimize
	movq %xmm2, %xmm3
	movaps %xmm3, %xmm2

/	pfmul %mm2, %mm3
/	pfmul %mm2, %mm7
	mulps %xmm4, %xmm2

/	pswapd %mm3, %mm3
/	pswapd %mm7, %mm7
	shufps $177, %xmm2, %xmm2

/	movq %mm3, 104(%ecx)
/	movq %mm7, 120(%ecx)
	movlps %xmm2, 104(%ecx)
	movhps %xmm2, 120(%ecx)
*/


/* Phase 6. This is the end of the easy road. */
/* The code below is scalar and should be optimized. */
//
// movd plus_1f, %mm6
// punpckldq 120(%ebx), %mm6 /* mm6 = 1.0 | 120(%ebx) */
// movq x_plus_minus_3dnow, %mm7 /* mm7 = +1 | -1 */
/*
	movq 32(%ecx), %mm0
	movq 64(%ecx), %mm2
	movq %mm0, %mm1
	movq %mm2, %mm3
	pxor %mm7, %mm1
	pxor %mm7, %mm3
	pfacc %mm1, %mm0
	pfacc %mm3, %mm2
	pfmul %mm6, %mm0
	pfmul %mm6, %mm2
	movq %mm0, 32(%edx)
	movq %mm2, 64(%edx)

	movd 44(%ecx), %mm0
	movd 40(%ecx), %mm2
	movd 120(%ebx), %mm3
	punpckldq 76(%ecx), %mm0
	punpckldq 72(%ecx), %mm2
	punpckldq %mm3, %mm3
	movq %mm0, %mm4
	movq %mm2, %mm5
	pfsub %mm2, %mm0
	pfmul %mm3, %mm0
	movq %mm0, %mm1
	pfadd %mm5, %mm0
	pfadd %mm4, %mm0
	movq %mm0, %mm2
	punpckldq %mm1, %mm0
	punpckhdq %mm1, %mm2
	movq %mm0, 40(%edx)
	movq %mm2, 72(%edx)

	movd 48(%ecx), %mm3
	movd 60(%ecx), %mm2
	pfsub 52(%ecx), %mm3
	pfsub 56(%ecx), %mm2
	pfmul 120(%ebx), %mm3
	pfmul 120(%ebx), %mm2
	movq %mm2, %mm1

	pfadd 56(%ecx), %mm1
	pfadd 60(%ecx), %mm1
	movq %mm1, %mm0

	pfadd 48(%ecx), %mm0
	pfadd 52(%ecx), %mm0
	pfadd %mm3, %mm1
	punpckldq %mm2, %mm1
	pfadd %mm3, %mm2
	punpckldq %mm2, %mm0
	movq %mm1, 56(%edx)
	movq %mm0, 48(%edx)
*/
/*---*/
/*
	movd 92(%ecx), %mm1
	pfsub 88(%ecx), %mm1
	pfmul 120(%ebx), %mm1
	movd %mm1, 92(%edx)
	pfadd 92(%ecx), %mm1
	pfadd 88(%ecx), %mm1
	movq %mm1, %mm0

	pfadd 80(%ecx), %mm0
	pfadd 84(%ecx), %mm0
	movd %mm0, 80(%edx)

	movd 80(%ecx), %mm0
	pfsub 84(%ecx), %mm0
	pfmul 120(%ebx), %mm0
	pfadd %mm0, %mm1
	pfadd 92(%edx), %mm0
	punpckldq %mm1, %mm0
	movq %mm0, 84(%edx)

	movq 96(%ecx), %mm0
	movq %mm0, %mm1
	pxor %mm7, %mm1
	pfacc %mm1, %mm0
	pfmul %mm6, %mm0
	movq %mm0, 96(%edx)

	movd 108(%ecx), %mm0
	pfsub 104(%ecx), %mm0
	pfmul 120(%ebx), %mm0
	movd %mm0, 108(%edx)
	pfadd 104(%ecx), %mm0
	pfadd 108(%ecx), %mm0
	movd %mm0, 104(%edx)

	movd 124(%ecx), %mm1
	pfsub 120(%ecx), %mm1
	pfmul 120(%ebx), %mm1
	movd %mm1, 124(%edx)
	pfadd 120(%ecx), %mm1
	pfadd 124(%ecx), %mm1
	movq %mm1, %mm0

	pfadd 112(%ecx), %mm0
	pfadd 116(%ecx), %mm0
	movd %mm0, 112(%edx)

	movd 112(%ecx), %mm0
	pfsub 116(%ecx), %mm0
	pfmul 120(%ebx), %mm0
	pfadd %mm0, %mm1
	pfadd 124(%edx), %mm0
	punpckldq %mm1, %mm0
	movq %mm0, 116(%edx)

	jnz .L01
*/

/* Phase 7 */
/* The code below is scalar and should be optimized. */
/*
	movd (%ecx), %mm0
	pfadd 4(%ecx), %mm0
	movd %mm0, 1024(%esi)

	movd (%ecx), %mm0
	pfsub 4(%ecx), %mm0
	pfmul 120(%ebx), %mm0
	movd %mm0, (%esi)
	movd %mm0, (%edi)

	movd 12(%ecx), %mm0
	pfsub 8(%ecx), %mm0
	pfmul 120(%ebx), %mm0
	movd %mm0, 512(%edi)
	pfadd 12(%ecx), %mm0
	pfadd 8(%ecx), %mm0
	movd %mm0, 512(%esi)

	movd 16(%ecx), %mm0
	pfsub 20(%ecx), %mm0
	pfmul 120(%ebx), %mm0
	movq %mm0, %mm3

	movd 28(%ecx), %mm0
	pfsub 24(%ecx), %mm0
	pfmul 120(%ebx), %mm0
	movd %mm0, 768(%edi)
	movq %mm0, %mm2

	pfadd 24(%ecx), %mm0
	pfadd 28(%ecx), %mm0
	movq %mm0, %mm1

	pfadd 16(%ecx), %mm0
	pfadd 20(%ecx), %mm0
	movd %mm0, 768(%esi)
	pfadd %mm3, %mm1
	movd %mm1, 256(%esi)
	pfadd %mm3, %mm2
	movd %mm2, 256(%edi)
*/

/* Phase 8 */
/*
	movq 32(%edx), %mm0
	movq 48(%edx), %mm1
	pfadd 48(%edx), %mm0
	pfadd 40(%edx), %mm1
	movd %mm0, 896(%esi)
	movd %mm1, 640(%esi)
	psrlq $32, %mm0
	psrlq $32, %mm1
	movd %mm0, 128(%edi)
	movd %mm1, 384(%edi)

	movd 40(%edx), %mm0
	pfadd 56(%edx), %mm0
	movd %mm0, 384(%esi)

	movd 56(%edx), %mm0
	pfadd 36(%edx), %mm0
	movd %mm0, 128(%esi)

	movd 60(%edx), %mm0
	movd %mm0, 896(%edi)
	pfadd 44(%edx), %mm0
	movd %mm0, 640(%edi)

	movq 96(%edx), %mm0
	movq 112(%edx), %mm2
	movq 104(%edx), %mm4
	pfadd 112(%edx), %mm0
	pfadd 104(%edx), %mm2
	pfadd 120(%edx), %mm4
	movq %mm0, %mm1
	movq %mm2, %mm3
	movq %mm4, %mm5
	pfadd 64(%edx), %mm0
	pfadd 80(%edx), %mm2
	pfadd 72(%edx), %mm4
	movd %mm0, 960(%esi)
	movd %mm2, 704(%esi)
	movd %mm4, 448(%esi)
	psrlq $32, %mm0
	psrlq $32, %mm2
	psrlq $32, %mm4
	movd %mm0, 64(%edi)
	movd %mm2, 320(%edi)
	movd %mm4, 576(%edi)
	pfadd 80(%edx), %mm1
	pfadd 72(%edx), %mm3
	pfadd 88(%edx), %mm5
	movd %mm1, 832(%esi)
	movd %mm3, 576(%esi)
	movd %mm5, 320(%esi)
	psrlq $32, %mm1
	psrlq $32, %mm3
	psrlq $32, %mm5
	movd %mm1, 192(%edi)
	movd %mm3, 448(%edi)
	movd %mm5, 704(%edi)

d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1629 movd 120(%edx), %mm0 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1630 pfadd 100(%edx), %mm0 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1631 movq %mm0, %mm1 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1632 pfadd 88(%edx), %mm0 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1633 movd %mm0, 192(%esi) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1634 pfadd 68(%edx), %mm1 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1635 movd %mm1, 64(%esi) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1636 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1637 movd 124(%edx), %mm0 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1638 movd %mm0, 960(%edi) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1639 pfadd 92(%edx), %mm0 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1640 movd %mm0, 832(%edi) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1641 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1642 jmp .L_bye |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1643 .L01: |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1644 */ |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1645 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1646 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1647 /* Phase 9*/ |
/*
	movq (%ecx), %mm0
	movq %mm0, %mm1
	pxor %mm7, %mm1
	pfacc %mm1, %mm0
	pfmul %mm6, %mm0
	pf2id %mm0, %mm0
	movd %mm0, %eax
	movw %ax, 512(%esi)
	psrlq $32, %mm0
	movd %mm0, %eax
	movw %ax, (%esi)

	movd 12(%ecx), %mm0
	pfsub 8(%ecx), %mm0
	pfmul 120(%ebx), %mm0
	pf2id %mm0, %mm7
	movd %mm7, %eax
	movw %ax, 256(%edi)
	pfadd 12(%ecx), %mm0
	pfadd 8(%ecx), %mm0
	pf2id %mm0, %mm0
	movd %mm0, %eax
	movw %ax, 256(%esi)

	movd 16(%ecx), %mm3
	pfsub 20(%ecx), %mm3
	pfmul 120(%ebx), %mm3
	movq %mm3, %mm2

	movd 28(%ecx), %mm2
	pfsub 24(%ecx), %mm2
	pfmul 120(%ebx), %mm2
	movq %mm2, %mm1

	pf2id %mm2, %mm7
	movd %mm7, %eax
	movw %ax, 384(%edi)

	pfadd 24(%ecx), %mm1
	pfadd 28(%ecx), %mm1
	movq %mm1, %mm0

	pfadd 16(%ecx), %mm0
	pfadd 20(%ecx), %mm0
	pf2id %mm0, %mm0
	movd %mm0, %eax
	movw %ax, 384(%esi)
	pfadd %mm3, %mm1
	pf2id %mm1, %mm1
	movd %mm1, %eax
	movw %ax, 128(%esi)
	pfadd %mm3, %mm2
	pf2id %mm2, %mm2
	movd %mm2, %eax
	movw %ax, 128(%edi)
*/


/* Phase 10*/
/*
	movq 32(%edx), %mm0
	movq 48(%edx), %mm1
	pfadd 48(%edx), %mm0
	pfadd 40(%edx), %mm1
	pf2id %mm0, %mm0
	pf2id %mm1, %mm1
	movd %mm0, %eax
	movd %mm1, %ecx
	movw %ax, 448(%esi)
	movw %cx, 320(%esi)
	psrlq $32, %mm0
	psrlq $32, %mm1
	movd %mm0, %eax
	movd %mm1, %ecx
	movw %ax, 64(%edi)
	movw %cx, 192(%edi)

	movd 40(%edx), %mm3
	movd 56(%edx), %mm4
	movd 60(%edx), %mm0
	movd 44(%edx), %mm2
	movd 120(%edx), %mm5
	punpckldq %mm4, %mm3
	punpckldq 124(%edx), %mm0
	pfadd 100(%edx), %mm5
	punpckldq 36(%edx), %mm4
	punpckldq 92(%edx), %mm2
	movq %mm5, %mm6
	pfadd %mm4, %mm3
	pf2id %mm0, %mm1
	pf2id %mm3, %mm3
	pfadd 88(%edx), %mm5
	movd %mm1, %eax
	movd %mm3, %ecx
	movw %ax, 448(%edi)
	movw %cx, 192(%esi)
	pf2id %mm5, %mm5
	psrlq $32, %mm1
	psrlq $32, %mm3
	movd %mm5, %ebx
	movd %mm1, %eax
	movd %mm3, %ecx
	movw %bx, 96(%esi)
	movw %ax, 480(%edi)
	movw %cx, 64(%esi)
	pfadd %mm2, %mm0
	pf2id %mm0, %mm0
	movd %mm0, %eax
	pfadd 68(%edx), %mm6
	movw %ax, 320(%edi)
	psrlq $32, %mm0
	pf2id %mm6, %mm6
	movd %mm0, %eax
	movd %mm6, %ebx
	movw %ax, 416(%edi)
	movw %bx, 32(%esi)

	movq 96(%edx), %mm0
	movq 112(%edx), %mm2
	movq 104(%edx), %mm4
	pfadd %mm2, %mm0
	pfadd %mm4, %mm2
	pfadd 120(%edx), %mm4
	movq %mm0, %mm1
	movq %mm2, %mm3
	movq %mm4, %mm5
	pfadd 64(%edx), %mm0
	pfadd 80(%edx), %mm2
	pfadd 72(%edx), %mm4
	pf2id %mm0, %mm0
	pf2id %mm2, %mm2
	pf2id %mm4, %mm4
	movd %mm0, %eax
	movd %mm2, %ecx
	movd %mm4, %ebx
	movw %ax, 480(%esi)
	movw %cx, 352(%esi)
	movw %bx, 224(%esi)
	psrlq $32, %mm0
	psrlq $32, %mm2
	psrlq $32, %mm4
	movd %mm0, %eax
	movd %mm2, %ecx
	movd %mm4, %ebx
	movw %ax, 32(%edi)
	movw %cx, 160(%edi)
	movw %bx, 288(%edi)
	pfadd 80(%edx), %mm1
	pfadd 72(%edx), %mm3
	pfadd 88(%edx), %mm5
	pf2id %mm1, %mm1
	pf2id %mm3, %mm3
	pf2id %mm5, %mm5
	movd %mm1, %eax
	movd %mm3, %ecx
	movd %mm5, %ebx
	movw %ax, 416(%esi)
	movw %cx, 288(%esi)
	movw %bx, 160(%esi)
	psrlq $32, %mm1
	psrlq $32, %mm3
	psrlq $32, %mm5
	movd %mm1, %eax
	movd %mm3, %ecx
	movd %mm5, %ebx
	movw %ax, 96(%edi)
	movw %cx, 224(%edi)
	movw %bx, 352(%edi)

	movsw

.L_bye:
	addl $256,%esp
/ femms
	emms
	popl %edi
	popl %esi
	popl %ebx
	ret $12
*/

// here comes old fashioned FPU code for the tough parts

/* Phase 5*/
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1833 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1834 flds 32(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1835 fadds 36(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1836 fstps 32(%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1837 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1838 flds 32(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1839 fsubs 36(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1840 fmuls 120(%ebx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1841 fstps 36(%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1842 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1843 flds 44(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1844 fsubs 40(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1845 fmuls 120(%ebx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1846 fsts 44(%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1847 fadds 40(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1848 fadds 44(%ecx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1849 fstps 40(%edx) |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
1850 |
d9e3f91d6da9
First development version of dct64, mixed with 3dnow/k7 and fpu code.
atmos4
parents:
diff
changeset
|
	/* a = (48(%ecx) - 52(%ecx)) * 120(%ebx) */
	/* b = (60(%ecx) - 56(%ecx)) * 120(%ebx) */
	/* s = b + 56(%ecx) + 60(%ecx) */
	/* 48(%edx) = s + 48(%ecx) + 52(%ecx); 56(%edx) = s + a */
	/* 60(%edx) = b; 52(%edx) = a + b */
	flds 48(%ecx)
	fsubs 52(%ecx)
	fmuls 120(%ebx)

	flds 60(%ecx)
	fsubs 56(%ecx)
	fmuls 120(%ebx)
	fld %st(0)
	fadds 56(%ecx)
	fadds 60(%ecx)
	fld %st(0)
	fadds 48(%ecx)
	fadds 52(%ecx)
	fstps 48(%edx)
	fadd %st(2)
	fstps 56(%edx)
	fsts 60(%edx)
	faddp %st(1)
	fstps 52(%edx)

	/* 64(%edx) = 64(%ecx) + 68(%ecx) */
	flds 64(%ecx)
	fadds 68(%ecx)
	fstps 64(%edx)

	/* 68(%edx) = (64(%ecx) - 68(%ecx)) * 120(%ebx) */
	flds 64(%ecx)
	fsubs 68(%ecx)
	fmuls 120(%ebx)
	fstps 68(%edx)

	/* t = (76(%ecx) - 72(%ecx)) * 120(%ebx) */
	/* 76(%edx) = t; 72(%edx) = t + 72(%ecx) + 76(%ecx) */
	flds 76(%ecx)
	fsubs 72(%ecx)
	fmuls 120(%ebx)
	fsts 76(%edx)
	fadds 72(%ecx)
	fadds 76(%ecx)
	fstps 72(%edx)

	/* t = (92(%ecx) - 88(%ecx)) * 120(%ebx); 92(%edx) = t */
	/* u = t + 92(%ecx) + 88(%ecx); 80(%edx) = u + 80(%ecx) + 84(%ecx) */
	/* u stays on the FPU stack for the next group */
	flds 92(%ecx)
	fsubs 88(%ecx)
	fmuls 120(%ebx)
	fsts 92(%edx)
	fadds 92(%ecx)
	fadds 88(%ecx)
	fld %st(0)
	fadds 80(%ecx)
	fadds 84(%ecx)
	fstps 80(%edx)

	/* v = (80(%ecx) - 84(%ecx)) * 120(%ebx) */
	/* 84(%edx) = v + 92(%edx); 88(%edx) = u + v */
	flds 80(%ecx)
	fsubs 84(%ecx)
	fmuls 120(%ebx)
	fadd %st(0),%st(1)
	fadds 92(%edx)
	fstps 84(%edx)
	fstps 88(%edx)

	/* 96(%edx) = 96(%ecx) + 100(%ecx) */
	flds 96(%ecx)
	fadds 100(%ecx)
	fstps 96(%edx)

	/* 100(%edx) = (96(%ecx) - 100(%ecx)) * 120(%ebx) */
	flds 96(%ecx)
	fsubs 100(%ecx)
	fmuls 120(%ebx)
	fstps 100(%edx)

	/* t = (108(%ecx) - 104(%ecx)) * 120(%ebx) */
	/* 108(%edx) = t; 104(%edx) = t + 104(%ecx) + 108(%ecx) */
	flds 108(%ecx)
	fsubs 104(%ecx)
	fmuls 120(%ebx)
	fsts 108(%edx)
	fadds 104(%ecx)
	fadds 108(%ecx)
	fstps 104(%edx)

	/* t = (124(%ecx) - 120(%ecx)) * 120(%ebx); 124(%edx) = t */
	/* u = t + 120(%ecx) + 124(%ecx); 112(%edx) = u + 112(%ecx) + 116(%ecx) */
	/* u stays on the FPU stack for the next group */
	flds 124(%ecx)
	fsubs 120(%ecx)
	fmuls 120(%ebx)
	fsts 124(%edx)
	fadds 120(%ecx)
	fadds 124(%ecx)
	fld %st(0)
	fadds 112(%ecx)
	fadds 116(%ecx)
	fstps 112(%edx)

	/* v = (112(%ecx) - 116(%ecx)) * 120(%ebx) */
	/* 116(%edx) = v + 124(%edx); 120(%edx) = u + v */
	flds 112(%ecx)
	fsubs 116(%ecx)
	fmuls 120(%ebx)
	fadd %st(0),%st(1)
	fadds 124(%edx)
	fstps 116(%edx)
	fstps 120(%edx)
	/* The x87 instructions above leave EFLAGS untouched, so this branch */
	/* tests flags set earlier in the routine (outside this excerpt). */
	jnz .L01

/* Phase 6 */

	/* 1024(%esi) = (%ecx) + 4(%ecx) */
	flds (%ecx)
	fadds 4(%ecx)
	fstps 1024(%esi)

	/* t = ((%ecx) - 4(%ecx)) * 120(%ebx); (%esi) = t; (%edi) = t */
	flds (%ecx)
	fsubs 4(%ecx)
	fmuls 120(%ebx)
	fsts (%esi)
	fstps (%edi)

	/* t = (12(%ecx) - 8(%ecx)) * 120(%ebx) */
	/* 512(%edi) = t; 512(%esi) = t + 12(%ecx) + 8(%ecx) */
	flds 12(%ecx)
	fsubs 8(%ecx)
	fmuls 120(%ebx)
	fsts 512(%edi)
	fadds 12(%ecx)
	fadds 8(%ecx)
	fstps 512(%esi)

	/* a = (16(%ecx) - 20(%ecx)) * 120(%ebx) */
	/* b = (28(%ecx) - 24(%ecx)) * 120(%ebx); 768(%edi) = b */
	/* s = b + 24(%ecx) + 28(%ecx) */
	/* 768(%esi) = s + 16(%ecx) + 20(%ecx); 256(%esi) = s + a */
	/* 256(%edi) = a + b */
	flds 16(%ecx)
	fsubs 20(%ecx)
	fmuls 120(%ebx)

	flds 28(%ecx)
	fsubs 24(%ecx)
	fmuls 120(%ebx)
	fsts 768(%edi)
	fld %st(0)
	fadds 24(%ecx)
	fadds 28(%ecx)
	fld %st(0)
	fadds 16(%ecx)
	fadds 20(%ecx)
	fstps 768(%esi)
	fadd %st(2)
	fstps 256(%esi)
	faddp %st(1)
	fstps 256(%edi)

/* Phase 7 */

	/* Each group below stores the sum of two %edx words to %esi or %edi. */
	flds 32(%edx)
	fadds 48(%edx)
	fstps 896(%esi)

	flds 48(%edx)
	fadds 40(%edx)
	fstps 640(%esi)

	flds 40(%edx)
	fadds 56(%edx)
	fstps 384(%esi)

	flds 56(%edx)
	fadds 36(%edx)
	fstps 128(%esi)

	flds 36(%edx)
	fadds 52(%edx)
	fstps 128(%edi)

	flds 52(%edx)
	fadds 44(%edx)
	fstps 384(%edi)

	/* 896(%edi) = 60(%edx); 640(%edi) = 60(%edx) + 44(%edx) */
	flds 60(%edx)
	fsts 896(%edi)
	fadds 44(%edx)
	fstps 640(%edi)

	/* Each group: p = sum of two %edx words, duplicated on the FPU stack; */
	/* each copy gets one more %edx word added before being stored to %esi. */
	/* p = 96(%edx) + 112(%edx); 960(%esi) = p + 64(%edx); 832(%esi) = p + 80(%edx) */
	flds 96(%edx)
	fadds 112(%edx)
	fld %st(0)
	fadds 64(%edx)
	fstps 960(%esi)
	fadds 80(%edx)
	fstps 832(%esi)

	/* p = 112(%edx) + 104(%edx); 704(%esi) = p + 80(%edx); 576(%esi) = p + 72(%edx) */
	flds 112(%edx)
	fadds 104(%edx)
	fld %st(0)
	fadds 80(%edx)
	fstps 704(%esi)
	fadds 72(%edx)
	fstps 576(%esi)

	/* p = 104(%edx) + 120(%edx); 448(%esi) = p + 72(%edx); 320(%esi) = p + 88(%edx) */
	flds 104(%edx)
	fadds 120(%edx)
	fld %st(0)
	fadds 72(%edx)
	fstps 448(%esi)
	fadds 88(%edx)
	fstps 320(%esi)

	/* p = 120(%edx) + 100(%edx); 192(%esi) = p + 88(%edx); 64(%esi) = p + 68(%edx) */
	flds 120(%edx)
	fadds 100(%edx)
	fld %st(0)
	fadds 88(%edx)
	fstps 192(%esi)
	fadds 68(%edx)
	fstps 64(%esi)

	/* p = 100(%edx) + 116(%edx); 64(%edi) = p + 68(%edx); 192(%edi) = p + 84(%edx) */
	flds 100(%edx)
	fadds 116(%edx)
	fld %st(0)
	fadds 68(%edx)
	fstps 64(%edi)
	fadds 84(%edx)
	fstps 192(%edi)

	/* p = 116(%edx) + 108(%edx); 320(%edi) = p + 84(%edx); 448(%edi) = p + 76(%edx) */
	flds 116(%edx)
	fadds 108(%edx)
	fld %st(0)
	fadds 84(%edx)
	fstps 320(%edi)
	fadds 76(%edx)
	fstps 448(%edi)

	/* p = 108(%edx) + 124(%edx); 576(%edi) = p + 76(%edx); 704(%edi) = p + 92(%edx) */
	flds 108(%edx)
	fadds 124(%edx)
	fld %st(0)
	fadds 76(%edx)
	fstps 576(%edi)
	fadds 92(%edx)
	fstps 704(%edi)

	/* 960(%edi) = 124(%edx); 832(%edi) = 124(%edx) + 92(%edx) */
	flds 124(%edx)
	fsts 960(%edi)
	fadds 92(%edx)
	fstps 832(%edi)
	/* Release 256 bytes of stack, restore the saved registers, and return. */
	addl $256,%esp
	popl %edi
	popl %esi
	popl %ebx
	ret
.L01:
/* Phase 8 */

	flds (%ecx)
	fadds 4(%ecx)
	fistp 512(%esi)

	flds (%ecx)
	fsubs 4(%ecx)
	fmuls 120(%ebx)

	fistp (%esi)


	flds 12(%ecx)
	fsubs 8(%ecx)
	fmuls 120(%ebx)
	fist 256(%edi)
	fadds 12(%ecx)
	fadds 8(%ecx)
	fistp 256(%esi)

	flds 16(%ecx)
	fsubs 20(%ecx)
	fmuls 120(%ebx)

	flds 28(%ecx)
	fsubs 24(%ecx)
	fmuls 120(%ebx)
	fist 384(%edi)
	fld %st(0)
	fadds 24(%ecx)
	fadds 28(%ecx)
	fld %st(0)
	fadds 16(%ecx)
	fadds 20(%ecx)
	fistp 384(%esi)
	fadd %st(2)
	fistp 128(%esi)
	faddp %st(1)
	fistp 128(%edi)

/* Phase 9 */

	flds 32(%edx)
	fadds 48(%edx)
	fistp 448(%esi)

	flds 48(%edx)
	fadds 40(%edx)
	fistp 320(%esi)

	flds 40(%edx)
	fadds 56(%edx)
	fistp 192(%esi)

	flds 56(%edx)
	fadds 36(%edx)
	fistp 64(%esi)

	flds 36(%edx)
	fadds 52(%edx)
	fistp 64(%edi)

	flds 52(%edx)
	fadds 44(%edx)
	fistp 192(%edi)

	flds 60(%edx)
	fist 448(%edi)
	fadds 44(%edx)
	fistp 320(%edi)

	flds 96(%edx)
	fadds 112(%edx)
	fld %st(0)
	fadds 64(%edx)
	fistp 480(%esi)
	fadds 80(%edx)
	fistp 416(%esi)

	flds 112(%edx)
	fadds 104(%edx)
	fld %st(0)
	fadds 80(%edx)
	fistp 352(%esi)
	fadds 72(%edx)
	fistp 288(%esi)

	flds 104(%edx)
	fadds 120(%edx)
	fld %st(0)
	fadds 72(%edx)
	fistp 224(%esi)
	fadds 88(%edx)
	fistp 160(%esi)

	flds 120(%edx)
	fadds 100(%edx)
	fld %st(0)
	fadds 88(%edx)
	fistp 96(%esi)
	fadds 68(%edx)
	fistp 32(%esi)

	flds 100(%edx)
	fadds 116(%edx)
	fld %st(0)
	fadds 68(%edx)
	fistp 32(%edi)
	fadds 84(%edx)
	fistp 96(%edi)

	flds 116(%edx)
	fadds 108(%edx)
	fld %st(0)
	fadds 84(%edx)
	fistp 160(%edi)
	fadds 76(%edx)
	fistp 224(%edi)

	flds 108(%edx)
	fadds 124(%edx)
	fld %st(0)
	fadds 76(%edx)
	fistp 288(%edi)
	fadds 92(%edx)
	fistp 352(%edi)

	flds 124(%edx)
	fist 480(%edi)
	fadds 92(%edx)
	fistp 416(%edi)
	movsw			/* copy one 16-bit word from (%esi) to (%edi) */
	addl $256,%esp		/* release the 256-byte stack scratch area */
	popl %edi		/* restore the registers pushed on entry */
	popl %esi
	popl %ebx
	ret $12			/* return and pop 12 bytes of arguments */

// end of FPU stuff