annotate DOCS/tech/colorspaces.txt @ 18880:7f57e6af511e

cosmetics: move md5sum check down so it can depend on libavutil later
author ivo
date Sun, 02 Jul 2006 19:50:52 +0000
parents 786628f8db88
children f01023c524c3
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
1 In general
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
2 ==========
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
3
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
4 There are planar and packed modes.
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
5 - Planar mode means: You have 3 separate images, one for each component,
5735
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
6 each image 8 bits/pixel. To get the real colored pixel, you have to
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
7 mix the components from all planes. The resolution of planes may differ!
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
8 - Packed mode means: you have all components mixed/interleaved together,
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
9 so you have small "packs" of components in a single, big image.
5312
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
10
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
11 There are RGB and YUV colorspaces.
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
12 - RGB: Red, Green and Blue components. Used by analog VGA monitors.
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
13 - YUV: Luminance (Y) and Chrominance (U,V) components. Used by some
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
14 video systems, like PAL. Also most M(J)PEG/DCT based codecs use this.
5312
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
15
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
16 With YUV, they used to reduce the resolution of U,V planes:
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
17 The most common YUV formats:
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
18 FOURCC: bpp: IEEE: plane sizes: (w=width h=height of original image)
7833
arpi
parents: 5735
diff changeset
19 444P 24 YUV 4:4:4 Y: w * h U,V: w * h
arpi
parents: 5735
diff changeset
20 YUY2,UYVY 16 YUV 4:2:2 Y: w * h U,V: (w/2) * h [MJPEG]
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
21 YV12,I420 12 YUV 4:2:0 Y: w * h U,V: (w/2) * (h/2) [MPEG, H.263]
7833
arpi
parents: 5735
diff changeset
22 411P 12 YUV 4:1:1 Y: w * h U,V: (w/4) * h [DV-NTSC, CYUV]
arpi
parents: 5735
diff changeset
23 YVU9,IF09 9 YUV 4:1:0 Y: w * h U,V: (w/4) * (h/4) [Sorenson, Indeo]
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
24
8814
9e4ef4ee2a06 YUV a:b:c explained
arpi
parents: 7833
diff changeset
25 The YUV a:b:c naming style means: for <a> samples of Y there are <b> samples
9e4ef4ee2a06 YUV a:b:c explained
arpi
parents: 7833
diff changeset
26 of UV in odd lines and <c> samples of UV in even lines.
9e4ef4ee2a06 YUV a:b:c explained
arpi
parents: 7833
diff changeset
27
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
28 conversion: (some cut'n'paste from www and maillist)
5312
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
29
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
30 RGB to YUV Conversion:
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
31 Y = (0.257 * R) + (0.504 * G) + (0.098 * B) + 16
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
32 Cr = V = (0.439 * R) - (0.368 * G) - (0.071 * B) + 128
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
33 Cb = U = -(0.148 * R) - (0.291 * G) + (0.439 * B) + 128
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
34 YUV to RGB Conversion:
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
35 B = 1.164(Y - 16) + 2.018(U - 128)
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
36 G = 1.164(Y - 16) - 0.813(V - 128) - 0.391(U - 128)
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
37 R = 1.164(Y - 16) + 1.596(V - 128)
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
38
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
39 In both these cases, you have to clamp the output values to keep them in
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
40 the [0-255] range. Rumour has it that the valid range is actually a subset
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
41 of [0-255] (I've seen an RGB range of [16-235] mentioned) but clamping the
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
42 values into [0-255] seems to produce acceptable results to me.
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
43
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
44 Julien (sorry, I can't recall his surname) suggests that there are
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
45 problems with the above formula and proposes the following instead:
5735
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
46 Y = 0.299R + 0.587G + 0.114B
5312
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
47 Cb = U'= (B-Y)*0.565
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
48 Cr = V'= (R-Y)*0.713
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
49 with reciprocal versions:
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
50 R = Y + 1.403V'
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
51 G = Y - 0.344U' - 0.714V'
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
52 B = Y + 1.770U'
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
53 Note: This formula doesn't contain the +128 offsets of U,V values!
5312
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
54
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
55 Conclusion:
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
56 Y = luminance, the weighted average of R G B components. (0=black 255=white)
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
57 U = Cb = blue component (0=green 128=grey 255=blue)
5314
ba5c92e64c5d fixes, extended by mplayer's planes[]
arpi
parents: 5312
diff changeset
58 V = Cr = red component (0=green 128=grey 255=red)
5312
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
59
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
60
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
61 Huh. The planar YUV modes.
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
62 ==========================
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
63
5735
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
64 The most misunderstood thingie...
5586
6ce9c6231bdd updated
arpi
parents: 5322
diff changeset
65
5312
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
66 In MPlayer, we usually have 3 pointers to the Y, U and V planes, so it
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
67 doesn't matter what the order of the planes in the memory is:
5314
ba5c92e64c5d fixes, extended by mplayer's planes[]
arpi
parents: 5312
diff changeset
68 for mp_image_t and libvo's draw_slice():
ba5c92e64c5d fixes, extended by mplayer's planes[]
arpi
parents: 5312
diff changeset
69 planes[0] = Y = luminance
ba5c92e64c5d fixes, extended by mplayer's planes[]
arpi
parents: 5312
diff changeset
70 planes[1] = U = Cb = blue
ba5c92e64c5d fixes, extended by mplayer's planes[]
arpi
parents: 5312
diff changeset
71 planes[2] = V = Cr = red
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
72 Note: planes[1] is ALWAYS U, and planes[2] is V, the FOURCC
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
73 (YV12 vs. I420) doesn't matter here! So, every codec using 3 pointers
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
74 (not only the first one) normally supports YV12 and I420 (=IYUV), too!
5314
ba5c92e64c5d fixes, extended by mplayer's planes[]
arpi
parents: 5312
diff changeset
75
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
76 But there are some codecs (VfW, dshow) and vo drivers (xv) ignoring the 2nd
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
77 and 3rd pointer that use only a single pointer to the planar YUV image. In
5312
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
78 this case we must know the right order and alignment of planes in the memory!
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
79
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
80 from the webartz fourcc list:
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
81 YV12: 12 bpp, full sized Y plane followed by 2x2 subsampled V and U planes
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
82 I420: 12 bpp, full sized Y plane followed by 2x2 subsampled U and V planes
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
83 IYUV: the same as I420
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
84 YVU9: 9 bpp, full sized Y plane followed by 4x4 subsampled V and U planes
211c0f1ec065 soem explanation
arpi
parents:
diff changeset
85
5587
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
86 Huh 2. RGB vs. BGR ?
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
87 ====================
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
88
5735
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
89 The 2nd most misunderstood thingie...
5587
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
90
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
91 You know, there are Intel and Motorola, and they use different byteorder.
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
92 There are also others, like MIPS or Alpha, but all follow either Intel
5587
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
93 or Motorola byteorder.
5735
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
94 Unfortunately, the packed colorspaces depend on CPU byteorder. So, RGB
5587
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
95 on Intel and Motorola means different order of bytes.
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
96
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
97 In MPlayer, we have constants IMGFMT_RGBxx and IMGFMT_BGRxx.
5735
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
98 Unfortunately, some codecs and vo drivers follow Intel, some follow Motorola
5587
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
99 byteorder, so they are incompatible. We had to find a stable base, so long
5735
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
100 time ago I've chosen OpenGL, as it's a wide-spreaded standard, and it well
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
101 defines what RGB is and what BGR is. So, MPlayer's RGB is compatible with
cebe55b15cd4 some typos
alex
parents: 5587
diff changeset
102 OpenGL's GL_RGB on all platforms, and the same goes for BGR - GL_BGR.
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
103 Unfortunately, most of the x86 codecs call our BGR RGB, so it sometimes
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
104 confuses developers.
5587
441c98167877 RGBvsBGR added
arpi
parents: 5586
diff changeset
105
12997
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
106 memory order: name
13032
2ba5eb44eabd maybe more understandable?
michael
parents: 12997
diff changeset
107 lowest address .. highest address
12997
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
108 RGBA IMGFMT_RGBA
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
109 ARGB IMGFMT_ARGB
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
110 BGRA IMGFMT_BGRA
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
111 ABGR IMGFMT_ABGR
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
112 RGB IMGFMT_RGB24
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
113 BGR IMGFMT_BGR24
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
114
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
115 order in an int name
13535
0a02fb78c3e1 Typo, pointed out by lu_zero
rtognimp
parents: 13032
diff changeset
116 most significant .. least significant bit
12997
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
117 8A8R8G8B IMGFMT_BGR32
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
118 8A8B8G8R IMGFMT_RGB32
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
119 5R6G5B IMGFMT_BGR16
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
120 5B6G5R IMGFMT_RGB16
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
121 1A5R5G5B IMGFMT_BGR15
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
122 1A5B5G5R IMGFMT_RGB15
13992
702acff80c44 4 and 8 bit formats work with a palette.
reimar
parents: 13535
diff changeset
123
702acff80c44 4 and 8 bit formats work with a palette.
reimar
parents: 13535
diff changeset
124 The following are palettized formats, the palette is in the second plane.
702acff80c44 4 and 8 bit formats work with a palette.
reimar
parents: 13535
diff changeset
125 When they come out of the swscaler (this can be different when they
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
126 come from a codec!), their palette is organized in such a way
13992
702acff80c44 4 and 8 bit formats work with a palette.
reimar
parents: 13535
diff changeset
127 that you actually get:
702acff80c44 4 and 8 bit formats work with a palette.
reimar
parents: 13535
diff changeset
128
12997
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
129 3R3G2B IMGFMT_BGR8
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
130 2B3G3R IMGFMT_RGB8
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
131 1R2G1B IMGFMT_BGR4_CHAR
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
132 1B2G1R IMGFMT_RGB4_CHAR
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
133 1R2G1B1R2G1B IMGFMT_BGR4
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
134 1B2G1R1B2G1R IMGFMT_RGB4
aa59d6df0280 better? RGB/BGR spec
michael
parents: 12159
diff changeset
135
15033
178020a16dae grammar/spelling
diego
parents: 13992
diff changeset
136 Depending upon the CPU being little- or big-endian, different 'in memory' and
13032
2ba5eb44eabd maybe more understandable?
michael
parents: 12997
diff changeset
137 'in register' formats will be equal (LE -> BGRA == BGR32 / BE -> ARGB == BGR32)
17680
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
138
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
139 Practical coding guide:
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
140
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
141 The 4, 8, 15, and 16 bit formats are defined so that the portable way to
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
142 access them is to load the pixel into an integer and use bitmasks.
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
143
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
144 The 24 bit formats are defined so that the portable way to access them is
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
145 to address the 3 components as separate bytes, as in ((uint8_t *)pixel)[0],
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
146 ((uint8_t *)pixel)[1], ((uint8_t *)pixel)[2].
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
147
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
148 When a 32-bit format is identified by the four characters A, R, G, and B in
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
149 some order, the portable way to access it is by addressing the 4 components
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
150 as separate bytes.
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
151
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
152 When a 32-bit format is identified by the 3 characters R, G, and B in some
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
153 order followed by the number 32, the portable way to access it is to load
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
154 the pixel into an integer and use bitmasks.
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
155
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
156 When the above portable access methods are not used, you will need to write
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
157 2 versions of your code, and use #ifdef WORDS_BIGENDIAN to choose the correct
786628f8db88 Add a practical description of endian-independent RGB/BGR coding
pacman
parents: 15033
diff changeset
158 one.