Mercurial > mplayer.hg
view DOCS/tech/realcodecs/audio-codecs.txt @ 10612:8e678e833591
docs updates
author | diego |
---|---|
date | Fri, 15 Aug 2003 12:05:18 +0000 |
parents | be35ec0b2ee3 |
children | 0f1b5b68af32 |
line wrap: on
line source
all audio codecs (cook,atrk,14_4,28_8,dnet,sipr) have the same interface, but i have only analyzed the cook codec audio properties (hex) 00 short text/description of the format (bitrate, when to use) 01 bitrate (bits/s) //avg. bytes/sec output 02 ulong: ? ulong: samples per second ushort: bits/sample ushort: number of channels 03 same as 02 //constant 2 04 long description 05 constant 1 (always?) 06 ulong: block align (input frame size for RADecode) 07 string: minimum player version 08 n/a 09 n/a 0A n/a 0B n/a 0C n/a 0D ? 0E ? leaf size 0F ? 10 ? 11 ? 12 ? 13 min. output buffer size? max. number of samples? 14 ? functions: ulong result=RAOpenCodec2(ra_main_t *raMain); ulong result=RAInitDecoder(ra_main_t *raMain, ra_init_struct *raInit); struct ra_init_struct { ulong sample_rate; ushort bits_per_sample; // unused by RAInitDecoder ushort number_of_channels; ushort unknown1; // 0 ushort unknown2; // also unused (100) ulong leaf_size; // leaf size (used for interleaving, but // exists in audio stream description header (ASDH)) ulong block_align; // packet size ulong bits_per_sample; // unused (always 16) char *ext_data; // 16 bytes located at the end of the // ASDH }; There are some information missing that you usually need for playback, like bits per sample (the fileds aren't read by RAInitDecoder()). These are hard coded in the "flavors", i.e. the sub formats. A flavor is an entry in the list of available format variations like bitrate, number of channels, decoding algorithm, and so on.We can get those information with the following command: void *GetRAFlavorProperty(ra_main_t *raMain, ulong flavor, ulong property, short *property_length_in_bytes); returns property data for a specific data This is not important, because it's just a read only function. These flavor properties don't seem to exist in ulong RADecode(ra_main_t *raMain, char *input_buffer, ulong input_buffer_size, char *output_buffer, ulong *decoded_bytes, ulong p6=-1); RAFreeDecoder(ra_main_t *); RACloseCodec(ra_main_t *); ulong RASetFlavor(ra_main_t *ra_main, ulong flavor); Set the flavor of the stream. a flavor is an entry in the list of available format variations like bitrate, number of channels, decoding algorithm, and so on audio data storage: ------------------- With Real Audio V5 (or earlier?), the audio streams can be interleaved, i.e. the stream is striped amongst several data packets. The packets (which have a fixed size packet_len) are split up into a fixed number of num_parts equally sized parts - I call them leaves in lack of better name. The leaves have the size leaf_size = packet_len / num_parts. To create a bunch of packets, you need 2*num_parts stream packets. The first part of the first stream packet is stored in leaf number 0, the first part of the second into leaf number num_parts, the one of the next one into leaf number 1 etc. The following part of a stream packet is stored 2*num_packets behind the current part of the same stream packet. In short words: when you have a matrix with the leaves as the values, it's a transposition in conjunction with a permutation. packet | leaf | stream packet, part no. -------+---------------+------------------------ 0 | 0 | (0,0) 0 | 1 | (2,0) . | . | . . | . | . 0 | num_parts-1 | (2*num_parts-2,0) 0 | num_parts | (1,0) 0 | num_parts+1 | (3,0) . | . | . . | . | . 0 | 2*num_parts-1 | (2*num_parts-1,0) 1 | 0 | (0,1) . | . | . . | . | . sequence of calls: ------------------ RAOpenCodec2() RAInitDecoder() RASetFlavor() RAGetFlavorProperty(0xE) sequence of RADecode()s once a RAGetFlavorProperty(0xE) after some RADecode()s and occasionally the following sequence: RAGetFlavorProperty(0) RAGetFlavorProperty(7) which is rather pointless because they only return cleartext audio descriptions RAFreeDecoder() RACloseCodec() RAFlush(ra_main_t *raMain, char *output_buffer, ulong *retval) will be called when seeking output_buffer points to the output buffer from the last decode operation. retval is unknown, returning always 0x18 in a specific sample -> further investigation needed