view DOCS/tech/subcp.txt @ 16429:84174804804b

Updates to NUT spec: 1. remove average_bitrate 2. add other_stream_header, for subtitles and metadata 3. add max_pts to index 4. index_ptr - a 64 bit integer to say the total length of all index packets 5. specify how to write "multiple" indexes 6. change forward_ptr behavior, starts right after forward_ptr, ends after checksum 7. remove stream_id <-> stream_class limitation. 8. time_base_nom must also be non zero. 9. rename time_base_nom and time_base_denom, now timebase means the length of a tick, not amounts of ticks 10. remove (old?) sample_rate_mul stuff. 11. specify what exactly the checksum covers. 12. specify that stream classes which have multiple streams must have an info packet.. (in new Semantic requirements section) 13. Rename 'timestamp' to pts. 14. Change date of draft... 15. Add myself to authors...
author ods15
date Fri, 09 Sep 2005 10:26:21 +0000
parents ef3af71f0113
children 0f1b5b68af32
line wrap: on
line source

Ascii Subtitle / Font CODEPAGEs
===============================

The subtitle encoding issue seems a bit confusing, so I'll try to 
summarize it here.

There are 2 approaches:

1. (preferred) You can generate Unicode subtitles with:
	subfont --unicode <signle-byte encoding known by iconv> ...
or
	subfont --unicode <path to custom encoding file> ...
	(this custom encoding file could list all iso-8859-* characters to create 
single font file for common encodings)

and then run mplayer this way (-subcp and -utf8 expect Unicode font!):
	mplayer -subcp <any encoding known by iconv> ...
or
	mplayer -utf8 ...

2. (current) Generate subtitles for some specific encoding with:
	subfont <signle-byte encoding known by iconv> ...
or
	subfont <path to custom signle-byte or EUC encoding file> ...

and then run mplayer without any encoding options for signle-byte 
encodings, or with -unicode option for EUC (and the like) encodings 
(which is only partially implemented in mplayer).

AFAIK, CJK encodings: EUC-*, BIG5 and GB2312 work more or less this way:
- 0x8e (SINGLE-SHIFT TWO, SS2) begins a 2-byte character,
- 0x8f (SINGLE-SHIFT THREE, SS3) begins a 3-byte character,
- 0xa0-0xff begin 2-byte characters,
- other characters are single-byte.


I tested charmap2enc script only with /usr/share/i18n/charmaps/EUC-KR.gz 
(on RedHat). It wasn't intended to be perfect.


-- 
Artur Zaprzala