Unicode

FreeBSD is a great platform for many multilingual tasks. What can be done to make it even better?

Fonts

Having enough fonts in ports to display any wikipedia article may be considered the first milestone. As a test case, rendering http://meta.wikimedia.org/wiki/List_of_Wikipedias in Firefox should be enough. As of 20 May 2008 there is enough fonts in ports to render the page completely (correctness was not verified, though).

Ideally, fonts in ports should provide as much Unicode coverage as currently possible. Maximum-coverage meta-ports may then be created (possibly divided per Unicode planes) for users to get comprehensive Unicode environment quickly and easily.

Another option to pursue is free fonts. For example, much of Unicode coverage is currently exclusively provided by x11-fonts/code2000, which is shareware. Likewise, x11-fonts/webfonts license is quite restrictive. The situation where considerable coverage is only available under non-free license conditions is undesirable for many users. a deskutils/gucharmap is a good tool to check what scripts are supported by the fonts installed on your machine. x11-fonts/fntsample can produce PDF documents showing coverage of any TTF font. It can also compare coverages of two different fonts.

fc-cache poisoning

It appears that some TTF fonts contain non-unicode names, which causes problems in different programs. Further investigation pending.

Processing

Quite a number of popular tools still lack support for Unicode, or have only partial support. For example, screen(1) only supports 16-bit codepoints.

Rendering

As with fonts, rendering wikipedia languages correctly may be considered the first goal. Unlike fonts, rendering depends more on particular applications. Popular web browsers are a good choice for testing, but other programs are equally important. An X11 terminal with full support of Unicode rendering is an important milestone.

Input

Other issues

It has long been pointed out that FreeBSD has problems with low-level (libc?) support for Unicode. It would be great if a person knowledgeable in the area could rehash the issues here.

Unicode (last edited 2008-06-17 21:38:02 by localhost)