Featured Linguist!

Jost Gippert: Our Featured Linguist!

"Buenos dias", "buenas noches" -- this was the first words in a foreign language I heard in my life, as a three-year old boy growing up in developing post-war Western Germany, where the first gastarbeiters had arrived from Spain. Fascinated by the strange sounds, I tried to get to know some more languages, the only opportunity being TV courses of English and French -- there was no foreign language education for pre-teen school children in Germany yet in those days. Read more

Donate Now | Visit the Fund Drive Homepage

Amount Raised:


Still Needed:


Can anyone overtake Syntax in the Subfield Challenge ?

Grad School Challenge Leader: University of Washington

Publishing Partner: Cambridge University Press CUP Extra Publisher Login

Discussion Details

Title: Missing Characters and Glyphs in Unicode Fonts
Submitter: Maurizio Lana
Description: Dear colleagues,

I would like to get your advice about a matter regarding characters,
glyphs, fonts and Unicode.

In printed late-Latin texts symbols do appear representing measurement
units (you can see a list of them at
http://www.tulliana.eu/documenti/measure%20units.doc). The problem is that
- as far as I know - no single font contains all of those characters. Some
of them are found in Cardo by D. Perry, some other in Alphabetum by J.J.
Marcos, some other elsewhere, and some of them do not appear in any font I

So some questions arise about:
1) How do we represent those 'signs' in a digital text?
2) What should be done in order to get the glyph corresponding to the
missing characters?
3) What should be done in order to have 1 single font containing all those
symbols? (Creating an ad-hoc font? Expanding one of the existing fonts?)
4) What to do in order to have Unicode-compliant character codes for the
now non-existent characters?

The solution of getting the missing characters from many different fonts is
not viable as it requires that the scholar interested to those texts loads
many different fonts in order use 1 or 2 or 5 characters from each of them.

With many thanks to everyone!
Date Posted: 24-Jun-2011
Linguistic Field(s): Computational Linguistics
Language Specialty: None
LL Issue: 22.2647
Posted: 24-Jun-2011

Search Again

Back to Discussions Index