• Font
  • Family
  • Foundry
  • Designer
  • Sample
  • Article
  • Help
Fontke.com>Article>Details

UAX #29, Unicode Text Segmentation, update to improve Mongolian word segmentation

Date:2015-10-01 05:15:17| Standard|Browse: 120|Source: The Unicode Blog|Author: Unicode, Inc.
  • Follow FontKe on Wechat to get Zcode
  • Scan the Qrcode to participate in the SVIP lottery
IntroductionUnicode Standard Annex #29, Unicode Text Segmentation, will be up

Unicode Standard Annex #29, Unicode Text Segmentation, will be updated for Unicode 9.0. A draft of the proposed update is available for general public review and comment.

The Word_Break classification of U+202F NARROW NO-BREAK SPACE (NNBSP) is revised to correct the text segmentation behavior of U+202F for Mongolian usage. For further background on this issue and possible ways to address it, see PRI #308, Property Change for U+202F NARROW NO-BREAK SPACE (NNBSP).

In this revision, the formerly empty Prepend class of the Grapheme_Cluster_Break property is redefined to consist of all prefixed format control characters and a few other characters with certain Indic_Syllabic_Category property values.

The corresponding property value changes will be incorporated in the UCD data files for Unicode 9.0.

0
  • Follow FontKe on Wechat to get Zcode
  • Scan the Qrcode to participate in the SVIP lottery
UAX #29, Unicode Text Segmentation, update to improve Mongolian word segmentation Comments
Guest Please obey the rules of this website. Unclear?
UAX #29, Unicode Text Segmentation, update to improve Mongolian word segmentation Latest comments
No relevant comments