This document describes requirements for the layout and presentation of text in languages that use the Mongolian script when they are used by Web standards and technologies, such as HTML, CSS, Mobile Web, Digital Publications, and Unicode.

This document describes the basic requirements for Mongolian script layout and text support on the Web and in eBooks. These requirements provide information for Web technologies such as CSS, HTML and digital publications about how to support users of Mongolian script languages. Currently the document focuses on the Traditional Mongolian script as used for Mongolian. The information here is developed in conjunction with a document that summarises gaps in support on the Web for Mongolian.

The editor's draft of this document is being developed by the Mongolian Layout Task Force, part of the W3C Internationalization Interest Group. It is published by the Internationalization Working Group. The end target for this document is a Working Group Note.

Sending comments on this document

If you wish to make comments regarding this document, please raise them as github issues. Only send comments by email if you are unable to raise issues on github (see links below). All comments are welcome.

To make it easier to track comments, please raise separate issues or emails for each comment, and point to the section you are commenting on  using a URL.

Introduction

About this document

This standard describes the Mongolian format standard, which can be used and achieve implementation in the CSS, SVG and XSL-FO, and it can also be used as the reference documents for office software. It mainly focuses on the Mongolian script and its features, the basic module of Mongolian text, page layout, new CSS standard and accessories.

Scope

The document describes the rules of composition of various types of text written in traditional Mongolian script and pictures, etc. in various network formats. The purpose of this standard is not to provide complete solution, but to describe the important basic information needed in the implementation in a computer.

The main purpose of this document is to provide standardization and guiding rules for the display and application of traditional Mongolian script in the web. However, the more complex layout requirements such as paper layout are not included in this document. Thus, this document includes the most basic rules of display of page and characters of the traditional Mongolian script, aiming at formulating the corresponding standards of web display and application of Mongolian script for the web application vendors to follow this standard of display and application of Mongolian script in the web.

Terminology

The following terms and definitions apply to this standard.

Composition
The composition in the W3C Standard Format includes Mongolian characters, punctuation marks, and other marks related to the script.
Mongolian Space
NARROW NO-BREAK SPACE,U+202F Official term should be: NARROW NO - BREAK SPACE, code: U + 202F Two spaces are used in Mongolian, U + 0020 and U + 202 F. One is a common space: U + 0020, and the other is a Mongolian space U + 202 F.
Mongolian Symbols
It refers to various symbols used in Mongolian text. In formulating the Mongolian coding standard, some of the codes is used in the coding area of the Mongolian standard, while the rest are symbols used in the scripts of other ethnic groups, due to the fact that some symbols are borrowed from other ethnic groups .
Baseline
Baseline in Mongolian is called “Nirogo”. Mongolian writing is generally aligned to the baseline.
Writing Direction
It refers to the default direction of writing. For example, English writing direction is horizontal from top to bottom, while the Mongolian writing is vertical from left to right.
Text Width
Text width indicates the distance from the word at the left start to the word at the right ead in a line in Mongolian, in the equivalent of the height of the horizontal English or Chinese text writing.
Text Height
Text height refers to the height of a Mongolian word, in equivalent of the width of the horizontal English or Chinese text writing.
Text Spacing
Text spacing refers to the gap between the Mongolian words.
Leftline and Rightline
As the Mongolian writing is vertical, the right line, the left line and the line through is equivalent to the underline and overline in the horizontal English or Chinese text writing .

Gap analysis

This document is pointed to by a separate document, Mongolian Gap Analysis, which describes gaps in support for Mongolian on the Web, and prioritises and describes the impact of those gaps on the user.

Wherever an unsupported feature is indentified through the gap analysis process, the requirements for that feature need to be documented. This document is where those requirements are described.

This document should contain no reference to a particular technology. For example, it should not say "CSS does/doesn't do such and such", and it should not describe how a technology, such as CSS, should implement the requirements. It is technology agnostic, so that it will be evergreen, and it simply describes how the script works. The gap analysis document is the appropriate place for all kinds of technology-specific information.

Other related resources

Initial content for this document drew on Chinese standards work. (EXPAND)

The document International text layout and typography index (known informally as the text layout index) points to this document and others, and provides a central location for developers and implementers to find information related to various scripts.

The W3C also maintains a tracking system that has links to github issues in W3C repositories. There are separate links for (a) requests from developers to the user community for information about how scripts/languages work, (b) issues raised against a spec, and (c) browser bugs. For example, you can find out what information developers are currently seeking, and the resulting list can also be filtered by script.

Mongolian Script Overview

This section introduces the Mongolian script in general terms, providing some context and terminology that is useful in the remainder of the document. If possible, it's best not to introduce actual requirements in this section, but to leave those and detailed descriptions of expected typography to the sections that follow.

Is the script an alphabet, abugida, abjad, syllabary, etc? What are the basic set of characters for major languages written in the script? How does the script work? Are glyphs in the script regularly joined up, as in Arabic, or not? Are there other special features, such as stacking, syllabic clusters and conjuncts, etc. Is there a set of characters used for special purposes, such as transcription? Are multiple scripts used?

What are the basic directional features of the script? Is the script written right-to-left, or bidirectionally? Is is written vertically? If so, can it also be written horizontally, what is the frequency with which it is written one way or the other, and which side is the first line on?

Are there special considerations about page layout that should be described briefly at this point, eg. Japanese kihon-hanmen?

Characters and phrases

The following is placeholder text, aimed to give suggestions for content. The questions are only prompts, to help the editor think of topics to address. Actual content will depend on the needs of the script being described. Change and reorganize sections as headings and needed.

Fonts and font styles

What are key fonts and or font styles used for this script? Do italic fonts lean in the right direction? Is synthesised italicisation problematic? Are there standard fallback fonts in use in browsers, and if so do they match expectations? Does the script use fixed-width font glyphs, proportionally-spaced fonts, or a combination? See available information or check for currently needed data.

Glyph control

Does the script in question require additional features to support alterations to the position or shape of glyphs, for example adjusting the distance between the base text and diacritics, or changing the glyphs used in a systematic way? See available information or check for currently needed data.

Cursive text

If this script is cursive (eg. Arabic, N'Ko, Syriac, Mongolian, etc), are there problems or needed features related to the handling of cursive text? Do cursive links break if parts of a word are marked up or styled? Do Unicode joiner and non-joiner characters behave as expected? See available information or check for currently needed data.

Punctuation

What are the typical punctuation marks used, and how are they used? (See other sections for information about how quotations work, and how punctuation interacts with boundary detection and line-breaking, etc.) See available information or check for currently needed data.

Punctuation rules

There are some specific requirements while using the Mongolian punctuation:

  1. Commas, periods, exclamation marks, question marks, or colons cannot be used at the beginning of a line.
  2. Opening brackets cannot be used at the end of a line.
  3. Closing bracket cannot be used at the beginning of a line.
  4. The colon and "《" of the opening bracket cannot be used separately, which means they must be used together and they are not allowed to be used at the beginning of a line.
  5. Dash consists of two long separatrix, which cannot be separated, i.e., one is used at the beginning of a line and the other is used at the end. However, it is allowed to appear at the beginning of the line when a paragraph begins.
  6. Mongolian names of Mongolian punctuation and writing direction are as shown in , namely the writing direction is the same as that in the graph.

    Mongolian punctuation sample
  7. All the punctuation in Mongolian "baseline" should keep in accordance with alignment of the midline, as shown in .

    Alignment relationship between punctuation and Mongolian script text.

Display rules for Mongolian space

Mongolian case supplementary component display rules and Mongolian space (0x202f) processing.

The inputting of Mongolian case supplementary component into the computer requires the Mongolian space 0x202F first, and then the corresponding Mongolian script. For example, the internal code of the word “ ᠤᠨ” is: 0x202F 0x1824 0x1828, among which 0x202F is called Mongolian space, inputted before the Mongolian case supplementary component .

The difference between the common space and Mongolian space.

Mongolian space (0x202F) and case supplementary component cannot appear at the beginning of a line, for example, is the correct processing method and is the wrong processing method.

Correct processing.

Wrong processing.

 

Quotations

What is the expected behaviour for quotations marks, especially when nested? Should block quotes be indented or handled specially? See available information or check for currently needed data.

Symbols

Does the script use special symbols that are worth noting, eg. head marks in Tibetan?

Numbers, dates, etc.

Does the script have its own set of number digits? How are they used, and how frequently? Does the numbering system use base-10, or some other type of base? Does it have special formatting patterns (eg. 12,34,000 in India). What about date/time formats and selection – are non-Gregorian calendars used? How are percent signs used, and do numbers have special decorations (like in Ethiopic or Syriac)?

Text boundaries & selection

What are the basic units of the text, and how are they demarcated, eg. characters, character sequences, syllables, or words? Are spaces or different symbols used between 'words'? Is it important to treat clusters of characters as a single unit? What should happen if you double- or triple-click in the text, whether or not 'words' are separated by spaces? See available information.

Selection rules

The selected midcourt line must overlap the Mongolian "baseline", as shown in Multi-line selection must follow the writing direction of Mongolian script, i.e., text direction from top to bottom and from left to right, as shown in .

The effect of choosing a single-line text.

 

The effect of choosing multi lines.

Cursor movement rules

There are cursor movement keys including “←”, “→”, “↑”, “↓”, “Page Up”, “Page Down”, “Home” and “End” in the keyboard. Striking these keys needs to follow the writing direction of Mongolian script from top to bottom and from left to right. For example, the cursor moves to the left after striking the key of “←”; the cursor moves to the right when striking the key of “→”. The cursor moves down after striking the key of “↓”. When the cursor moves to the bottom of the current line, press the key of “↓” again, the cursor will move right to the top of the next line, the case of the key “↑” is opposite, as shown in .

Illustration of cursor movement.

Mouse pointer rules

The mouse pointer in text editing is shown in .

Scrollbar will scroll left and right when mouse wheel is scrolling, that is, the scrollbar will move left or the text will move right when mouse wheel is scrolling forward; the scrollbar will move right or the text will move left when mouse wheel is scrolling backward.

Illustration of cursor movement.

The cursor shape in text editing is shown in ,The cursor must be aligned at the base on the midpoint of the “baseline” of Mongolian script, that is, the midpoint of the “baseline” of Mongolian script is overlapping the midpoint of the cursor, with the length no longer than the middle line between the two lines of the text, as shown in the .

Illustration of cursor movement.

Transforming characters

What transformations does your script need? For example, does your script convert letters to uppercase, capitalised and lowercase alternatives? Do you need to to convert between half-width and full-width presentation forms? See available information or check for currently needed data.

Inter-character spacing

Some scripts create emphasis or other effects by spacing out the letters or syllables in a word. We know there are questions here about how this should work in Indic and SE Asian scripts, and in Arabic-based scripts. Can you provide information? Are there requirements for this script that we should add? (For justification related spacing, see below.) See available information or check for currently needed data.

Ruby annotation

Are ruby annotations used in this script/language? If so, how? See available information or check for currently needed data.

Text decoration

Some aspects related to the drawing of lines alongside or through text involve local typographic considerations. For example, underlines need to be broken in special ways for some scripts, and the position relative to the text may vary depending on the script. Do you need support for special line shapes or widths? What about vertical text? See available information or check for currently needed data.

Right line in Mongolian writing is on the right of the text, similar to the underline of horizontal English and Chinese text . The left line is on the left of the text, similar to the overline of horizontal English and Chinese text. The strikethrough is a “baseline”-centered vertical line as shown in

Illustration of the right line, the left line and the strikethrough.

The underline, the overline and the strikethrough in mixed composition with other languages are shown in :

The underline, the overline and the strikethrough as mixed with other languages.

Emphasis & highlighting

Bold and italic are not always appropriate for expressing emphasis, and some scripts have their own unique ways of doing it, that are not in the Western tradition at all. If this applies to this script/language, how is it done? See available information or check for currently needed data.

Bidirectional text

If this script runs right-to-left, are there any special requirements when handling that? What about numbers and expressions – do they run rtl or ltr? See available information or check for currently needed data.

Other inline features

Does the script/language have special ways of representing inline notes (such as wakiten or kumimoji in Japanese) or other special inline features that need to be supported? See available information or check for currently needed data.

Mongolian text width, height and spacing

Due to the different height of Mongolian text, in order to ensure that every character is in fully displayed and the whole word looks beautiful, the height of each letter must keep in balance. The spaces between words must be different from the common gaps whose code point is U+202F NARROW NO-BREAK SPACE. See :

The height, width and spacing of Mongolian words.

Lines and Paragraphs

Line breaking

What are the important features about how script wraps when it hits the end of a line? Does line-breaking wrap whole 'words' at a time, or characters, or something else (such as syllables in Tibetan and Javanese)? What characters should not appear at the end or start of a line, and what should be done to prevent that? See available information or check for currently needed data.

Newline does not allow a word break and a sign of a new line of all words are space, punctuation or another language. is the correct newline without breaking the Mongolian words. is the wrong newline for breaking “” and “”.

Correct newline.

Wrong newline.

Hyphenation

Is hyphenation used for your script, or something else? If hyphenation is used, where should the hyphen appear? See available information or check for currently needed data.

Justification & line-end alignment

Is it normal to have flush lines down both sides of the text body (ie. full justification)? If so, what rules are used by the script to resolve the positioning of characters on a line? Does the script conform to a grid pattern? Does your script allow punctuation to hang outside the text box at the start or end of a line? Where adjustments are need to make a line flush, how is that done? Do you shrink/stretch space between words and/or letters? Are word baselines stretched, as in Arabic? See available information or check for currently needed data.

Text alignment

Text alignment includes "left alignment", "horizontal center", “right alignment ” as well as “top alignment”, “vertical center”, “bottom alignment”and “top-bottom alignment”. "Left alignment ", "horizontal center" and “right alignment” are the alignment rules inside a line. “Top alignment”, “horizontal center”, “bottom alignment” and “top-bottom alignment” are the alignment rules in a page or a paragraph of a multi-line text.

Among "Left alignment", "horizontal center", and “right alignment”, “horizontal center” is a default option which means that Mongolian script will align basing on its “baseline” for axis. “Left alignment” means that the text will move left in a certain distance, which shows obvious difference compared with the text after setting “horizontal center”. “Right alignment” means that the text will move right in a certain distance after, which shows obvious difference compared with the text after setting “horizontal center”.

Among “Top alignment”, “vertical center”, “bottom alignment”and “top-bottom alignment”, “top-bottom alignment” is a default option which means that the words in the upper and the lower boundary of a multi-line text in a paragraph or a page are all aligned. And its alignment method is that “spaces” in the text are stretched in the same proportion. “Top alignment” means that the words in the upper boundary of a multi-line text in a paragraph or a page are all aligned without considering the alignment of the lower boundary and stretching the space between words. “Vertical center” means the words in the upper and the lower boundary of a multi-line text in a paragraph or a page do not need to align and leave the same blank under the condition of no-stretching space between words.

Counters, lists, etc

The display of these controls needs paying attention to the direction of the output. That is to say, the primary display of the number 1, 2, 3. Etc. from left to right. The effect of the following codes is as it is shown in , please pay attention to the midcourt line aligning principles.

Lists.

Initial letter styling

Does the script/language apply special styling of the initial letter of a line or paragraph, such as for drop caps? If so, what are the rules for positioning: what is the size relationship between the large letter and the lines alongide? where does the large letter anchor relative to the lines alongside? is it normal to include initial quote marks in the large letter? is the large letter really a syllable? etc. See available information or check for currently needed data.

Baselines

What are the requirements for baseline alignment between mixed scripts and in general? See available information or check for currently needed data.

Mongolian is aligned to a "baseline" that runs down the center of the writing, and all text is aligned to this baseline as shown in :

The aligning baseline for Mongolian.

Mixed Arrangement Rules with Other Languages

When mixed with other languages, the main consideration should be midcourt line aligning.

Mixed Arrangement Rules with Numbers and Latin

There is no obvious midcourt line in numbers and Latin. Therefore generally, half of the text height is regarded as a midcourt line position. When Mongolian script is mixed with numbers and Latin, the line of half of the text height should be aligned with Mongolian midcourt line. When number’s size is the same as Mongolian’s, it will be slightly larger, so some handling methods should be taken to avoid the problem, such as the methods listed in .

The mixed arrangement of Mongolian, Latin and numbers.

Mixed Arrangement Rules with Chinese and Japanese

The notices of the mixed arrangement with Chinese or Japanese are as the following:

  1. Chinese and Japanese cannot be displayed in the way of lying down. They must be the same as the display direction of Chinese or Japanese in horizontal.
  2. The center line of Chinese or Japanese text height (the width in this case) needs to be aligned with the midcourt line of Mongolian script.
  3. Pay attention to the top alignment and bottom alignment of Mongolian script. In the same line, the mixed display of Mongolian and Chinese needs paying attention to balanced alignment. When displayed, it should stretch space, but empty distance or space should not be added in Chinese text.
  4. Mongolian is smaller than Chinese or Japanese in the same font size, so some handling methods should be taken to avoid the problem, such as methods listed in .

The mixed arrangement of Mongolian and Chinese.

Other paragraph features

In this script/language, is the first line of text typically indented at the start of a paragraph? Are there other features of paragraph design that are peculiar to your script? See available information or check for currently needed data.

Layout & pages

General page layout & progression

Bookbinding and the Direction of Page Turning

In general, bookbinding is on the left side, as shown in :

Bookbinding.

The direction of page turning is to the left hand, as shown in :

The direction of page turning.

Paper direction

Generally, "horizontal" is the default direction in Mongolian format, as shown in .

Effect of "Horizontal" paper direction.

Paper rolling direction

Generally, "horizontal" is the default direction in Mongolian format, as shown in .

The direction of paper format.

The rolling direction of scroll bars

When there are much more words in the text in excess of the prescribed space size, the scroll bar needs to be displayed.The default display position of the scroll bar is downmost in the corresponding space (The simultaneous display of both horizontal and vertical scroll bars has not been installed).

Position and rolling direction of scroll bar.

Columns

In the Mongolian format, page number should be displayed on the upper side or downside.

The effect of 2 columns.

Bidirectional layout

If this is a RTL or vertically-set writing system, how do you specify the location of objects, text, etc. relative to the flow? Is content mirror-imaged completely in layouts? What other aspects of layout are affected by direction? See available information or check for currently needed data.

Vertical text

What are the requirements for vertically oriented text? What about if you mix vertical text with scripts that are normally only horizontal? Is it normal to use different characters in vertical vs. horizontal text? Do you expect short numbers, acronyms, etc to run horizontally within a vertical line (tate chu yoko)? See available information.

The Mongolian writing direction is shown in

Mongolian writing direction.

Tables

When setting writing-mode:tb-lr of CSS (grammar will have a little difference according to various browsers), the table will support vertical display feature of Mongolian script.When setting default option for the text in the table, it will display in “horizontal center” state, that is, the upper and the lower center lines of the table cell will align at the center line of “baseline” of Mongolian script, as shown in the .

The aligning baseline for Mongolian.

Notes, footnotes, etc

Does your script have special requirements for notes, footnotes, endnotes or other necessary annotations of this kind in the way needed for your culture? See available information or check for currently needed data.

Page numbering, running headers, etc

In the Mongolian format, page number should be display on the upper side or downside.

Adding the horizontal type and number on the page number .

User interaction

All controls in input need to adjust according to the characteristics of the vertical typeset of Mongolian script, for example, controls like text and button need to support to enter and display text vertically, among which the cursors in text and password need to conform to the cursor style as shown in the .

The display of input controls and the alignment of Mongolian script.

Select

The select box appeared in the following code of HTML should be as it is shown in . According to default setting, the scroll bar is at the bottom and closes to the leftmost. To see the contents at the back of the list, it scrolls from left to right. The rolling of the mouse wheel should be in accordance with illustration in . While selecting a column, the selected text and the selected background color should conform to the principle of aligning of the midcourt line. (see the descriptions in ).

Correct newline.

Mongolian standard the select box.

Textarea

The label control mainly considers text midcourt line aligning principles. ( See the descriptions in TBD). The label display of the following codes is shown in .

Display status of the standard textarea.

Label

Textarea is an important control, and is required in the text input, edit, and display. Scroll bar rolling accords with requirements of select control in TBD. The display and moving direction of the cursor should be consistent with the standard of cursor movement in TBD. In some browsers, there are functions of stretching the size of textarea. The stretching icon should be in the lower right corner, and the textarea scaling accords with the mouse dragging. The rows and cols attribute of Textarea is the opposite of those in horizontal. Its specific attributes are as the following: rows {int} : showing the column number and cols {int} : showing the row number

The display of label controls.

Fieldset

As shown in , it is a fieldset, the example fieldset generation with the following codes, the latter figure shows the midcourt line aligning principles.

Fieldset effect.

More page layout & pagination

Some cultures define page areas and page progression direction very differently from those in the West (eg. kihon hanmen in Japanese). Is this an issue for you? Are widows and orphans relevant? In what order do pages progress, RTL or LTR? See available information or check for currently needed data.

Mixed arrangement of text and illustrations (or other non-text objects)

The illustrations here include ordinary picture formats, textboxes, charts, media objects and so on, all of which are called illustration as followed. There are many ways (as shown in ) of mixed arrangement of text and illustrations but, no matter which way, some principles need to be obeyed.

  1. The illustration cannot be rotated, that is, the original illustration is inserted just as it is. The illustration cannot be rotated into the vertical one just because of vertical text.
  2. The Mongolian script around the inserted illustrations cannot be broken inside the word. The rules of the beginning and end of lines are the same as the original ones.

by mixed.

Glossary

Term Mongolian Transliteration Definition
alignment    

Suggestions about Formulating New CSS Standard

  1. When Mongolian webpage is created, the page direction should be designated: CSS-MONGOLIAN-LAYOUT:Y. Designating the option means that all page elements should be arranged in accordance with the requirement of vertical column script. Meanwhile, there are the main requirements: script should conform to the habit of Mongolian writing -----arrangement from top to bottom and from left to right (see the descriptions in 2.2.3); textboxes, buttons and list order are all displayed in vertical column; pictures, videos and the other third party controls do not need to accord with the requirement of vertical column.
  2. In mixed arrangement with other languages, Mongolian text is X larger than the other texts. Designating the option requires that the size of Mongolian in all control texts should be automatically X larger than the size of the text of other scripts or X smaller under the condition of using minus.
  3. When arranged with other languages, Mongolian should follow the midcourt line aligning principles as the following. when the option CSS-MONGOLIAN-MOVE- WAIST:X px is designated, Mongolian text in all control texts should move X px to left or right so that it can be aligned with the midcourt line of other texts. Because the midcourt line cannot be aligned well under the condition, the option needs to be set.
  4. When arranged with other languages, Mongolian font should be set as CSS-MONGOLIAN-FONT:MongolianFontName. When the option is designated, Mongolian font in all control texts should be changed into MongolianFontName, while other languages’ font should accord with default font in the system. Because the change of Mongolian font will affect Chinese font when it is arranged with Chinese or other languages and sometimes, Chinese will be displayed in the way of lying down.
  5. Whether Mongolian space (0x202F) is stretched: CSS-MONGOLIAN-SPACE:Stretch|Normal, when the Stretch option is designated, the length of Mogolian space should be stretched as that of normal space. While the option Normal is designated, Mongolian space should not be stretched, and should be displayed in the length of the font library.
  6. Whether the Mongolian vowel space mark MVS (0x180E): is displayed: CSS-MONGOLIAN-MVS:Display|Normal, when the option Display is designated, there will be a small hollow rectangular on the screen with the size in the font library, but the rectangular will not be shown when it is printed. When the option Normal is designated, there will be no picture on the screen, but it will occupy a space of the size in the font library.

Relative Standards of Currently Used Scripts in the Traditional Mongolian Script

Currently, there are some main characters in applied traditional Mongolian script, including basic Mongolian characters, symbols and Mongolian numbers.

Acknowledgements

Special thanks to the following people who contributed to this document (contributors' names listed in in alphabetic order).

This Person, That Person, etc

Please find the latest info of the contributors at the GitHub contributors list.