Locale-Unicode

 view release on metacpan or  search on metacpan

README.md  view on Meta::CPAN

        Name: Ho Chi Minh City, Vietnam

        Time zone: `Asia/Saigon`, `Asia/Ho_Chi_Minh`

    - `vuvli`

        Name: Efate, Vanuatu

        Time zone: `Pacific/Efate`

    - `wfmau`

        Name: Wallis Islands, Wallis and Futuna

        Time zone: `Pacific/Wallis`

    - `wsapw`

        Name: Apia, Samoa

        Time zone: `Pacific/Apia`

    - `yeade`

        Name: Aden, Yemen

        Time zone: `Asia/Aden`

    - `ytmam`

        Name: Mayotte

        Time zone: `Indian/Mayotte`

    - `zajnb`

        Name: Johannesburg, South Africa

        Time zone: `Africa/Johannesburg`

    - `zmlun`

        Name: Lusaka, Zambia

        Time zone: `Africa/Lusaka`

    - `zwhre`

        Name: Harare, Zimbabwe

        Time zone: `Africa/Harare`

    See the [standard documentation](https://unicode.org/reports/tr35/#Time_Zone_Identifiers) for more information.

- `va`

    A [Unicode Variant Identifier](https://unicode.org/reports/tr35/#UnicodeVariantIdentifier) defines a special variant used for locales.

## Transform extensions

This is used for transliterations, transcriptions, translations, etc, as per [RFC6497](https://datatracker.ietf.org/doc/html/rfc6497)>

For example:

- `ja-t-it`

    The content is Japanese, transformed from Italian.

- `ja-Kana-t-it`

    The content is Japanese Katakana, transformed from Italian.

- `und-Latn-t-und-cyrl`

    The content is in the Latin script, transformed from the Cyrillic script.

- `und-Cyrl-t-und-latn-m0-ungegn-2007`

    The content is in Cyrillic, transformed from Latin, according to a UNGEGN specification dated 2007.

    The date is of format `YYYYMMDD` all without space, and the month and day information should be provided only when necessary for clarification, as per the [RFC6497, section 2.5(c)](https://datatracker.ietf.org/doc/html/rfc6497#section-2.5)

- `und-Cyrl-t-und-latn-m0-ungegn`

    Same, but without year.

The complete list of valid subtags is as follows. They are all two to eight alphanumeric characters.

- `d0`

    Transform destination: for non-languages/scripts, such as fullwidth-halfwidth conversion

    See also `s0`

    Possible [values](https://github.com/unicode-org/cldr/blob/maint/maint-41/common/bcp47/transform-destination.xml) are:

    - `accents`

        Map base + punctuation, etc to accented characters

    - `ascii`

        Map as many characters to the closest ASCII character as possible

    - `casefold`

        Apply Unicode case folding

    - `charname`

        Map each character to its Unicode name

    - `digit`

        Convert to digit form of accent

    - `fcc`

        Map string to the FCC format; [http://unicode.org/notes/tn5](http://unicode.org/notes/tn5)

    - `fcd`



( run in 0.887 second using v1.01-cache-2.11-cpan-71847e10f99 )