Coder Social home page Coder Social logo

cldr's Introduction

CLDR

Compact data from the Unicode Common Locale Data Repository

For anyone interested, I just dumped most of the CLDR data in a compact way (see provided CLDR.INI file).

The final data for all languages is 13,727,497 bytes, but still highly compressible, as seen below.

BSC:    1,023,137 bytes, ratio=92.5468% enctime=1211822us dectime=757387us
BROTLI: 1,287,148 bytes, ratio=90.6236% enctime=67236011us dectime=50243us
LZMA25: 1,369,212 bytes, ratio=90.0258% enctime=3961970us dectime=98609us
LZIP:   1,369,811 bytes, ratio=90.0214% enctime=3895218us dectime=131528us
LZMA20: 1,423,667 bytes, ratio=89.6291% enctime=3334007us dectime=103905us
MINIZ:  1,892,977 bytes, ratio=86.2103% enctime=697077us dectime=31830us
ZSTD:   2,108,053 bytes, ratio=84.6436% enctime=65694us dectime=34525us
LZ4HC:  2,151,652 bytes, ratio=84.326% enctime=491871us dectime=13641us
LZ4:    2,918,991 bytes, ratio=78.7362% enctime=37851us dectime=13775us
RAW:   13,727,497 bytes, ratio=0% enctime=16242us dectime=7658us

This is what is currently processed from the CLDR repos:

  • skipped
  • extracted

cldr-core/supplemental/

  • aliases.json
  • calendarData.json
  • calendarPreferenceData.json
  • characterFallbacks.json
  • codeMappings.json
  • currencyData.json
  • gender.json
  • languageData.json
  • languageMatching.json
  • likelySubtags.json
  • measurementData.json
  • metaZones.json
  • numberingSystems.json
  • ordinals.json
  • parentLocales.json
  • plurals.json
  • primaryZones.json
  • references.json
  • telephoneCodeData.json
  • territoryContainment.json
  • territoryInfo.json (interesting!)
  • timeData.json
  • weekData.json
  • windowsZones.json

cldr-dates-modern\main\xx-XX

  • ca-generic.json
  • ca-gregorian.json
  • dateFields.json
  • timeZoneNames.json

cldr-localenames-modern\main\xx-XX

  • languages.json
  • localeDisplayNames.json
  • scripts.json
  • territories.json
  • transformNames.json
  • variants.json

cldr-misc-modern\main\xx-XX

  • characters.json
  • contextTransforms.json
  • delimiters.json
  • layout.json
  • listPatterns.json
  • posix.json

cldr-numbers-modern\main\xx-XX

  • currencies.json
  • numbers.json

cldr-segments-modern\segments\xx-XX

  • suppressions.json

Licenses

cldr's People

Contributors

r-lyeh avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.