Metadata-Version: 2.4
Name: ipanema
Version: 202602.8
Summary: Packaged language data from Wiktionary
Author-email: Jan Berkel <jan@berkel.fr>
License-Expression: MIT
Project-URL: Homepage, https://gitlab.com/jberkel/ipanema
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: OS Independent
Requires-Python: >=3
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: peewee<=3.17.7
Dynamic: license-file


# /ipaˈnẽmɐ/

An attempt to create a central repository with structured language-related metadata
for applications which need to work with a variety of different languages.

Data is aggregated from various sources and combined into a single SQLite database
which can be queried easily.

The Wiktionary [language code][] is defined as follows:

  1. If the language has a two-letter code in the ISO 639-1 standard, then that code is used.
  2. If the language has a three-letter code in the ISO 639-3 standard, then that code is used.
  3. If the language has a three-letter code in the ISO 639-2 standard, then that code is used. (rare)
  4. Any language which does not have an ISO code, but which is to be included in Wiktionary, has a new Wiktionary-specific "exceptional" code devised for it.

## Data sources

### Language data

Source [Module:languages/data2][], [Module:languages/data3][]

    $ make -j4 -f Makefile.lang-data data/lang_data.json

### Language families

Source [Module:families/data][]

    $ make -f Makefile.lang-data data/lang_families.json

[language code]: https://en.wiktionary.org/wiki/Wiktionary:Languages#Language_codes
[Module:languages/data2]: https://en.wiktionary.org/wiki/Module:languages/data2
[Module:languages/data3]: https://en.wiktionary.org/wiki/Module:languages/data3
[Module:families/data]: https://en.wiktionary.org/wiki/Module:families/data
