lexibank / bowernpny Goto Github PK
View Code? Open in Web Editor NEWCLDF dataset derived from Bowern and Atkinson's "Internal Structure of Pama-Nyungan" from 2012
License: Creative Commons Attribution 4.0 International
CLDF dataset derived from Bowern and Atkinson's "Internal Structure of Pama-Nyungan" from 2012
License: Creative Commons Attribution 4.0 International
We found an error in orthographic profile (git blame says otherwise, but I'm 99% sure it is my fault): μ
is mapped to m
, but it should be ɳ
. Confirmation by the author via Twitter: https://twitter.com/anggarrgoon/status/1496151913284489217
I can make a PR, but I am not sure if it is the right workflow anymore, especially in terms of regenerating the CLDF.
Hi,
I have the impression that there might be spurious contrasts in the orthography profile, in particular voicing contrasts in occlusives (p/b, g/k, t/d). I also suspect that the ɹ/r contrast is maybe not meaningful.
I suggest that we figure out precisely which contrasts are due to variation in descriptive practice, and which are truly contrasts imputable to sound change etc, and neutralize meaningless contrasts.
As to how to normalize, we have three other datasets with languages from these families, and should make sure we are using the same notations:
Lexibank dataset | Sounds found |
---|---|
bowernpny | + _ a aː ã b bː c cʷ cː d dʒ dʱ dː d̪ e eː f g gʷ gː h i iː j k kʷ kː l lʷ lː l̪ m mː n nʲ nː n̪ n̪ː o oː p pː q qː r rː s t tʃ tʲ tː t̪ t̪ʷ t̪ː u uː ũ v w x yː z æ ð ø ŋ œ ɐ ɑː ɔ ɖ ə ɛ ɛː ɜ ɣ ɤ ɨ ɪ ɭ ɲ ɳ ɹ ɽ ɾ ʀ ʈ ʊ ʒ ʔ ʔʲ ˀb ˀd ˀdʒ ˀk ˀm ˀn ˀr ˀt ˀt̪ ˀw ˀɭ β θ |
johanssonsoundsymbolic | + a aː c i j k l l̪ m n n̪ p r rː t t̪ u w ŋ ɭ ɲ ɳ ɽ ʎ |
joophonosemantic | a aː i j k l l̻ m n n̻ p r t t̻ u uː w ŋ ɭ ɳ ɽ ʈ |
wold | + _ a g i iː j k l m n p r t u w y ŋ ɲ |
As you can see, the other ones use p-k-t, not b-g-d, and have a single /r/ sound. https://github.com/lexibank/wold does have a k/g contrast (if it is also meaningless, we should change it there).
@erichround, @chirila, could you chime in on whether these contrasts should be kept here ? Are there other contrasts that should be neutralized in the list above ? @tresoldi, it looks like the orthography profile was from you, do you remember if there was specific motivations for these contrasts ?
For a closer look, the list of sounds with counts can be found in the TRANSCRIPTION file: https://github.com/lexibank/bowernpny/blob/master/TRANSCRIPTION.md
Having non meaningful contrasts causes issues with downstream analyses of the data, especially in the sound correspondence study.
I noticed that you seem to be using 'thou' for the plural second person pronoun, but in concepticon thou is meant to be you_sg.
For some reason, none of the segments are recognized. Maybe this will go away when moving from CLPA to CLTS anyway.
Hiya,
looks like the code/repo is ready for a release but it hasn't been released/pushed to Zenodo yet. Should I go ahead and do that @xrotwang or will you continue working on this repository?
The title is too long, I would propose: "The Internal Structure of Pama-Yungan"
Wirangu-Nauo is currently mapped to a book-keeping code, this must be fixed according to the instructions provided by @xrotwang.
Opt-in for cross-concept cognates after checking? The cognate sets here seem quite big/inclusive.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.