Comments (6)
What you can do, for instance, while building the colexification network (see clics colexification -h
) is redirecting the output to a file, for example:
clics -t 3 -f families colexification --show 3000 --format tsv > out.tsv
from clics3.
Thanks. But why 3000? What is the total number?
from clics3.
from clics3.
Thanks. But why 3000? What is the total number?
No particular reason other than that there are roughly 3000 concepts in CLICS and that, generally speaking, the less frequent colexifications also tend to be less reliable (However, of course note that number of concepts != to the number of colexifications in CLICS). network-3-families.gml
in total has 4228 edges (note that this is before clustering with infomap), so in total there would be 4228 colexifications. The blog post that Mattis mentioned is a very good introduction to programmatically accessing the network data. Here's also a small snippet that shows how to access the data using igraph
.
Note that the snippet is also based on @LinguList and @tresoldi's blog postings.
from clics3.
There is also some code from the "semantic distance" that I present at SLE2019 and discussed in another CALC blog post: https://github.com/tresoldi/semantic_distance
I think what you want is something similar to the full list ( https://github.com/tresoldi/semantic_distance/blob/master/data/colexifications.tsv ), but you should really compute it yourself, and @chrzyki 's snippet is clear. The data in this repository is outdated and includes all possible colexifications, including those found only between a single pair of languages, so that you have a lot of noise in there.
from clics3.
Closing this for now. Feel free to reopen should any other questions arise.
from clics3.
Related Issues (17)
- Update citation on landing page of web app HOT 1
- Repeated "SUBGRAPH" label
- Web app looks odd in firefox HOT 4
- ERROR: File "setup.py" not found. Directory cannot be installed in editable mode: /Library/Frameworks/Python.framework/Versions/3.9/lib/python3.9/site-packages HOT 2
- Add venv usage to README HOT 1
- Workflow doesn't work with Python 3.9.x HOT 4
- Potentially do a bugfix release that pulls datasets from Zenodo instead from GitHub
- Describe workflow and requirements in README HOT 2
- Multiple numeric identifiers for same concept HOT 6
- What goes into the CLICS dataset? HOT 10
- Upgrade pyclics to clldutils 3.2 HOT 2
- Release all datasets with pylexibank 2.0 HOT 2
- Update Code Ocean Capsule HOT 4
- Change dataset installation from dev to fixed versions HOT 1
- Update artifacts in repository after pylexibank 2.0 and dataset releases
- Update citation in README.md HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from clics3.