Coder Social home page Coder Social logo

Comments (6)

chrzyki avatar chrzyki commented on July 29, 2024

What you can do, for instance, while building the colexification network (see clics colexification -h) is redirecting the output to a file, for example:

clics -t 3 -f families colexification --show 3000 --format tsv > out.tsv

from clics3.

ianjoo avatar ianjoo commented on July 29, 2024

Thanks. But why 3000? What is the total number?

from clics3.

LinguList avatar LinguList commented on July 29, 2024

from clics3.

chrzyki avatar chrzyki commented on July 29, 2024

Thanks. But why 3000? What is the total number?

No particular reason other than that there are roughly 3000 concepts in CLICS and that, generally speaking, the less frequent colexifications also tend to be less reliable (However, of course note that number of concepts != to the number of colexifications in CLICS). network-3-families.gml in total has 4228 edges (note that this is before clustering with infomap), so in total there would be 4228 colexifications. The blog post that Mattis mentioned is a very good introduction to programmatically accessing the network data. Here's also a small snippet that shows how to access the data using igraph.

Note that the snippet is also based on @LinguList and @tresoldi's blog postings.

from clics3.

tresoldi avatar tresoldi commented on July 29, 2024

There is also some code from the "semantic distance" that I present at SLE2019 and discussed in another CALC blog post: https://github.com/tresoldi/semantic_distance

I think what you want is something similar to the full list ( https://github.com/tresoldi/semantic_distance/blob/master/data/colexifications.tsv ), but you should really compute it yourself, and @chrzyki 's snippet is clear. The data in this repository is outdated and includes all possible colexifications, including those found only between a single pair of languages, so that you have a lot of noise in there.

from clics3.

chrzyki avatar chrzyki commented on July 29, 2024

Closing this for now. Feel free to reopen should any other questions arise.

from clics3.

Related Issues (17)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.