Comments (4)
I am wondering if there's any specific benefit to this over using the SQLite databases available from the website.
Isn't the data from sanskrit-lexicon github page in parallel with the one from Cologne website?
from pycdsl.
In the hindsight, it seems that there is no much difference if you use JSON or sqlite. In a way SQLITE is more frequently updated than JSON.
Ideally csl-orig repository is the latest bleeding-edge data. Once it is stable, sqlites are generated when that data is integrated in Cologne web display.
So for stable data usage, using sqlite makes sense. I drop the idea of JSON usage.
from pycdsl.
Please look at csl-orig at https://github.com/sanskrit-lexicon/csl-orig/commits/master . It has two commits after 24 Jan 2021, which are yet to make it to Cologne website.
One more question - It may happen that a specific dictionary (let's say MW) is not changed in the version 2.0.725 to 2.0.726 in website of Cologne. Do you download new MW sqlite when you see the version change? Or do you do some further analysis whether there is any change in MW sqlite before downloading?
from pycdsl.
It does not use the global version to update the data. It uses the "Last modified" text at the bottom of the download page.
e.g.
https://www.sanskrit-lexicon.uni-koeln.de/scans/WILScan/2020/web/webtc/download.html
I think that data is independent for each dictionary.
Further, the update check is triggered by passing the flag update=True
to the setup functions. (In REPL, it can be triggered by simply typing update
, which checks the last updated date and decides whether to download the update or not.)
I think I will close this as we seem to be in agreement that SQLite data is the way to go.
from pycdsl.
Related Issues (20)
- Python 3.6 Compatibility HOT 7
- transliterate error - unexpected argument HOT 1
- dictionary as an argument instead of attribute HOT 9
- Global transliteration preferences HOT 5
- Search across multiple dictionaries or all dictionaries HOT 6
- search() command for CDSLCorpus class HOT 3
- Documentation of changes / features in 0.3.0 HOT 1
- Write Unit Tests and Ingtegration Tests HOT 2
- Not able to search in more than one dictionary in CLI HOT 1
- CLI minor edit HOT 1
- BUG: cannot update dictionaries without giving a search term
- Credit to CDSL HOT 5
- Allow providing model_map to CDSLCorpus instance HOT 1
- Plan for stable release HOT 1
- Search Modes - by key, by value, by both HOT 5
- BUG: Search by value is also showing key matches
- Use cmd2.Cmd instead of cmd.Cmd
- [QUESTION] Is there a way to search for or detect root words? HOT 4
- normalization of headwords - sAmAnya and sAmAnyaM HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pycdsl.