Coder Social home page Coder Social logo

sveinbjornt / ensk.is Goto Github PK

View Code? Open in Web Editor NEW
30.0 2.0 1.0 166.13 MB

A free and open English-Icelandic dictionary. https://ensk.is

Home Page: https://ensk.is

License: BSD 3-Clause "New" or "Revised" License

Python 73.90% CSS 2.04% HTML 22.41% Shell 0.22% Ruby 1.44%
icelandic-language icelandic dictionary language-dictionary english english-dictionary english-language python

ensk.is's Introduction

License Python 3.12 Build

ensk.is

A free and open English-Icelandic dictionary

This repository contains the data files and software for ensk.is, a free and open public domain online English-Icelandic dictionary.

Download

You can download the dictionary in the following formats:

Background

The author of the original dictionary, Geir T. Zoëga (1857-1928), was an English language teacher and later rector at the Learned School of Reykjavík in the years 1883-1927. The first edition was published in 1896. The digitised, extended and sanitised version presented here is based on the much-improved 1932 edition edited by Þorsteinn Þorsteinsson.

The current incarnation of the dictionary has been heavily edited, updated and fixed in accordance with modern Icelandic orthography. Many additions have been made to bring the vocabulary into the 21st century. Various errors in the original text were also corrected.

Contributing

The raw dictionary text files are stored here.

Additions can be viewed here.

Deployment

Ensk.is an ASGI web application implemented using the FastAPI Python web framework. To deploy it, you need to set up a Python virtual environment (3.12+) and install dependencies:

virtualenv -p /path/to/python3 venv
source venv/bin/activate
pip install -r requirements.txt

To build the dictionary data files from plain text source files, run the following command:

python gen.py

Run the web application via uvicorn:

uvicorn app:app --reload

Should also work with any other reasonable ASGI server.

Development

All new dictionary entries should be appended to data/dict/_add.txt.

To verify the syntax and references of all entries, you can run:

python verify.py

Dictionary License

This dictionary itself is in the public domain within all countries where permitted.

The authors waive copyright and related rights in the work worldwide through the Creative Commons 0 (CC0) 1.0 Universal public domain dedication.

The person who associated a work with this deed has dedicated the work to the public domain by waiving all of his or her rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law.

You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission.

In no way are the patent or trademark rights of any person affected by CC0, nor are the rights that other persons may have in the work or in how the work is used, such as publicity or privacy rights.

Unless expressly stated otherwise, the person who associated a work with this deed makes no warranties about the work, and disclaims liability for all uses of the work, to the fullest extent permitted by applicable law. When using or citing the work, you should not imply endorsement by the author or the affirmer.

Source license

Copyright (c) 2021-2024 Sveinbjorn Thordarson <[email protected]> All rights reserved.

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

  1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.

  2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.

  3. Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

ensk.is's People

Contributors

sveinbjornt avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

jokull

ensk.is's Issues

Bæta við orðum úr Íðorðabankanum?

Ég velti fyrir mér hvort ekki megi nýta Íðorðabanka Árnastofnunar til að auka við orðaforða ensk.is. Auðvitað ákveðin tegund af orðum sem þar má finna, en gæti verið gagnleg viðbót engu að síður. Og þótt ekki sé skýrt kveðið á um tiltekið leyfi þá stendur í upplýsingum um hann:

Íðorðabankinn getur því gagnast vel öllum þeim sem fjalla um sérfræðileg efni, þýðendum, kennurum, nemendum, fjölmiðlafólki, opinberum stofnunum, fyrirtækjum svo og hvers kyns áhugafólki, og síðast en ekki síst orðabókarhöfundum þar sem hann er sérstaklega ætlaður til orðabókasmíða.

https://idordabanki.arnastofnun.is/

Reyndar kemur fram annars staðar að það séu höfundar einstakra orðasafna sem hafi höfundarétt að þeim:

https://english.arnastofnun.is/page/ordabankinn

Væri þetta í samræmi við hugsjónina um ensk.is? Semsagt, ef hægt væri að fá leyfi til að nýta tiltekin orðasöfn þaðan, jafnvel þótt þeir hlutar væru ekki með sama public domain leyfi og er á meginþorranum, þá væri hægt að bæta notendaupplifunina með því að bjóða upp á orð úr þeim orðasöfnum þegar ekki er að finna þýðingu í megintextanum (þyrfti væntanlega einhverja þróun á hugbúnaðinum, en það þyrfti kannski ekki að vera of umfangsmikið?).

Missing data file for gen.py script

Hello,
Running python gen.py within a python virtual environment with the required packages installed gives the following output:

python gen.py 
Traceback (most recent call last):
  File "gen.py", line 127, in <module>
    entries = read_all_entries()
  File "gen.py", line 63, in read_all_entries
    pn = page_for_word(w)
  File "/home/ian/forrit/ensk.is/util.py", line 141, in page_for_word
    with open("data/word2page.json", "r") as file:
FileNotFoundError: [Errno 2] No such file or directory: 'data/word2page.json'

I couldn't find a way to generate the file with a quick look through the scripts, but may well have missed something.

(Ákvað að hafa þetta á ensku svo sem flestir gætu skilið, en ég get breytt því ef íslenskan er ákjósanlegri)

Viðbótarorð

Vil bara óska þér til hamingju með þetta frábæra framtak, og hvað þetta er vel útfært.

Nú finnast mjög mörg orð í bókinni eins og hún er í dag, en auðvitað ekki öll, og þá er spurning hvernig ný orð bætast við hana. Er markmiðið að hver sem er geti bætt við orðum, eða þá bent á orð sem vantar?

Ég leitaði til gamans að orðinu „econometrics“ en það vantar.

Fyrst datt manni í hug að safna bara saman öllum fyrirspurnum sem ekki finnast, en það er kannski ekki alveg það besta upp á information privacy notenda, spurning hvort það ætti að vera takki fyrir neðan „Engar niðurstöður fundust“ þar sem maður getur bent á orð sem vantar. Þá geta notendur líka lagt mat á hvort orðið eigi heima eða ekki (ef það var til dæmis bara innsláttarvilla). Svo væri svo hægt að safna þeim saman og ganga í að bæta mest umbeðnu orðunum við.

Eflaust ertu búinn að pæla í þessu öllu, en þá verður ekki hér annað en að ég fái að skrá fyrsta „issue“ við þessa snilld :-)

Icelandic to english dictionary

Hi, is there any interest in creating an icelandic to english dictionary with the information provided here on ensk.is? essentially a dictionary that is reversible both ways. The icelandic entries will arguably be less rich than the english since they will just be an icelandic definition referring back to an english keyword but still extremely valuable

nit: button and input box don't match

Frábært verkefni! Áhugaverð þessi gamla orðabók og vel unnin hreinsun á stafsetningu og fleiru, þó sú gamla hefði kannski verið áhugaverð er þetta mun nothæfara. Bara til að bæta við einhverju má að mér sýnist eyða út þessarri línu og breyta þannig þessu

image

í þetta

image

(prófað í Firefox og Chrome).

Má líka vel skrifast óþarft og eyða þessum miða.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.