
jonasengelmann / worldcat-reconciliation-service


Worldcat.org reconciliation service for OpenRefine.

License: MIT License

Python 97.61% Dockerfile 2.39%
openrefine metadata metadata-editor worldcat

Worldcat-reconciliation-service

OpenRefine reconciliation service for Worldcat.

Implemented query properties are author and publication_year.
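
As a rough sketch of how these two properties might be passed, the snippet below builds a query batch in the shape used by the generic OpenRefine reconciliation protocol (a `query` string plus a `properties` list of `pid`/`v` pairs). The helper names `build_queries` and `encode_form_body`, the sample row, and the endpoint mentioned in the comment are assumptions for illustration, not taken from this repository.

```python
import json
from urllib.parse import urlencode


def build_queries(rows):
    """Build a reconciliation query batch from (title, author, year) tuples.

    Empty author or year values are skipped so only known properties are sent.
    """
    queries = {}
    for i, (title, author, year) in enumerate(rows):
        properties = []
        if author:
            properties.append({"pid": "author", "v": author})
        if year:
            properties.append({"pid": "publication_year", "v": str(year)})
        queries[f"q{i}"] = {"query": title, "properties": properties}
    return queries


def encode_form_body(queries):
    """Encode the batch as a form body (queries=<json>) for a POST request."""
    return urlencode({"queries": json.dumps(queries)})


# Sample row, chosen only for illustration.
batch = build_queries([("The Trial", "Franz Kafka", 1925)])
body = encode_form_body(batch)
# The body could then be POSTed to the running service,
# e.g. http://localhost:8000 in the development setup (assumed endpoint).
```

In OpenRefine itself this payload is produced by the reconciliation dialog when columns are mapped to the `author` and `publication_year` properties, so the sketch is mainly useful for testing the service from a script.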

If you only need to disambiguate conceptual works, i.e. all realizations and editions of a work are treated as the same entry, the service also provides a data extension service that extracts the OCLC numbers of all editions. The resulting set of OCLC numbers unambiguously identifies a conceptual work, so an entry only needs to be matched to an arbitrary edition of that work. The extension service can be used as follows:

Edit column -> Add columns from reconciled values... -> OCLC Numbers of all Editions
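
Once the new column is filled, rows that were matched to different editions of the same work can be grouped by their edition sets. The following is a minimal sketch, assuming the extension returns the same full set of OCLC numbers no matter which edition a row was matched to. The function names and the OCLC numbers are placeholders, not real identifiers.

```python
from collections import defaultdict


def work_key(edition_oclc_numbers):
    """Turn a list of OCLC numbers into an order-independent key for a work."""
    return frozenset(str(n).strip() for n in edition_oclc_numbers)


def group_by_work(rows):
    """Group row ids whose edition sets are identical.

    `rows` is an iterable of (row_id, edition_oclc_numbers) pairs.
    """
    groups = defaultdict(list)
    for row_id, editions in rows:
        groups[work_key(editions)].append(row_id)
    return list(groups.values())


# Placeholder data: rows "a" and "b" point at the same conceptual work.
rows = [
    ("a", ["1001", "1002"]),
    ("b", ["1002", "1001"]),
    ("c", ["2001"]),
]
groups = group_by_work(rows)
```

Using a `frozenset` as the key makes the comparison independent of the order in which the edition numbers are listed.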

Run via Docker-Compose (Recommended)

cp .env.docker-compose .env
docker-compose up --build

The reconciliation service should now be accessible at http://localhost.

Development

The reconciliation service requires Selenium and Redis. The easiest way is to run these services via Docker:

docker run --rm -it -p 4444:4444 --shm-size 2g seleniarm/standalone-chromium:latest
docker run --rm -it -p 6379:6379 redis:latest

Set up the pre-commit hooks and install the dependencies:

pre-commit install
pip3 install -r requirement.txt

Start the reconciliation service:

uvicorn main:app --reload --port 8000 --env-file .env.dev

The reconciliation service should now be accessible at http://localhost:8000.
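
To check from a script that the service is up, one option is to request its service manifest. The sketch below assumes the manifest is served as JSON at the root URL, as is common for OpenRefine reconciliation services. Both function names are hypothetical, and the `name` and `extend` fields come from the generic reconciliation protocol rather than from this repository.

```python
import json
from urllib.request import urlopen


def fetch_manifest(base_url="http://localhost:8000", timeout=5):
    """Fetch the service manifest (assumed to be JSON at the root URL)."""
    with urlopen(base_url, timeout=timeout) as response:
        return json.load(response)


def summarize_manifest(manifest):
    """Return a one-line summary of a manifest dictionary."""
    name = manifest.get("name", "<unnamed service>")
    has_extend = "extend" in manifest
    return f"{name} (data extension: {'yes' if has_extend else 'no'})"
```

With the development server running, `print(summarize_manifest(fetch_manifest()))` would print the summary line.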

License

This project is licensed under the MIT license. See the LICENSE file for more information.

Contributors

dependabot[bot], jonasengelmann


worldcat-reconciliation-service's Issues

Add more properties to Data Extension Service

For now, only one property (OCLC Numbers of all Editions) is supported.

For type book add:

  • ISBN
  • ISBN13
  • Publisher
  • Publication Date

For type book, artchap, archv add:

  • First Author
  • All Authors
