Coder Social home page Coder Social logo

prajna-ramamurthy / entity-linking-project Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 29 KB

Summer Research Project for IEEE CS on Entity Linking for Ambiguity Removal

Python 100.00%
ambiguity-resolver entities entity-extraction entity-linking entity-recognition ieee ieee-cs loinc ontologies parser

entity-linking-project's Introduction

Entity-Linking-Project

Summer Research Project for IEEE CS on Entity Linking for Ambiguity Removal

This project demonstrates two different approaches for medical entity disambiguation, focusing on mapping medical terms to their corresponding CUIs (Concept Unique Identifiers). The goal is to improve the accuracy of interpreting and classifying medical text data.

File 1: Entity Linking with spaCy and RDFLib

File: entity_linking_with_spacy_and_rdflib.py

This file showcases how to use the spaCy NLP library and RDFLib to perform entity linking for medical terms. It leverages RDF knowledge graphs to link medical terms to their unique CUIs, allowing for more accurate and context-aware disambiguation.

File 2: Medical Term Disambiguation with TF-IDF and SVM

File: medical_term_disambiguation_tfidf_svm.py

This file demonstrates a different approach using TF-IDF vectorization and Support Vector Machines (SVM) for medical term disambiguation. It uses machine learning techniques to classify medical texts and map them to appropriate CUIs, helping to mitigate ambiguity in medical terminology.

Usage

  1. Entity Linking with spaCy and RDFLib:

    • Ensure you have the required libraries (rdflib and spacy) installed.
    • Run the script entity_linking_with_spacy_and_rdflib.py.
    • The script will use spaCy for NLP processing and RDFLib to link medical terms to CUIs.
  2. Medical Term Disambiguation with TF-IDF and SVM:

    • Install the necessary libraries (scikit-learn and numpy) if not already installed.
    • Run the script medical_term_disambiguation_tfidf_svm.py.
    • The script will demonstrate TF-IDF vectorization and SVM classification for medical entity disambiguation.

Results

Both approaches aim to enhance the accuracy of medical term disambiguation. The first approach leverages semantic knowledge from RDF graphs, while the second approach employs machine learning techniques. Feel free to experiment with different texts and terms to observe the disambiguation results.

Acknowledgements

The project was inspired by the need for accurate medical text processing and the concept of entity linking in natural language processing.

entity-linking-project's People

Contributors

prajna-ramamurthy avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.