Coder Social home page Coder Social logo

hetio / medline Goto Github PK

View Code? Open in Web Editor NEW
16.0 4.0 4.0 142.56 MB

Computing term cooccurrence in MEDLINE

Home Page: https://doi.org/10.15363/thinklab.d67

Python 2.81% Jupyter Notebook 97.19%
rephetio hetionet medline pubmed cooccurrence symptoms diseases anatomy text-mining literature

medline's People

Contributors

dhimmel avatar larsac avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

medline's Issues

PubMed search matches subset of term name when there are no matches

C566272 is Townes-Brocks-Branchiootorenal-Like Syndrome. Looking on the mesh browser this term has a frequency of 0 meaning it's never tagged a medline topic.

We're currently using the following PubMed search: Townes-Brocks-Branchiootorenal-Like Syndrome [MeSH Terms:noexp], which returns 117,315 results and has the message:

The following term was not found in PubMed: Townes-Brocks-Branchiootorenal-Like

Note now this doesn't include "Syndrome". So I think what's happening is that PubMed isn't finding the entire search term and is falling back to just searching Syndrome [MeSH Terms:noexp], which is matching 117,315 records.

So we probably have to quote the search term.

Expected denominator is wrong causing underestimates of enrichment

Hello,

I am unsure about the denominator in the formula referenced below. Should expected not be equal to the product of the two frequencies of occurrence ? In that case, wouldn't the denominator be equal to the square of total_pmids ? It may be splitting hairs and not make a large difference overall, but I'd like to understand the maths behind the code a little more.

expected = len(pmids0) * len(pmids1) / total_pmids

Thanks,
Lars

How to search PubMed for citations that are assigned MeSH supplemental concept records in MEDLINE?

Based on the MEDLINE online training docs, it sounds like MEDLINE assigns MeSH supplemental concept records (SCRs) to citations:

1-benzylpiperazine is a supplementary chemical concept. It is mapped to the MeSH descriptor PIPERAZINES. That means that the article indexed with 1-benzylpiperazine will appear in PubMed under both 1-benzylpiperazine and PIPERAZINES. The indexer needs ONLY to index the SCR; the Heading Mapped to is added automatically.

The indexing of SCR terms is identical to the indexing of MeSH descriptors. In the IMS Browser, they appear in the scroll-down list alphabetically along with MeSH descriptors. The indexer can search the MeSH browser for complex SCR terms in the same manner as for complex MeSH terms. The Pharmacological Action of the SCR term must be added, if appropriate.

The MeSH browser for 1-benzylpiperazine shows a frequency of 57, leading me to believe it is actually assigned as a topic in MEDLINE.

But searching pubmed for "1-benzylpiperazine" [MeSH Terms:noexp] returns zero results and the message:

Your search was processed without automatic term mapping because it retrieved zero results.

So is there a way to search PubMed for MeSH SCRs?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.