imputation-of-missing-location-information-for-worldwide-patent-data's Issues

About missing first filings in imputation_priority_filings_app_ctry_codes_20190829.sql

Thank you for sharing the SQL code for identifying priority filings. While reviewing it, I came across a point of concern in the following statement:

INSERT INTO PRIORITY_FILINGS_app
SELECT DISTINCT t1.appln_id, t1.appln_kind, t6.person_id, t1.appln_auth, t1.appln_filing_year, t1.appln_filing_date, 'priority'
FROM patstat.tls201_appln t1
JOIN patstat.tls204_appln_prior t2 ON t1.appln_id = t2.prior_appln_id
--LEFT OUTER JOIN toExclude t3 ON t1.appln_id = t3.appln_id
--JOIN patstat.tls211_pat_publn t4 ON t1.appln_id = t4.appln_id
JOIN po t5 ON t1.appln_auth = t5.patent_office
JOIN patstat.tls207_pers_appln t6 ON t1.appln_id = t6.appln_id
WHERE (t1.appln_kind != 'W')
AND t1.internat_appln_id = 0
--AND t3.appln_id IS NULL
--AND t4.publn_nr IS NOT NULL
--AND t4.publn_kind !='D2'
AND applt_seq_nr > 0
AND t1.appln_filing_year >= 1980 AND t1.appln_filing_year < 2016;

This query uses an inner JOIN with patstat.tls207_pers_appln, so only applications that have at least one row in that table (with applt_seq_nr > 0) are retained as first filings. In other words, the query assumes that every first filing appears in patstat.tls207_pers_appln.

However, I found that some first filings are not in patstat.tls207_pers_appln. For example, appln_id='902221298' does not appear in that table even though it is the priority filing for appln_id='456788097'. Without the JOIN, appln_id='902221298' could still have been inferred through appln_id='456788097'. In cases like this, is the query missing certain first filings?
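To make the concern concrete, here is a minimal sketch using Python's sqlite3 module with toy tables. It is not the repository's code: the schema is an assumed, heavily reduced subset of the PATSTAT columns, the application IDs 100 and 200 and all person_id values are hypothetical, and only the pair 902221298 / 456788097 comes from the case described above. The first query mimics the inner JOIN on tls207_pers_appln; the second is a LEFT JOIN anti-join that lists the priority filings the inner JOIN would drop.

```python
import sqlite3

# In-memory database with a reduced, assumed subset of PATSTAT tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE tls201_appln (appln_id INTEGER PRIMARY KEY);
CREATE TABLE tls204_appln_prior (appln_id INTEGER, prior_appln_id INTEGER);
CREATE TABLE tls207_pers_appln (person_id INTEGER, appln_id INTEGER, applt_seq_nr INTEGER);

-- 100 and 200 are hypothetical; the other two IDs are from the issue.
INSERT INTO tls201_appln VALUES (100), (200), (902221298), (456788097);

-- Subsequent filing -> priority filing.
INSERT INTO tls204_appln_prior VALUES (200, 100), (456788097, 902221298);

-- Applicant rows: note there is no row for 902221298.
INSERT INTO tls207_pers_appln VALUES (1, 100, 1), (1, 200, 1), (2, 456788097, 1);
""")

# Mimics the original filter: inner JOIN keeps only priority filings
# that have an applicant row in tls207_pers_appln.
inner_join_sql = """
SELECT DISTINCT t1.appln_id
FROM tls201_appln t1
JOIN tls204_appln_prior t2 ON t1.appln_id = t2.prior_appln_id
JOIN tls207_pers_appln t6 ON t1.appln_id = t6.appln_id
WHERE t6.applt_seq_nr > 0
ORDER BY t1.appln_id
"""

# Anti-join: priority filings with no applicant row, i.e. the ones
# the inner JOIN silently excludes.
dropped_sql = """
SELECT DISTINCT t1.appln_id
FROM tls201_appln t1
JOIN tls204_appln_prior t2 ON t1.appln_id = t2.prior_appln_id
LEFT JOIN tls207_pers_appln t6
       ON t1.appln_id = t6.appln_id AND t6.applt_seq_nr > 0
WHERE t6.appln_id IS NULL
ORDER BY t1.appln_id
"""

kept = [row[0] for row in conn.execute(inner_join_sql)]
dropped = [row[0] for row in conn.execute(dropped_sql)]

print("kept by inner JOIN:", kept)
print("dropped by inner JOIN:", dropped)
```

Running the second query against the real PATSTAT tables (with the patstat. schema prefix and the original WHERE conditions added back) should give a count of how many priority filings are affected.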

Is there a way to run the codes in PATSTAT online?

Hello,

I am a master's student trying to use the code you have provided in "Imputation-of-missing-location-information-for-worldwide-patent-data" for patent analysis. I am currently using PATSTAT Online (Spring 2021 edition), but its 10,000-character limit on queries prevents me from running the code.
Hence, I was wondering whether the code can be used in PATSTAT Online, or whether I should try another approach.

Thank you very much in advance.
