magda-io / magda-ckan-connector Goto Github PK
View Code? Open in Web Editor NEWA Magda connector for CKAN data sources
License: Other
A Magda connector for CKAN data sources
License: Other
We currently crawl user info when crawl org. some ckan sources have ban this.
We seems not use this info anyway --- should just from the extra query parameter below:
magda-ckan-connector/src/Ckan.ts
Line 199 in 5112697
The following fields:
contact_point
license_title
spatial_coverage
temporal_coverage_from
temporal_coverage_to
are available on CKAN via plugin ckanext-dgu and might be available as "extra" data on CKAN instance without the plugin installed.
Those fields are critical for Magda's searching capability (especially, spatial_coverage, temporal_coverage and license_title).
The ckan connector's field mapping template should be updated to search for those fields from extra data as well.
Some data portal might create their own customised field through CKAN customisation.
e.g. this dataset https://catalogue.data.wa.gov.au/api/3/action/package_show?id=mining-tenements-dmirs-003
contains a custom field access_level
, which we might need to filter data by.
This ticket is about creating a new feature to filter datasets harvested by custom fields.
We adopted solution option 1: magda-io/magda#3100
harvest_url
& harvest_portal
fieldsSome clients may have external scripts running to aggregate data from external sources to their CKAN portal.
And store original portal source info as extra fields: harvest_url
& harvest_portal
fields:
e.g. this dataset (ckan package):
We need make ckan connector supports those fields so we can display the correct original source on Magda web UI
Seems VIC portal might respond 0 count for package_search request intermittently:
{"help":"https://discover.data.vic.gov.au/api/3/action/help_show?name=package_search","success":true,"result":{"count":0,"facets":{},"results":[],"sort":"id asc","search_facets":{}}}
The issue seems hard to fix from vic portal (seems a cache (section.io) issue)
We will retry the request when 0 count response is received in order to fix the issue.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.