zeichenkette / dgd2cmdi
dgd metadata and resource conversion to clarin cmdi
Home Page: http://fkuhn.github.io/dgd2cmdi/
License: BSD 3-Clause "New" or "Revised" License
implement a method to update the profile of a cmdi file after finalization.
identify via out_edges and splitting of file labels.
based on the jupyter notebooks used to convert the language metadata.
when building the final output, restrict this step to the corpora in the configuration yml file.
Currently, the intermediate output folders are parsed and every resource found is finalized again.
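A minimal sketch of such a restriction, under stated assumptions: the intermediate output is assumed to contain one subfolder per corpus, named after the corpus, with resources stored as .xml files. The function name resources_to_finalize and the corpus names are hypothetical and not taken from the project. The list of corpora is assumed to have already been read from the configuration yml file.

```python
from pathlib import Path


def resources_to_finalize(intermediate_root, configured_corpora):
    """Yield only the resource files that belong to configured corpora.

    Assumes (hypothetically) one subfolder per corpus under
    intermediate_root, named exactly like the corpus in the configuration.
    """
    wanted = set(configured_corpora)
    for corpus_dir in sorted(Path(intermediate_root).iterdir()):
        # Skip folders of corpora that are not listed in the configuration.
        if corpus_dir.is_dir() and corpus_dir.name in wanted:
            yield from sorted(corpus_dir.rglob("*.xml"))
```

With this filter, folders left over from earlier runs stay on disk but are no longer finalized again.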
When adding additional speaker information to a speaker element found in an event, the method speaker2event() iterates over all speaker elements found via xpath('//Speaker') in the current session of the event file and checks whether the label of each iterated speaker element matches the selected speaker. If so, it adds the additional information to the speaker element as sub-elements.
After adding the information, speaker2event() stores a triple of event, speaker, and the label of the speaker element of the event. This triple is used to check whether the speaker has already been added to the event, which prevents multiple entries of the additional information.
However, an event may hold more than one recording session, so a speaker can be part of several sessions. Under the condition described above, the additional information is written to only one matching speaker element. A second session with the same speaker will then not get the additional information.
Add an outer loop that iterates over sessions with xpath('//Session') and set the stop condition for sessions, not for events.
The method is named speaker2event_session()
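The proposed outer loop could look roughly like the sketch below. It is not the project's implementation: it uses the standard library's xml.etree.ElementTree instead of the xpath() calls mentioned above, it assumes the speaker label is stored in an attribute hypothetically named Label, and it keys the "already added" check on (event, session index, label) so that the stop condition applies per session.

```python
import copy
import xml.etree.ElementTree as ET


def speaker2event_session(event_root, event_id, speaker_label, extra_info, seen):
    """Attach extra_info to the matching Speaker element of every Session.

    seen is a set of (event, session index, speaker label) triples; it keeps
    a speaker from being enriched twice within the same session.
    Returns the number of speaker elements that were enriched.
    """
    added = 0
    # Outer loop over sessions, so each session is checked on its own.
    for index, session in enumerate(event_root.iter("Session")):
        key = (event_id, index, speaker_label)
        if key in seen:
            continue  # this session already carries the information
        for speaker in session.iter("Speaker"):
            # "Label" is an assumed attribute name for the speaker label.
            if speaker.get("Label") != speaker_label:
                continue
            for child in extra_info:
                # Copy, so each session gets its own sub-elements.
                speaker.append(copy.deepcopy(child))
            seen.add(key)
            added += 1
            break  # stop condition for this session only
    return added


# Usage with a made-up event that holds two sessions sharing speaker S1.
event = ET.fromstring(
    "<Event>"
    "<Session><Speaker Label='S1'/></Session>"
    "<Session><Speaker Label='S1'/><Speaker Label='S2'/></Session>"
    "</Event>"
)
seen = set()
first = speaker2event_session(event, "E1", "S1", [ET.Element("Age")], seen)
second = speaker2event_session(event, "E1", "S1", [ET.Element("Age")], seen)
```

Because the key includes the session, the same speaker is enriched in both sessions on the first call, and a repeated call adds nothing.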
The documentation is outdated
avoid changes to master when editing documentation and tutorial notebooks