Coder Social home page Coder Social logo

hackaton22jan's Introduction

hackaton22Jan

Goal of the hackaton

Goal of this hackaton is to create a NER algo able to recognize aircraft occurences in a document.

For instance:

"A Captain and First Officer of a CE-560XL reported..."

The main challenge is that there are no labels for the data. The onus is on you to create a dataset to train your algorithm. Your NER algo's performance will depend on the size of the supervized dataset you produce, as well as the learning strategy you choose.

Expected result:

At the end of the day, the student will deliver a Flair SequenceTagger model that can be load through the following function: SequenceTagger.load('example_path/best_model.pt')

The model will then be evaluated on my own validation dataset and the results will be compared to determine the winner of the hackaton. The validation dataset is made up of sentences that can be found in the data that is provided.

Data:

The data provided is the ASR dataset: "https://asrs.arc.nasa.gov".

You can access the data with the link provided below:

https://drive.google.com/open?id=1fqhnETNivqsV73cV-DPQXIqWFiO12mhw

Data is in pickle format, which can be used with pandas.read_pickle("path_to_pickle_file") function.

It is up to you to explore the data and decide on what to do with it in order to attain the expected results.

Good luck!

hackaton22jan's People

Contributors

albertonieto44 avatar jjover avatar ricardo-lopezlopez avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.