persephone-tools / persephone-web-api
Web API for use with the Persephone library
License: GNU Affero General Public License v3.0
Need to check whether there are memory or namespacing issues when more than one model is available to users at once.
Currently this serialization to JSON is done ad hoc in code and duplicated in several places; it should move to a schema-based approach.
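A minimal, stdlib-only sketch of what "one place" serialization could look like (the class and field names here just mirror the example responses below; in practice a dedicated schema library such as marshmallow would be the more common choice for a Flask app):

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AudioUpload:
    """Illustrative model mirroring the fields shown in the responses below."""
    id: int
    filename: str
    url: str
    in_utterances: List[int]


def serialize_audio(upload: AudioUpload) -> dict:
    """Single source of truth for the JSON shape of an audio upload."""
    return {
        "id": upload.id,
        "filename": upload.filename,
        "url": upload.url,
        "in_utterances": upload.in_utterances,
    }
```

Every endpoint that returns an audio upload would call `serialize_audio` instead of building the dict inline, so a field rename happens in exactly one place.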
Currently if you GET either of these you get a reference to where they are used in Utterances:
*** uploading WAV files ***
{
"filename": "crdo-NRU_F4_ACCOMP_PFV.1_21.wav",
"id": 1,
"in_utterances": [],
"url": "uploads/audio_uploads/crdo-NRU_F4_ACCOMP_PFV.1_21.wav"
}
*** uploading transcription files ***
{
"filename": "crdo-NRU_F4_ACCOMP_PFV.1_23.phonemes",
"id": 1,
"in_utterances": [],
"url": "uploads/text_uploads/crdo-NRU_F4_ACCOMP_PFV.1_23.phonemes"
}
This currently is not in agreement with the API specification. Should we expose this information to the user from these endpoints or not? (Incidentally this is the exact sort of issue that GraphQL solves nicely)
Similar to the test case in Persephone persephone/tests/experiments/test_na.py::test_tutorial
Some of the API parameters have changed, so the example should be updated.
For ease in deploying, see persephone-tools/persephone#131
Be able to manually specify the maximum samples parameter.
Cuts down on code needed.
Having two classes called Corpus is confusing; the DB ORM class that handles Corpus should be renamed.
Currently only the response code for successful creation (201) is included in the API specification YAML file. The cases of an invalid request (400) or a duplicate (409) are not currently in the API specification.
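A sketch of what the missing entries might look like in the spec (the path, descriptions, and exact layout are assumptions, not copied from the project's YAML file):

```yaml
paths:
  /corpus:
    post:
      responses:
        201:
          description: Corpus created
        400:
          description: Invalid request body
        409:
          description: A corpus with this name already exists
```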
Probably using alembic
Create a basic search endpoint
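A naive sketch of the search logic such an endpoint might wrap (the record shape is assumed from the upload responses shown below; a real endpoint would push the filter into the SQL query rather than filtering in Python):

```python
from typing import Dict, List


def search_utterances(records: List[Dict], query: str) -> List[Dict]:
    """Case-insensitive substring search over utterance filenames.

    `records` stands in for rows fetched from the DB.
    """
    q = query.lower()
    return [r for r in records if q in r["filename"].lower()]
```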
Consider this script to create a corpus via Curl:
# This is a very quick and dirty way to populate some initial data by calling the API.
# Note that for now IDs are hardcoded in later steps.
# TODO: process response data IDs
# Upload WAV files
echo "*** uploading WAV files ***"
curl -X POST --header 'Content-Type: multipart/form-data' --header 'Accept: application/json' --form audioFile=@crdo-NRU_F4_ACCOMP_PFV.1.wav 'http://127.0.0.1:8080/v0.1/audio'
curl -X POST --header 'Content-Type: multipart/form-data' --header 'Accept: application/json' --form audioFile=@crdo-NRU_F4_ACCOMP_PFV.3.wav 'http://127.0.0.1:8080/v0.1/audio'
curl -X POST --header 'Content-Type: multipart/form-data' --header 'Accept: application/json' --form audioFile=@crdo-NRU_F4_ACCOMP_PFV.7.wav 'http://127.0.0.1:8080/v0.1/audio'
# Upload transcriptions
echo "*** uploading transcription files ***"
curl -X POST --header 'Content-Type: multipart/form-data' --header 'Accept: application/json' --form transcriptionFile=@crdo-NRU_F4_ACCOMP_PFV.1.phonemes 'http://127.0.0.1:8080/v0.1/transcription'
curl -X POST --header 'Content-Type: multipart/form-data' --header 'Accept: application/json' --form transcriptionFile=@crdo-NRU_F4_ACCOMP_PFV.3.phonemes 'http://127.0.0.1:8080/v0.1/transcription'
curl -X POST --header 'Content-Type: multipart/form-data' --header 'Accept: application/json' --form transcriptionFile=@crdo-NRU_F4_ACCOMP_PFV.7.phonemes 'http://127.0.0.1:8080/v0.1/transcription'
# Create Utterances
echo "*** specifying utterances ***"
curl -X POST --header 'Content-Type: application/json' --header 'Accept: application/json' -d '{
"audioId": 1,
"transcriptionId": 1
}' 'http://127.0.0.1:8080/v0.1/utterance'
curl -X POST --header 'Content-Type: application/json' --header 'Accept: application/json' -d '{
"audioId": 2,
"transcriptionId": 2
}' 'http://127.0.0.1:8080/v0.1/utterance'
curl -X POST --header 'Content-Type: application/json' --header 'Accept: application/json' -d '{
"audioId": 3,
"transcriptionId": 3
}' 'http://127.0.0.1:8080/v0.1/utterance'
# Create corpus
echo "*** creating corpus ***"
curl -X POST --header 'Content-Type: application/json' --header 'Accept: application/problem+json' -d '{
"name": "Test Corpus",
"label_type": "phonemes",
"feature_type": "fbank",
"testing": [
1
],
"training": [
2
],
"validation": [
3
]
}' 'http://127.0.0.1:8080/v0.1/corpus'
Gives this:
{
"feature_type": "fbank",
"filesystem_path": "f2976bee-900f-11e8-9a5f-d8cb8acb264b",
"id": 1,
"label_type": "phonemes",
"max_samples": null,
"name": "Test Corpus",
"preprocessed": false,
"testing": [
1
],
"training": [
1
],
"validation": [
1
]
}
The testing, training, and validation values in the response are the primary keys of the DataSet DB models, NOT the utterance primary keys.
{
"filename": "crdo-NRU_F4_ACCOMP_PFV.1_16.wav",
"id": 1,
"in_utterances": [],
"url": "uploads/audio_uploads/crdo-NRU_F4_ACCOMP_PFV.1_16.wav",
"utterances": []
}
{
"filename": "crdo-NRU_F4_ACCOMP_PFV.1_18.phonemes",
"id": 1,
"in_utterances": [],
"url": "uploads/text_uploads/crdo-NRU_F4_ACCOMP_PFV.1_18.phonemes",
"utterances": []
}
In this response, in_utterances is a duplicate of utterances.
There are times where returning error information will make the frontend much easier to write. For example a mismatch of label types would be a good thing to have an error response about.
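Since the corpus example above already sends Accept: application/problem+json, a label-type mismatch could return an RFC 7807 style problem body. A sketch (the title and detail wording are illustrative, not the API's actual messages):

```python
def label_mismatch_problem(expected: str, got: str) -> dict:
    """Build an RFC 7807 style problem body for a label type mismatch."""
    return {
        "type": "about:blank",
        "title": "Label type mismatch",
        "status": 400,
        "detail": (
            f"Corpus expects label type '{expected}' "
            f"but transcription uses '{got}'."
        ),
    }
```

The frontend can then branch on `status` and show `detail` to the user instead of a generic failure message.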
Currently these are being fetched from app.config, which is not the right way to go about this.
This will make it easier to run this locally.
Currently in an ad-hoc arrangement.
Currently this parameter is not supported.
Create API for model definitions
Currently this parameter is not supported.
Say you have an audio and a transcription and you create an utterance with those ID's should a second call with the same ID's create a new utterance?
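If the answer is "no, reuse the existing one", the usual pattern is get-or-create keyed on the pair of IDs. A toy sketch (the dict stands in for a unique constraint on (audio_id, transcription_id) in the DB):

```python
from typing import Dict, Tuple


def get_or_create_utterance(
    index: Dict[Tuple[int, int], int],
    audio_id: int,
    transcription_id: int,
) -> Tuple[int, bool]:
    """Idempotent creation keyed on (audio_id, transcription_id).

    Returns the utterance id and whether it was newly created, which
    also maps naturally onto returning 201 vs 200/409 to the client.
    """
    key = (audio_id, transcription_id)
    if key in index:
        return index[key], False
    new_id = len(index) + 1
    index[key] = new_id
    return new_id, True
```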
Currently these files get written to disk and not cleared after a test run.
Prevent memory exhaustion issues that can occur from uncapped file upload sizes.
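In Flask this is normally done by setting `app.config['MAX_CONTENT_LENGTH']`, which makes oversized requests fail with 413 before the body is read. A stdlib-only sketch of the same check (the 16 MiB cap is an arbitrary example, not a project setting):

```python
MAX_UPLOAD_BYTES = 16 * 1024 * 1024  # illustrative 16 MiB cap


def upload_size_ok(content_length: int, cap: int = MAX_UPLOAD_BYTES) -> bool:
    """Return True if the declared Content-Length fits under the cap."""
    return content_length <= cap
```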
Having a real ELAN file to test against would make life much easier for development
Currently the example data has the same id duplicated in the testing, training, validation sets which is confusing and represents improper usage of the API.
It might be good to expose information about which filetypes the backend will accept.
Should we support this?
Currently for development purposes files are being served via flask which is not production ready.
Settings need to be changed in GitHub organization to allow this.
For convenience in specifying the allowable labels for a Corpus.
Currently, when you upload more than one file with the same name, a duplication check renames the new file so that nothing is overwritten; but that means the name you see when you download the file again can differ from the name of the file you uploaded. This needs some attention.
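One common fix is to separate the on-disk name from the user-facing name: store files under a generated name, keep the original filename in the DB, and restore it on download (e.g. via a Content-Disposition header). A sketch, assuming nothing about the project's current storage layout:

```python
import os
import uuid


def stored_name(original_filename: str) -> str:
    """Generate a collision-free name for storage on disk.

    The original filename should be kept as a DB column so downloads
    can present it to the user, instead of leaking the deduplicated
    on-disk name.
    """
    _, ext = os.path.splitext(original_filename)
    return uuid.uuid4().hex + ext
```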
Should duplicate files have the same ID or different IDs?
Currently some tests that really only need a DB fixture are being created via API call automation. While that's OK for integration tests it's not ideal for unit testing.
See https://github.com/klen/mixer for possible solution.
Use the serializers instead of ad-hoc approach.
Explain how the yaml file specifies the API endpoints and pipes this through to the python API handling code.
Setting up a VM with a development environment will reduce time required for contributors to get started
Needed for long running tasks
An endpoint to use existing models to transcribe audio
Currently these are scattered around and should be moved to one place in the API Flask app initialization.
Currently a lot of duplicated testing code
Currently the corpus name is used in the filename for ease of development; this isn't good practice for a variety of reasons and will need to be changed for a production release.
Make all the filesystem artifacts required to run a Persephone model.
Currently there's no support for this in the API because it's mostly dependent on implementation details of the ASR software in the backend.