Coder Social home page Coder Social logo

modelingprobabilisticreduction's Introduction

Modeling probabilistic reduction across domains with Naive Discriminative Learning

Authors: Anna Sophia Stein, Kevin Tang

This repository is the official implementation of Modeling probabilistic reduction across domains with Naive Discriminative Learning.

Abstract: The predictability of a word modulates its acoustic duration. Such probabilistic effects can compete across linguistic domains (segments, syllables and adjacent-word contexts e.g., frequent words with infrequent syllables) and across local and aggregate contexts (e.g., a generally unpredictable word in a predictable context). This study aims to tease apart competing effects using Naive Discriminative Learning, which incorporates cue competition. The model was trained on English conversational speech from the Buckeye Corpus, using words as outcomes and segments, syllables, and adjacent words as cues. The connections between cues and outcomes were used to predict acoustic word duration. Results show that a word's duration is more strongly predicted by its syllables than its segments, and a word's predictability aggregated over all contexts is a stronger predictor than its specific local contexts. Our study presents a unified approach to modeling competition in probabilistic reduction.

Requirements

To install the requirements:

pip install -r requirements.txt

Workflow

Below is a short version of what you ened to do in order to recreate my workflow. All of these files are in the /codefolder. A longer, more thourough version of the worklofw can be found in docs/workflow.txt.

The code for this thesis, especially the Python code, is by no means the most efficient way to do things. However, it represents my coding journey and what I was able to do at the time. In the future I hope to add a more concise and faster version of the current code.

  1. Create a word list for every speaker in the Buckeye corpus with buckeye_text.py.
  2. Create table with data from the Buckeye corpus for the regression analysis with regression_data.py.
  3. Create individual speaker event files with eventfilesV1.py.
  4. Correct the previous event files with correctingV1eventfiles.py.
  5. Train the NDL model with the input from 4. with trainNDL.py.
  6. Compute NDL predictors for the regression analysis with prior_activation.py.
  7. Compute speech rate per utterance with speech_rate.py.
  8. Replicate the statistical analysis with regression_analysis.Rmd

License

All source code is made available under a BSD 3-clause license. You can freely use and modify the code, without warranty, so long as you provide attribution to the authors. See LICENSE.md for the full license text.

The manuscript text is not open source. The author reserves the rights to the thesis content.

modelingprobabilisticreduction's People

Contributors

ansost avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.