Coder Social home page Coder Social logo

thesisii's Introduction

EDITING

General

  • make sure terminology is consistent
    • rank-frequency vs rank-probability relationship VS r(w) and P(w) (or f(w))
    • probabilities: indices and identities
    • MLE
    • Zipfianness
    • quantities (theoretical vs observed)
    • Subsampling (capitalised)
    • randomly sampled subcorpus vs. subsample vs. filtered subcorpus vs. ...

Chapter 1

  • re-work introduction to Chapter 1 (i.e. big picture and abstract motivation)
  • move Chomsky vs C&V (i.e. learnability) to end
  • re-work sections on Zipf (i.e. elaborate on origin & extent debates, talk about relevance, foreshadow Subsampling)
  • editing -> work through comments, ensure readability and understandability
  • add references

Chapter 2

  • editing -> comments, readability
  • Tables -> change sizes, making smaller (use resizebox and footnotesize)
  • Figures -> change sizes, making larger (use wrap figure to have text float around); add names (languages) to the subplots
  • for goodness-of-fit measures: add extreme values
  • references
  • add overall summary/conclusions for Chapter,
  • mention data availability on Github: valevo/Thesis -> folder names correspond to chapter and figure numbers, file names to language name (e.g. valevo/Thesis/Chapter 2/Figure 2.1/FI.png)
  • (make plots of MLEs and null models)
  • (share cbars and y-axes in plots)
  • note that the plots in this chapter are proper estimates
  • (hapax growth)

Chapter 3

  • editing -> comments
  • references
  • terminology: set of samples, corresponding distributions, probability estimates from individual samples, averages of probability estimates
  • terminology (also in plots): articles vs texts
  • mention other languages and the fact that observations are similar
  • Table 3.1: add column from 10*10^6
  • add to conclusion of Section 3.4 that convergence is useful for Chapter 4
  • conclusion for Chapter 3
  • elaborate on introducing paragraph of Chapter 3
  • Fano factor plot: fix y-axis

Chapter 4

  • editing -> comments
  • (re-do transitioning paragraphs from typicality to filter implementations)
  • conclusion of Filtering (algorithms & theory): what does it achieve (aka the gist) and what are the limitations? how does it generalise?
  • conclusion for Filtering (chapter as a whole): how can learnability of Zipf be assessed?

Chapter 5

  • first a list of contributions, then future work
  • re-do learnability -> integrate Chomsky vs C&V and make more rigorous -> how does learnability of Zipf generalise to learnability in the sense of C&V? how does it need ot be adapted?
  • other uses of Filtering: apply to other laws, could be used to study the connection between Zipf's law and
    1. semantics (-> could shed light on such explanations -> could help further assess normality) and
    2. n-gram distributions

thesisii's People

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.