Hi Sebastian, loved your idea for this repo. I was thinking if we can have a g

Yep, good idea. I've created one <a href="https://sebastianruder.github.io/NLP-progres

Maybe we can move to gh-pages. <a href="https://nirmalsinghania2008.github.io/NLPp

Plots may be very important, vide <a href="https://deepsense.ai/keras-or-pytorch/" rel

NLP Progress Graph about nlp-progress HOT 16 CLOSED

sebastianruder commented on April 28, 2024 6

NLP Progress Graph

from nlp-progress.

Comments (16)

sebastianruder commented on April 28, 2024 2

Hey,
I think that's a great idea! If we could include an up-to-date picture of the overall progress (in terms of new results) across all tasks, that would be awesome!
For the graph, I guess the main thing we want to track is new results that are added. As a PR can include multiple commits, it might get confusing if we ask people to include "new result" in a commit. So maybe asking to include "new result" in the title of the PR might make more sense?

from nlp-progress.

sebastianruder commented on April 28, 2024 1

Yep, good idea. I've created one here. I'll link to this more prominently.

from nlp-progress.

nirmalsinghania2008 commented on April 28, 2024

Yeah. that's right.
Also, markdown doesn't provide dynamic charts otherwise we could've had something like "time period" over which the results are added. :(

from nlp-progress.

nirmalsinghania2008 commented on April 28, 2024

Maybe we can move to gh-pages.
https://nirmalsinghania2008.github.io/NLPprogress/
Looks good in my opinion.

from nlp-progress.

sebastianruder commented on April 28, 2024

So, do you think if we add the instruction for people to include "new result" in the title of PR if the PR contains a new result, you could create a chart based on the PRs?

from nlp-progress.

nirmalsinghania2008 commented on April 28, 2024

Yep. I think so

from nlp-progress.

sebastianruder commented on April 28, 2024

👍 I've just added a note for people to include "new result" in the title (bd91d2a) and will point future PRs to that.

from nlp-progress.

stared commented on April 28, 2024

Plots may be very important, vide Performance section in Keras vs PyTorch vs plain tables.

And see also Measuring the Progress of AI Research by EFF, which you may already know.

Recreating with each addition is time consuming. At the same, if instead of MD files, data were stored in some semantic files (e.g. for human readability YAML), it would be easy to create HTML plots (as in: I am happy to create such). Jekyll has support for YAML, which I already (ab)use (for list of projects, conferences, etc).

With that we can create both a table (maybe even sortable) and plots. Plus, separating semantic (pun kind of intended) data from its presentation is a plus.

from nlp-progress.

sebastianruder commented on April 28, 2024

Hi Piotr, thanks a lot for contributing to the discussion here. I had a look at EFF initially, but opted to keep it simple for the beginning.
I agree that having plots / graphs is a good idea. Given that we're not currently breaking out things by exact date of publication at the moment, my main concern is how much value they'll add vs. increased complexity, particularly for the rarer tasks. That doesn't mean that I don't want plots; I just want to make sure we get them right. :)
If you have time, would you mind taking one task and creating a plot for it, which we can use as a reference for further discussion?

from nlp-progress.

stared commented on April 28, 2024

Sebastian, sure. I would be happy to give it a try this week.

I understand that EFF-style would be harder to maintain and collaborate on. For YAML, it won't add much complexity or confusion. Instead of tables, there will be something like:

- paper: "Blablisation of tokens"
  year: 2018
  score: 78.9
  link: https://dx.doi.org/1234.1234.1234
  implementation: https://github.com/aaa/bbb

However, how to locally build the site? (While I am familiar with Jekyll, the file structure is somewhat strangely flat). jekyll serve does not generate any index.html file. Is it some GitHub-only style?

from nlp-progress.

sebastianruder commented on April 28, 2024

You mean the site at http://nlpprogress.com/?
That's just GitHub pages. In your fork, you can just go to Settings/GitHub Pages and then just switch it on.

from nlp-progress.

stared commented on April 28, 2024

I see (i works via GitHub Pages after deleting CNAME, but due to some installation hell cannot set it up locally). I am not sure this kind of Jekyll project supports data (cf. more generic structure of a Jekyll project, as in https://github.com/stared/stared.github.io).

If I come up with sth, I will let you know (most likely via PR :)).

from nlp-progress.

sebastianruder commented on April 28, 2024

Yeah, I'm aware of Jekyll. I wanted to keep it as simple as possible for now to make maintenance and collaboration easy.
I'm open to using Jekyll, if we can still make it easy for people to contribute. Would love to see what you come up with. :)

from nlp-progress.

stared commented on April 28, 2024

@sebastianruder Added a Pull Request #64 (right now mostly for viewing; if it works, we can decide on the exact form of entries and I am happy to rewrite quite a few things).

As a side note, the project may benefit from some structure, and splitting index.md (automatically listing all pages) from README.md (not there is some UX overlap between GitHub files and the website). See: https://github.com/stared/NLP-progress/tree/feat-restructure

I didn't include these changes as they are controversial, but not necessary to introduce plots. Let me know if it is also something you would like to consider (IMHO much cleaner structure).

from nlp-progress.

sebastianruder commented on April 28, 2024

Thanks! I agree that a restructuring makes sense. Automatically listing the table of contents is also a good idea.

from nlp-progress.

stared commented on April 28, 2024

Great, so I will try to incorporate that as well.

from nlp-progress.

NLP Progress Graph about nlp-progress HOT 16 CLOSED

Comments (16)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent