Coder Social home page Coder Social logo

fruitify's Introduction

CV badge Hits

I aim to kill ๐Ÿค๐Ÿค with ๐Ÿชจ
image
๐Ÿชจ ๐Ÿ—“ ๐Ÿค do more with fewer weights? ๐Ÿค solves a problem that matters at scale?
The Wisdomify Project 2021-now Draws inductive biases from how we interpret Korean proverbs A solution to a Tip-of-the-Tongue problem
The Idiomify Project 2022-now Draws inductive biases from the science of idiom acquisition (The Idiombeddings Project) A solution to the lack of idiomatic fluency of English-as-a-Foreign-Language writers
The Politely Project 2022-now N/A, as it is currently rule-based. A solution to ...

fruitify's People

Contributors

eubinecto avatar teang1995 avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

fruitify's Issues

Implement UnalignedCrossFruit

๋ชฉํ‘œ

kor - kor
en - en

์ด๋ ‡๊ฒŒ unaligned pair๋งŒ์„ ๊ฐ€์ง€๊ณ  ํ•™์Šต์„ ํ•ด๋ณด๋Š” ๊ฒƒ.

Implement CrossLingRD

ํ•ด์•ผํ•  ๊ฒƒ?

๊ตฌ์กฐ๋Š” MonoLingRD ์™€ ์ •ํ™•ํžˆ ๋™์ผํ•˜๋‹ค. ํ•œ๊ฐ€์ง€ ๋‹ค๋ฅธ์ ์€ ์‚ฌ์šฉํ•˜๋Š” ์‚ฌ์ „ํ›ˆ๋ จ ๋ชจ๋ธ์ด mbert_mlm์ด๋ผ๋Š” ๊ฒƒ.

์ด๊ฑธ ํด๋ž˜์Šค๋ฅผ ๋”ฐ๋กœ ์ •์˜๋ฅผ ํ•ด์•ผํ•˜๋‚˜?

๋”ฐ๋กœ ์ •์˜๋ฅผ ํ•ด์„œ, ์–ด๋–ค bert๋ฅผ ์š”๊ตฌํ•˜๋Š”์ง€ ํ™•์‹คํ•˜๊ฒŒ ํ‘œ๊ธฐํ•˜๋„๋ก ํ•˜์ž.

build a dataset

find at least 5 definitions for:

  • apple
  • banana
  • strawberry
  • orange
  • grape

use the following authoritative dictionaries:

  • Oxford dict
  • Cambridge dict
  • Merriam Webster
  • Longman dict
  • Macmillan dict

์ตœ์ข… ์ปค๋ฆฌํ˜๋Ÿผ ์งœ๊ธฐ

Why?

๋‚˜๋Š” ์ฃผ์ฒด์ ์ธ ๊ต์œก์ž๊ฐ€ ๋  ๊ฒƒ์ด๋‹ค. ๋‚˜๋งŒ์ด ๊ฐ€๋ฅด์น  ์ˆ˜ ์žˆ๋Š” ๊ทธ๋Ÿฐ ๊ต์œก๊ณผ์ •์„ ๋งŒ๋“ค์–ด๋ณด๊ณ  ์‹ถ๋‹ค.

How?

์ผ๋‹จ #10 (comment) ์—ฌ๊ธฐ์— sprint 1์€ ํ•ด๊ฒฐํ–ˆ๋‹ค.
์ด์ œ ๊ณ„์†, sprint 2, sprint 3, sprint 4๋ฅผ ๋‹ฌ๋ฆฌ๋ฉฐ ๋” ์ถ”๊ฐ€ํ•ด๋‚˜๊ฐ€์ž.

์ปค๋ฆฌํ˜๋Ÿผ

  • week 1
    • inverted index & TFIDF๋กœ ๊ตฌํ˜„ํ•ด๋ณด๊ธฐ : ์žฅ์  ํŒŒ์•… / ๋ฌธ์ œ ์ธ์‹ (semantic search ๋ถˆ๊ฐ€. ์ •์˜์— ํฌํ•จ๋˜์ง€ ์•Š์€ ๋‹จ์–ด๋Š” ๊ฒ€์ƒ‰
  • week 2. Word2Vec๋กœ ๊ตฌํ˜„ํ•ด๋ณด๊ธฐ : inverted index ๋Œ€๋น„ ์žฅ์  ํŒŒ์•… / ๋ฌธ์ œ ์ธ์‹ - averaging vectors to get a sentence vector์˜ ๋‹จ์ .
  1. RNN & LSTM์œผ๋กœ ๊ตฌํ˜„ํ•ด๋ณด๊ธฐ: Word2Vec ๋Œ€๋น„ ์žฅ์  ํŒŒ์•… / ๋ฌธ์ œ ์ธ์‹ - ๋ฐ์ดํ„ฐ๊ฐ€ ๋งŽ์ด ํ•„์š”ํ•จ & ๊ทธ๋ž˜๋„ ์—ฌ์ „ํžˆ... ๊ธด ๋ฌธ์žฅ์€ ํž˜๋“ค๋‹ค.
  2. Traansformer ๋กœ ๊ตฌํ˜„ํ•ด๋ณด๊ธฐ: RNN & LSTM ๋Œ€๋น„ ์žฅ์  ํŒŒ์•… / ๋ฌธ์ œ ์ธ์‹ - ์—ฌ์ „ํžˆ ๋ฐ์ดํ„ฐ๊ฐ€ ๋งŽ์ด ํ•„์š”ํ•˜๋‹ค.
  3. BERT๋กœ ๊ตฌํ˜„ํ•ด๋ณด๊ธฐ: Transformer ๋Œ€๋น„ ์žฅ์  ํŒŒ์•… / ๋ฌธ์ œ ์ธ์‹ - ...BERT์˜ ๋ฌธ์ œ๊ฐ€ ๋ญ๋”๋ผ?
  4. GPT3 & The future of NLP .... (e.g. few-shot learning )
  5. different tasks in NLP other than RD - generation, speech recognition (ํŠนํžˆ ์ด๊ฒƒ๋„ ์ธ๊ณต์ง€๋Šฅ์ด ํ•„์š”ํ•˜๋‹ค๋Š” ๊ฒƒ),
  6. Practical tips - tokenisation, lemmatization, stemming, etc
  7. ์ด์–ด์„œ... ๋” ์ฑ„์›Œ์•ผ ํ•˜๋Š” ๊ฒƒ๋“ค.

Implement MonoFruit

Todo

Implement MonoFruit defined in fruitify/models.py; Implement its three member methods:

  • fruitify
  • forward
  • training_step

Testing (pseudo)

python3 -m fruitify.scripts.train --fruit_type="mono" --k=5 --max_epochs=10

Support

Explore the scripts in fruitfiy/examples for:

  • exploring bert_mlm
  • exploring bert_tokenizer
  • exploring the fruit2def dataset
  • implementing cross entropy in Pytorch

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.