Coder Social home page Coder Social logo

cakd3_3nd_for_better_essay's Introduction

For Better Essay ๐Ÿ“š

ํ•œ๊ตญ์–ด ๋„์„œ ๋ง๋ญ‰์น˜๋ฅผ ํ™œ์šฉํ•œ ์ธ๊ณต์ง€๋Šฅ ์„œ๋น„์Šค, ๋ฌธํ•ด๋ ฅ ํ–ฅ์ƒ ํ”„๋กœ๊ทธ๋žจ

๊ณต๋ชจ์ „ ์ตœ์šฐ์ˆ˜์ƒ(1๋“ฑ) ์ˆ˜์ƒ(๊ณต๋ชจ์ „์ˆ˜์ƒ_ref.pdf)

์ˆ˜์ƒ์ด๋ ฅ

๊ฒฐ๊ณผ / [Result]

4

- ์‹œ์—ฐ์˜์ƒ๋ฐ”๋กœ๊ฐ€๊ธฐ


์„ค๋ช… / [Description]

์ตœ๊ทผ ์ด์Šˆ๊ฐ€ ๋˜๊ณ  ์žˆ๋Š” ์–ด๋ฆฐ ์„ธ๋Œ€์˜ ๋ฌธํ•ด๋ ฅ ์ €ํ•˜ํ˜„์ƒ์ด๋ผ๋Š” ๋ฌธ์ œ์˜์‹์—์„œ ์ถœ๋ฐœํ•˜์—ฌ, ์ค‘ยท๊ณ ๋“ฑํ•™์ƒ์„ ํƒ€๊ฒŸ์œผ๋กœ ๋ฌธํ•ด๋ ฅ ํ–ฅ์ƒ์„ ์œ„ํ•ด ๋…ผ์ˆ ์„ ํฌํ•จํ•œ ๊ธ€์“ฐ๊ธฐ ํ•™์Šต ๋ฐ ๋…์„œํ•™์Šต์—์„œ ๊ฐ€์žฅ ์ž์ฃผ ์‚ฌ์šฉํ•˜๋Š” ํ•™์Šต์ธ ๋ฌธ์„œ ์š”์•ฝ ์—ฐ์Šต์„ ํ•  ์ˆ˜ ์žˆ๋Š” ์„œ๋น„์Šค๋ฅผ ์›น ์„œ๋น„์Šค๋ฅผ ์ด์šฉํ•˜์—ฌ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

ํ”„๋กœ์ ํŠธ ์ ˆ์ฐจ / [Procedure]

์‹œ๋‚˜๋ฆฌ์˜ค ์„ค๊ณ„ / [Scenario Design]

: ํ•œ๊ตญ์–ด ๋„์„œ ๋ง๋ญ‰์น˜์—์„œ ํŠน์ • ๊ธธ์ด์˜ ๋…ํ•ด ์ง€๋ฌธ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ์ดํ›„ ํ•™์Šต๋ชจ๋ธ์ด ๊ธฐ๊ณ„ ์š”์•ฝ๋ฌธ์„ ์ƒ์„ฑํ•ด์ฃผ๋ฉด, ์‚ฌ์šฉ์ž๊ฐ€ ์ž…๋ ฅํ•œ ์š”์•ฝ๋ฌธ๊ณผ ๊ธฐ๊ณ„๊ฐ€ ์š”์•ฝํ•œ ์š”์•…๋ฌธ์„ ๋น„๊ต ๋ฐ ํ‰๊ฐ€ํ•˜์—ฌ ์ •ํ™•๋„๋ฅผ ์ถœ๋ ฅํ•ฉ๋‹ˆ๋‹ค. 0.8 ์ด์ƒ์ด๋ฉด Perfect, 0.6 ์ด์ƒ 0.8 ๋ฏธ๋งŒ์ด๋ฉด Great, 0.4 ์ด์ƒ 0.6 ๋ฏธ๋งŒ์ด๋ฉด Good, 0.4 ๋ฏธ๋งŒ์ด๋ฉด Try again ์ด๋ผ๋Š” ๊ฐ’์ด ๋œจ๋„๋ก ํ•ฉ๋‹ˆ๋‹ค.

๋ฐ์ดํ„ฐ ๊ตฌ์ถ• ๋ฐ ์ „์ฒ˜๋ฆฌ / [Data Building and Preprocessing]

  1. jsonํŒŒ์ผ ํ˜•ํƒœ๋กœ ๋˜์–ด์žˆ๋Š” ๋ฐ์ดํ„ฐ์•ˆ์—์„œ ํ•„์š”ํ•œ ๋ถ€๋ถ„๋งŒ์„ ์ •์ œํ•˜์—ฌ csvํ˜•ํƒœ๋กœ ์ •๋ฆฌํ•ฉ๋‹ˆ๋‹ค.
  2. ๋ฌธ๋‹จ์ด ์•„๋‹Œ ๋ฌธ์žฅ์˜ ํ˜•ํƒœ๋กœ ๊ตฌํ˜„๋˜์–ด ์žˆ๋Š” ๋ฐ์ดํ„ฐ์˜ ๊ฒฝ์šฐ ๋ฌธ๋‹จ์˜ ํ˜•ํƒœ๋กœ ์ •์ œํ•ฉ๋‹ˆ๋‹ค.
  3. ๋‚œ์ด๋„๋ฅผ ๊ณ ๋ คํ•œ ์„œ๋น„์Šค ๊ตฌํ˜„์„ ์œ„ํ•œ ์ปฌ๋Ÿผ์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.
  4. ๋ง๋ญ‰์น˜ ๋ฐ์ดํ„ฐ๋ฅผ ์นดํ…Œ๊ณ ๋ฆฌ๋ณ„๋กœ ๊ณ ๋ คํ•˜์—ฌ ๋ณ‘ํ•ฉํ•ฉ๋‹ˆ๋‹ค.
  5. ์„œ๋น„์Šค ๊ตฌํ˜„์‹œ ๋น ๋ฅธ ๊ตฌ๋™ ์†๋„๋ฅผ ์œ„ํ•˜์—ฌ ๋ฐ์ดํ„ฐ๋ฅผ DB์˜ ํ˜•ํƒœ๋กœ ์ •์ œํ•ฉ๋‹ˆ๋‹ค.

์‚ฌ์ „ ํ•™์Šต / [ET5]

: ๋Œ€์šฉ๋Ÿ‰ ์›์‹œ ํ…์ŠคํŠธ๋กœ๋ถ€ํ„ฐ ๋นˆ์นธ ๋‹จ์–ด์—ด ๋งž์ถ”๊ธฐ(T5 ํ•™์Šต ์œ ํ˜•)์™€ ๋‹ค์Œ ๋‹จ์–ด ๋งž์ถ”๊ธฐ(GPT ํ•™์Šต ์œ ํ˜•)๋ฅผ ๋™์‹œ์— ์‚ฌ์ „ํ•™์Šต(pre-train)ํ•˜์—ฌ ์–ธ์–ด ์ดํ•ด์™€ ์–ธ์–ด ์ƒ์„ฑ ๋Šฅ๋ ฅ์„ ํ–ฅ์ƒ ์‹œํ‚จ ETRI์—์„œ ๊ฐœ๋ฐœํ•œ ํ•œ๊ตญ์–ด ์ดํ•ด์ƒ์„ฑ ์–ธ์–ด๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค.
: ์–ธ์–ด ๋ชจ๋ธ์€ ETRI์˜ ET5 ํ•™์Šต ๋ฐ์ดํ„ฐ ์‚ฌ์šฉ์‹ ์ฒญ ํŽ˜์ด์ง€๋ฅผ ํ†ตํ•ด ์‚ฌ์šฉํ—ˆ๊ฐ€ํ˜‘์•ฝ์„œ๋ฅผ ์ž‘์„ฑํ•œ ํ›„ ๋‹ค์šด๋กœ๋“œ ํ•˜์—ฌ ์‚ฌ์šฉ๋ฐ›์•˜์œผ๋ฉฐ, ์ œ 3์ž์—๊ฒŒ ๋ฌด๋‹จ ๊ณต์œ ๊ฐ€ ๋ถˆ๊ฐ€๋Šฅํ•˜์—ฌ Github ์ €์žฅ์†Œ์—๋Š” ์—…๋กœ๋“œํ•˜์ง€ ์•Š์•˜์Šต๋‹ˆ๋‹ค.
: https://aiopen.etri.re.kr/service_dataset.php

ํŒŒ์ธํŠœ๋‹ / [Fine Tuning]

: ์‚ฌ์ „ ํ•™์Šต๋œ ET5๋ฅผ ์ฃผ์–ด์ง„ ๋ฌธ์„œ์˜ ํŠน์ง•์— ๋งž๋Š” ์š”์•ฝ๋ฌธ์„ ์ƒ์„ฑํ•˜๋„๋ก ํŒŒ์ธ ํŠœ๋‹์„ ์ง„ํ–‰ํ•˜์˜€์Šต๋‹ˆ๋‹ค. ํŒŒ์ธํŠœ๋‹์€ AI hub ๊ฐœ๋ฐฉ๋ฐ์ดํ„ฐ์˜ ๋ฌธ์„œ์š”์•ฝ ํ…์ŠคํŠธ(์•ฝ 300์ž~1์ฒœ์ž๋กœ ์ด๋ฃจ์–ด์ง„ ๋ฌธ๋‹จ 20๋งŒ๊ฑด๊ณผ ๊ฐ๊ฐ์˜ ๋ฌธ๋‹จ์— ๋Œ€ํ•œ ์š”์•ฝ๋ฌธ 20๋งŒ๊ฑด์œผ๋กœ ์ด๋ฃจ์–ด์ง„ ๋ฐ์ดํ„ฐ) ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ด๋ฃจ์–ด์กŒ์Šต๋‹ˆ๋‹ค.

ํ‰๊ฐ€๋ชจ๋ธ / [BERT SCORE]

: ๋ฌธ์žฅ ์š”์•ฝ ํ‰๊ฐ€์— ๊ด€์Šต์ ์œผ๋กœ ROUGE score๋ฅผ ๋งŽ์ด ์‚ฌ์šฉํ•˜์ง€๋งŒ ROUGE๋Š” exact match๋กœ ํ‰๊ฐ€ํ•ด ๋ฌธ๋งฅ์„ ํ‰๊ฐ€ํ•˜์ง€ ๋ชปํ•ฉ๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ๋‹จ์–ด๊ฐ„์˜ ์ฝ”์‚ฌ์ธ ์œ ์‚ฌ๋„๋ฅผ ํ†ตํ•ด F1 score๋ฅผ ์‚ฐ์ถœํ•˜๊ธฐ ๋•Œ๋ฌธ์— ๋ฌธ์žฅ๊ฐ„์˜ ๋ฌธ๋งฅ์„ ํ‰๊ฐ€ํ•  ์ˆ˜ ์žˆ๋Š” BertScore๋ฅผ ์‚ฌ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค.
: BERT_Score์˜ ๊ธฐ๋ณธ ๊ตฌ๋™์›๋ฆฌ๋Š” ์ฐธ์กฐ ๋ฌธ์žฅ๊ณผ ๋ชจ๋ธ์ด ์ƒ์„ฑํ•œ ๋ฌธ์žฅ์„ contextual embeddingํ•˜์—ฌ ์ฝ”์‚ฌ์ธ ์œ ์‚ฌ๋„๋ฅผ ๊ตฌํ•˜๊ณ , ์ดํ›„ Greedy matching์„ ํ†ตํ•ด ๊ฐ€์žฅ ๋†’์€ ์œ ์‚ฌ๋„๋ฅผ ๊ฐ€์ง„ ๋ฒกํ„ฐ๋ฅผ ๋ฝ‘์•„ F1 score๋ฅผ ๊ตฌํ•ฉ๋‹ˆ๋‹ค.
: ์ €ํฌ ๋ชจ๋ธ์˜ bert score๋Š” 0.8391์ž…๋‹ˆ๋‹ค.

์›น์„œ๋น„์Šค ๊ตฌํ˜„ / [Web service Implementation]

< Main >

๋„ค ๊ฐ€์ง€์˜ ์ฃผ์ œ๋ฅผ ์„ ํƒํ•˜๊ฑฐ๋‚˜, โ€˜์‹ค๋ ฅUPโ€™์„ ์„ ํƒํ•˜์—ฌ ์‹ฌํ™” ๋ฒ„์ „์„ ํ•™์Šตํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

< Topic>

๋„ค ๊ฐ€์ง€์˜ ์ฃผ์ œ๋ณด๋‹ค ์ข€ ๋” ๋‹ค์–‘ํ•œ ์ฃผ์ œ๋ฅผ ์•ˆ๋‚ดํ•ฉ๋‹ˆ๋‹ค.

< Summary >

์‹ค์งˆ์ ์œผ๋กœ ๋ฌธ๋‹จ์„ ๋ณด๊ณ  ์‚ฌ์šฉ์ž๊ฐ€ ์ž‘์„ฑํ•œ ์š”์•ฝ๋ฌธ์„ ํ‰๊ฐ€ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
์ƒ๋‹จ์— 4๊ฐ€์ง€์˜ ์ฃผ์ œ ๋ฐ ์‹ฌํ™” ๋ฌธ๋‹จ์œผ๋กœ ์š”์•ฝ๋ฌธ์„ ํ•™์Šตํ•  ์ˆ˜ ์žˆ๋Š” ๋ฒ„ํŠผ์ด ์žˆ์Šต๋‹ˆ๋‹ค. 4๊ฐ€์ง€์˜ ์ฃผ์ œ๋Š” ์ฃผ์ œ๋ณ„ 600์ž ๋ฏธ๋งŒ์˜ ๊ธ€๋“ค๋กœ ๊ตฌ์„ฑ๋˜์–ด ์žˆ๊ณ , ์‹ฌํ™” ๊ธ€์“ฐ๊ธฐ๋Š” ๋žœ๋ค ์ฃผ์ œ๋กœ 600์ž ์ด์ƒ์˜ ๊ธ€๋“ค๋กœ ๊ตฌ์„ฑ๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค.
์‚ฌ์šฉ์ž๋Š” ์ฃผ์ œ๋ณ„๋กœ ์ถœ๋ ฅ๋œ ๋žœ๋ค ๋ฌธ๋‹จ์„ ๋ณด๊ณ  ์‚ฌ์šฉ์ž ์š”์•ฝ๋ฌธ์„ ์ง์ ‘ ์ž‘์„ฑํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ด ๋•Œ, ํƒ€์ด๋จธ๋ฅผ ์ด์šฉํ•˜์—ฌ ํ•™์Šต ์‹œ๊ฐ„์„ ์žด ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
์š”์•ฝ๋ฌธ์„ 60์ž ์ด์ƒ, 200์ž ์ด๋‚ด๋กœ ์ถฉ๋ถ„ํžˆ ์ž‘์„ฑ ํ›„ โ€˜์ œ์ถœํ•˜๊ธฐโ€™๋ฅผ ๋ˆ„๋ฅด๋ฉด ๊ธฐ๊ณ„์š”์•ฝ๋ฌธ๊ณผ ํ•จ๊ป˜ ํ‰๊ฐ€๋œ ์Šค์ฝ”์–ด๊ฐ€ ์ถœ๋ ฅ๋ฉ๋‹ˆ๋‹ค.

< User & Settings >

์‚ฌ์šฉ์ž์˜ ์ •๋ณด์™€ ์›น ์„œ๋น„์Šค์˜ ์—ฌ๋Ÿฌ๊ฐ€์ง€ ์„ค์ •๋“ค์„ ์กฐ์ ˆํ•˜๊ฑฐ๋‚˜ ํ™•์ธํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

cakd3_3nd_for_better_essay's People

Contributors

0oong avatar hye-jj avatar hharimjung avatar seuly1203 avatar sumunoh avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.