๋ชฉ๋ก2024/03/20 (2)

SJ_Koding

GPT-2์— ๋Œ€ํ•ด ์ดํ•ดํ•ด๋ณด์ž (GPT 2ํŽธ)

GPT-1์— ๋Œ€ํ•ด ์ดํ•ดํ•ด๋ณด์ž (GPT 1ํŽธ) Chat GPT์˜ ์‹œ์ดˆ, GPT-1 ๋ถ€ํ„ฐ ์ฐจ๊ทผ์ฐจ๊ทผ ์•Œ์•„๋ณด์ž (๋ณธ ํฌ์ŠคํŒ…์€ AI์—…๊ณ„์—์„œ ์œ ๋ช…ํ•˜์‹  ํ—ˆ๋ฏผ์„ ๊ฐœ๋ฐœ์ž๋‹˜์˜ ์œ ํŠœ๋ธŒ GPT-1(๋ฐ‘๋ฐ”๋‹ฅ๋ถ€ํ„ฐ ์•Œ์•„๋ณด๋Š” GPT) ๊ฐ•์˜๋ฅผ ์ฐธ๊ณ ํ–ˆ์Šต๋‹ˆ๋‹ค.) What is GPT? Generative Pre Training of a la sjkoding.tistory.com ์ƒ์œ„ ํฌ์ŠคํŒ…์— ์ด์–ด์ง„ ๋‚ด์šฉ์ด๋‹ค. ์ด๋ฒˆ ํฌ์ŠคํŒ… ์—ญ์‹œ ํ—ˆ๋ฏผ์„๋‹˜์˜ ์œ ํŠœ๋ธŒ ๊ฐ•์˜๋ฅผ ์ฐธ๊ณ ํ•˜์˜€๋‹ค. GPT-1์˜ ๋‹จ์  "์–ด์จŒ๋“  fine tuning ๊ณผ์ •์ด ํ•„์š”ํ•˜๋‹ค" ์ด๋ฅผ ํ•ด๊ฒฐํ•œ ๊ฒƒ์ด GPT-2์ด๋‹ค. GPT-2๋Š” ์ด fine tuning ๊ณผ์ •์„ ์•„์˜ˆ ์—†์•ด๋‹ค. ์ฆ‰ ์œ„ ๊ทธ๋ฆผ์ฒ˜๋Ÿผ GPT-2์—์„œ Task๋ณ„๋กœ ๋ณ„๋„์˜ Fine tuning์ด ํ•„์š”ํ•˜์ง€ ์•Š๋‹ค๋Š” ์˜๋ฏธ์ด๋‹ค. ๊ทธ๋ฆฌ๊ณ  GPT-2์˜ ..

LLM 2024. 3. 20. 19:41
GPT-1์— ๋Œ€ํ•ด ์ดํ•ดํ•ด๋ณด์ž (GPT 1ํŽธ)

Chat GPT์˜ ์‹œ์ดˆ, GPT-1 ๋ถ€ํ„ฐ ์ฐจ๊ทผ์ฐจ๊ทผ ์•Œ์•„๋ณด์ž (๋ณธ ํฌ์ŠคํŒ…์€ AI์—…๊ณ„์—์„œ ์œ ๋ช…ํ•˜์‹  ํ—ˆ๋ฏผ์„ ๊ฐœ๋ฐœ์ž๋‹˜์˜ ์œ ํŠœ๋ธŒ GPT-1(๋ฐ‘๋ฐ”๋‹ฅ๋ถ€ํ„ฐ ์•Œ์•„๋ณด๋Š” GPT) ๊ฐ•์˜๋ฅผ ์ฐธ๊ณ ํ–ˆ์Šต๋‹ˆ๋‹ค.) What is GPT? Generative Pre Training of a language model (GPT)์˜ ์•ฝ์ž, ์—ฌ๊ธฐ์„œ ๋งํ•˜๋Š” language model๋ถ€ํ„ฐ ์ดํ•ดํ•ด๋ณด์ž. ๊ตฌ๊ธ€์ด๋‚˜ ์œ ํŠœ๋ธŒ๋ฅผ ๊ฒ€์ƒ‰ํ•  ๋•Œ, ์–ด๋–ค ๋‹จ์–ด๋ฅผ ์ž…๋ ฅํ•˜๋ฉด ๋‹ค์Œ ๋‹จ์–ด๊ฐ€ ์ถ”์ฒœ๋˜๋Š” ๊ฒƒ์„ ์ž์ฃผ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค. ex) ์ž…๋ ฅ: GPT ์ถ”์ฒœ: GPT ์‚ฌ์šฉ๋ฒ•, GPT-4, GPT ์œ ๋ฃŒ, ... ๋“ฑ๋“ฑ language model์€ ์œ„ ์˜ˆ์‹œ์ฒ˜๋Ÿผ ํ˜„์žฌ ํ† ํฐ์„ ๊ฐ€์ง€๊ณ  ๋‹ค์Œ ํ† ํฐ์„ ์˜ˆ์ธกํ•˜๋Š” ํ–‰์œ„๋„ ๊ฐ€๋Šฅํ•˜๋‹ค. ์ด๋•Œ Language model์˜ ์žฅ์ ์€ ํŠน๋ณ„ํ•œ ๋ผ๋ฒจ๋ง์ด ํ•„์š” ์—†..

LLM 2024. 3. 20. 10:22