KAIST School of Computing, 카이스트 전산학부 - 컴퓨터공학과 컴퓨터과학(Computer Science & Engineering)을 교육하고 연구하는 국내 최대 규모의 전산학부

Directions

navi

[Seminar] ˝Large Language Models (in 2023)˝ by Dr. Hyung Won Chung (OpenAI) 11／22 (Wed.) 4pm

Wed, Nov 22, 2023 @ 16:00
Dr. Hyung Won Chung (OpenAI)
Seminar

Time: November 22, 2023 (Wed.) 4pm

Location: Terman Hall, Building E11

Zoom
https://kaist.zoom.us/j/88577386005?pwd=eTZxMGhpc1g1bk4wVVkvWlgxZHM5QT09
Meeting ID: 885 7738 6005
Password: 391500

Title: Large Language Models (in 2023)

Abstract: There is one unique aspect of large language models (LLMs): larger models exhibit abilities that were not present in the smaller models. These emergent abilities have far-reaching consequences in how we should work in the field of AI. I will share some of my observations on the implications of scaling and emergent abilities. After that, I will introduce multiple stages involved in the current generations of LLM training:: pre-training and post-training (including instruction fine-tuning and RLHF). While a huge volume of research exists for each stage, the core aspects can be expressed relatively simply. I will introduce the fundamental aspects of each stage and discuss the unique challenges they pose.
Bio: Hyung Won is a research scientist at OpenAI ChatGPT team. He has worked on various aspects of Large Language Models: pre-training, instruction fine-tuning, reinforcement learning with human feedback, reasoning, multilinguality, parallelism strategies, etc. Some of the notable work includes scaling Flan paper (Flan-T5, Flan-PaLM) and T5X, the training framework used to train the PaLM language model. Before OpenAI, he was at Google Brain and before that he received a PhD from MIT.

Location: online(zoom)
Posted By: 관리자

한국과학기술원(KAIST) 전산학부 34141 대전광역시 유성구 대학로 291(구성동373-1)

로그인

개인정보처리방침