Directions
navi
Wed, Nov 22, 2023 @ 16:00
Dr. Hyung Won Chung (OpenAI)
Seminar
Time: November 22, 2023 (Wed.) 4pm
Location: Terman Hall, Building E11
Zoom
https://kaist.zoom.us/j/88577386005?pwd=eTZxMGhpc1g1bk4wVVkvWlgxZHM5QT09
Meeting ID: 885 7738 6005
Password: 391500
Title: Large Language Models (in 2023)
Abstract: There is one unique aspect of large language models (LLMs): larger models exhibit abilities that were not present in the smaller models. These emergent abilities have far-reaching consequences in how we should work in the field of AI. I will share some of my observations on the implications of scaling and emergent abilities. After that, I will introduce multiple stages involved in the current generations of LLM training:: pre-training and post-training (including instruction fine-tuning and RLHF). While a huge volume of research exists for each stage, the core aspects can be expressed relatively simply. I will introduce the fundamental aspects of each stage and discuss the unique challenges they pose.
Bio: Hyung Won is a research scientist at OpenAI ChatGPT team. He has worked on various aspects of Large Language Models: pre-training, instruction fine-tuning, reinforcement learning with human feedback, reasoning, multilinguality, parallelism strategies, etc. Some of the notable work includes scaling Flan paper (Flan-T5, Flan-PaLM) and T5X, the training framework used to train the PaLM language model. Before OpenAI, he was at Google Brain and before that he received a PhD from MIT.
Location: online(zoom)
Posted By: 관리자