KAIST - COMPUTER SCIENCE

  • korea
  • search
  • login

Directions

[Seminar] ˝Large Language Models (in 2023)˝ by Dr. Hyung Won Chung (OpenAI) 11/22 (Wed.) 4pm

Wed, Nov 22, 2023 @ 16:00
Dr. Hyung Won Chung (OpenAI)
Seminar

Time: November 22, 2023 (Wed.) 4pm

Location: Terman Hall, Building E11

Zoom
https://kaist.zoom.us/j/88577386005?pwd=eTZxMGhpc1g1bk4wVVkvWlgxZHM5QT09
Meeting ID: 885 7738 6005
Password: 391500


Title: Large Language Models (in 2023)

Abstract: There is one unique aspect of large language models (LLMs): larger models exhibit abilities that were not present in the smaller models. These emergent abilities have far-reaching consequences in how we should work in the field of AI. I will share some of my observations on the implications of scaling and emergent abilities. After that, I will introduce multiple stages involved in the current generations of LLM training:: pre-training and post-training (including instruction fine-tuning and RLHF). While a huge volume of research exists for each stage, the core aspects can be expressed relatively simply. I will introduce the fundamental aspects of each stage and discuss the unique challenges they pose.
Bio: Hyung Won is a research scientist at OpenAI ChatGPT team. He has worked on various aspects of Large Language Models: pre-training, instruction fine-tuning, reinforcement learning with human feedback, reasoning, multilinguality, parallelism strategies, etc. Some of the notable work includes scaling Flan paper (Flan-T5, Flan-PaLM) and T5X, the training framework used to train the PaLM language model. Before OpenAI, he was at Google Brain and before that he received a PhD from MIT.

Location: online(zoom)
Posted By: 관리자

list