"Iterative Refinement" 태그로 연결된 1개 게시물개의 게시물이 있습니다.

Self-Refine - Iterative Refinement with Self-Feedback

2023년 12월 10일 · 약 4분

Aman Madaan

Niket Tandon

Prakhar Gupta

Skyler Hallinan

Luyu Gao

Sarah Wiegreffe

Uri Alon

Nouha Dziri

Shrimai Prabhumoye

Yiming Yang

Shashank Gupta

Bodhisattwa Prasad Majumder

Katherine Hermann

Sean Welleck

Amir Yazdanbakhsh

Peter Clark

논문 정보

제목: Self-Refine: Iterative Refinement with Self-Feedback
저자: Aman Madaan (Carnegie Mellon University), Niket Tandon (Allen Institute for AI), Prakhar Gupta (Carnegie Mellon University), Skyler Hallinan (University of Washington), Luyu Gao (Carnegie Mellon University), Sarah Wiegreffe (Allen Institute for AI), Uri Alon (Carnegie Mellon University), Nouha Dziri (Allen Institute for AI), Shrimai Prabhumoye (NVIDIA), Yiming Yang (Carnegie Mellon University), Shashank Gupta (Allen Institute for AI), Bodhisattwa Prasad Majumder (Allen Institute for AI), Katherine Hermann (Google Research), Sean Welleck (Carnegie Mellon University), Amir Yazdanbakhsh (Google Research), Peter Clark (Allen Institute for AI)
학회/저널: NeurIPS 2023 (Thirty-seventh Conference on Neural Information Processing Systems)
발행일: 2023-12-10
DOI: arXiv:2303.17651
주요 연구 내용: 단일 LLM을 생성기, 피드백 제공자, 개선기로 동시에 활용하여 초기 출력을 반복적으로 개선하는 Self-Refine 프레임워크 제안
주요 결과 및 결론: 7개의 다양한 태스크에서 평균 약 20%의 절대적 성능 향상을 달성하며, 추가 학습 데이터나 강화학습 없이 GPT-4와 같은 최신 모델의 성능도 테스트 시간에 추가로 향상 가능함을 입증
기여점: 별도의 학습 없이 단일 LLM만으로 출력 품질을 반복적으로 개선할 수 있는 범용적 프레임워크를 제안하여, LLM의 활용 패러다임에 새로운 방향을 제시

논문 정보​

논문 정보