> ## Documentation Index
> Fetch the complete documentation index at: https://wb-21fd5541-wbdocs-1882.mintlify.site/llms.txt
> Use this file to discover all available pages before exploring further.

# W&B Training

> 강화 학습과 지도 파인튜닝을 사용해 모델을 포스트 트레이닝하세요

현재 퍼블릭 프리뷰로 제공되는 W\&B Training은 대규모 언어 모델(LLM)을 위한 서버리스 포스트 트레이닝을 지원하며, 강화 학습(RL)과 지도 파인튜닝(SFT)을 모두 포함합니다.

* **[Serverless RL](/ko/training/serverless-rl)**: 속도는 높이고 비용은 줄이면서, 여러 턴에 걸친 에이전트형 작업을 수행하는 모델의 신뢰성을 향상합니다. RL은 모델이 출력에 대한 피드백을 바탕으로 동작을 개선하도록 학습하는 트레이닝 기법입니다.
* **[Serverless SFT](/ko/training/sft-training)**: 증류, 출력 스타일 및 형식 학습, 또는 RL 전 워밍업을 위해 선별된 데이터셋으로 모델을 파인튜닝합니다.

W\&B Training은 다음과 인테그레이션됩니다:

* [ART](https://art.openpipe.ai/getting-started/about): 유연한 파인튜닝 프레임워크
* [RULER](https://openpipe.ai/blog/ruler): 범용 검증기
* [CoreWeave Cloud](https://docs.coreweave.com/docs/platform)에서 제공되는 완전관리형 백엔드

시작하려면 먼저 서비스 사용에 필요한 [사전 요구 사항](/ko/training/prerequisites)을 충족한 뒤, [Serverless RL quickstart](https://art.openpipe.ai/getting-started/quick-start) 또는 [Serverless SFT docs](https://art.openpipe.ai/fundamentals/sft-training)를 참조해 모델을 포스트 트레이닝하는 방법을 알아보세요.