๐Ÿ–‹๏ธ
noviceforever
Ctrlk
  • About me
  • Miscellaneous
    • Introduction
  • Machine Learning
    • Tabular Data
    • Computer Vision (CNN-based)
    • Computer Vision (Transformer-based)
    • Natural Language Processing
    • Recommendation System
    • Reinforcement Learning
    • IoT on AWS
    • Distributed Training
    • Deployment
  • AWS AIML
    • Amazon Personalize
    • Amazon Bedrock AgentCore
    • Customer Support
  • GenAI
    • Theory
    • Synthetic Data
    • MoE (Mixture-of-Experts)
    • Open Source SLM-Based Hybrid Agent AI Architecture
    • Fine-tuning
    • LLM Evaluation
      • Overview
      • ํ•œ๊ตญ์–ด LLM ํ‰๊ฐ€์˜ ๋‚œ์ œ
      • [Paper review] KMMLU/KMMLU-Redux/KMMLU-Pro Dataset
      • [Paper review] FunctionChat-Bench
      • ํ˜ธ๋ž‘์ด ํ•œ๊ตญ์–ด LLM ๋ฆฌ๋”๋ณด๋“œ
Powered by GitBook
Page cover
On this page
  1. GenAI

LLM Evaluation

Overviewํ•œ๊ตญ์–ด LLM ํ‰๊ฐ€์˜ ๋‚œ์ œ[Paper review] KMMLU/KMMLU-Redux/KMMLU-Pro Dataset[Paper review] FunctionChat-Benchํ˜ธ๋ž‘์ด ํ•œ๊ตญ์–ด LLM ๋ฆฌ๋”๋ณด๋“œ
Previous[Use-case w/ Hands-on] Azure ML Python SDK ๋ฐ MLflow๋ฅผ ํ™œ์šฉํ•œ Florence-2 ๋ชจ๋ธ ํŒŒ์ธ ํŠœ๋‹NextOverview

Last updated 2 months ago