Vertex AI Evaluation Service Tutorial Notebooks → https://goo.gle/4i7vdxl
How do developers know whether their AI applications are working effectively, and how can they measure that performance? In this episode of Real Terms for AI, Googlers Aja Hammerly and Jason Davenport cover creating golden datasets, defining essential metrics, and using tools to measure any AI application’s performance.
Chapters:
0:00 – Welcome
0:34 – Evaluating models versus evaluating apps
1:31 – Grounding
2:17 – Sources of evaluation data
3:47 – Define metrics and evaluation
5:07 – Analyzing and understanding metrics
6:19 – Ongoing evaluation
7:48 – Summary
Watch more Real Terms for AI → https://goo.gle/AIwordsExplained
Subscribe to Google Cloud Tech → https://goo.gle/GoogleCloudTech
#GoogleCloud #GenerativeAI
Speakers: Aja Hammerly, Jason Davenport
Products Mentioned: Gemini, Cloud General, Vertex AI