Not so Prompt: Prompt Optimization as Model Selection
Here's a framework for prompt optimization:Defining Success: Metrics and Evaluation CriteriaBefore collecting any data, establish what success looks like for your specific use case. Choose a primary metric that directly reflects business value—accuracy for classification, F1 for imbalanced datasets, BLEU/ROUGE for generation tasks, or custom domain-specific measures like "percentage of correctly extracted invoice fields" or "customer issue resolution rate." This primary metric drives optimizatio...
Read more at gojiberries.io