procedure
In the paper 'Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment', the authors propose a sample-efficient reinforcement learning approach that dynamically adapts the loss function during training so that it directly optimizes the evaluation metric.
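The general idea can be illustrated with a toy sketch. This is not the authors' algorithm, only a minimal stand-in under stated assumptions: the "loss function" is a logistic loss with a single adjustable positive-class weight, the evaluation metric is F1 on a validation set, and a three-action softmax policy trained with a REINFORCE-style update decides whether to lower, keep, or raise that weight. All names here (`make_data`, `train_with_adaptive_loss`, the step sizes and clipping range) are hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def make_data(rng, n, pos_rate=0.2):
    # Hypothetical imbalanced 1-D data: positives centered at +1, negatives at -1.
    data = []
    for _ in range(n):
        y = 1 if rng.random() < pos_rate else 0
        x = rng.gauss(1.0 if y == 1 else -1.0, 1.0)
        data.append((x, y))
    return data


def weighted_logloss_grad(w, b, data, pos_weight):
    # Gradient of the class-weighted logistic loss for p = sigmoid(w*x + b).
    gw = gb = 0.0
    for x, y in data:
        scale = pos_weight if y == 1 else 1.0
        err = scale * (sigmoid(w * x + b) - y)
        gw += err * x
        gb += err
    return gw / len(data), gb / len(data)


def f1_score(w, b, data):
    # The evaluation metric: non-differentiable, so it cannot be the loss itself.
    tp = fp = fn = 0
    for x, y in data:
        pred = 1 if w * x + b > 0 else 0
        if pred == 1 and y == 1:
            tp += 1
        elif pred == 1 and y == 0:
            fp += 1
        elif pred == 0 and y == 1:
            fn += 1
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def train_with_adaptive_loss(train, val, steps=200, seed=0, lr=0.5, policy_lr=2.0):
    rng = random.Random(seed)
    w = b = 0.0
    log_pos_weight = 0.0           # loss parameter the controller adjusts
    deltas = [-0.2, 0.0, 0.2]      # actions: decrease, keep, increase
    logits = [0.0, 0.0, 0.0]       # softmax policy over the three actions
    baseline = 0.0                 # running reward baseline to reduce variance
    prev_metric = f1_score(w, b, val)
    for _ in range(steps):
        probs = softmax(logits)
        action = rng.choices(range(3), weights=probs)[0]
        log_pos_weight = min(3.0, max(-3.0, log_pos_weight + deltas[action]))
        # A few gradient steps on the model under the current loss.
        for _ in range(5):
            gw, gb = weighted_logloss_grad(w, b, train, math.exp(log_pos_weight))
            w -= lr * gw
            b -= lr * gb
        # Reward is the change in the validation metric.
        metric = f1_score(w, b, val)
        reward = metric - prev_metric
        prev_metric = metric
        advantage = reward - baseline
        baseline = 0.9 * baseline + 0.1 * reward
        # REINFORCE update: d/d(logit_i) log pi(action) = 1[i == action] - probs[i].
        for i in range(3):
            indicator = 1.0 if i == action else 0.0
            logits[i] += policy_lr * advantage * (indicator - probs[i])
    return w, b, math.exp(log_pos_weight), prev_metric


if __name__ == "__main__":
    data_rng = random.Random(1)
    train_set = make_data(data_rng, 400)
    val_set = make_data(data_rng, 200)
    w, b, pos_weight, f1 = train_with_adaptive_loss(train_set, val_set)
    print(f"learned pos_weight={pos_weight:.2f}, validation F1={f1:.3f}")
```

The point of the sketch is the structure of the loop: the model is trained on a differentiable loss, while a separate controller is rewarded by the metric that actually matters and reshapes the loss accordingly. The paper's own state representation, loss parameterization, and update rule should be taken from the paper itself.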
Authors
Sources
- Evaluating Evaluation Metrics — The Mirage of Hallucination ... machinelearning.apple.com via serper
Referenced by nodes (2)
- reinforcement learning concept
- evaluation metrics concept