Ask HN: How are people doing AI evals these days?
With all the buzz around new AI models being released (what feels like every other week), how are companies running internal AI evals to determine which model is best for their use case?