A model update fixes something, but an old skill gets worse.
Model regression is like a game update that adds laser swords but breaks the jump button.
You meet it after model updates or fine-tuning. Teams rerun old tests before launch to catch it.
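Rerunning old tests can be sketched in a few lines of Python. This is only an illustration, not a real tool: the function name find_regressions, the two toy models, and the exact-match scoring are all assumptions made for the example. The idea is to flag every old case that the previous model passed and the new model now fails.

```python
from typing import Callable, List, Tuple

Model = Callable[[str], str]  # hypothetical interface: prompt in, answer out


def find_regressions(
    cases: List[Tuple[str, str]], old_model: Model, new_model: Model
) -> List[str]:
    """Return prompts the old model answered correctly but the new one does not."""
    regressions = []
    for prompt, expected in cases:
        old_ok = old_model(prompt) == expected  # exact match is a simplifying assumption
        new_ok = new_model(prompt) == expected
        if old_ok and not new_ok:
            regressions.append(prompt)
    return regressions


# Toy stand-ins for two model versions (made up for this example).
def old_model(prompt: str) -> str:
    answers = {"2+2": "4", "capital of France": "Paris"}
    return answers.get(prompt, "?")


def new_model(prompt: str) -> str:
    # The update learns a new skill (translation) but breaks arithmetic.
    answers = {"2+2": "5", "capital of France": "Paris", "translate 'hola'": "hello"}
    return answers.get(prompt, "?")


cases = [
    ("2+2", "4"),
    ("capital of France", "Paris"),
    ("translate 'hola'", "hello"),
]

regressed = find_regressions(cases, old_model, new_model)
print(regressed)
```

Here the new model gains the translation skill, yet the arithmetic case is flagged because it used to pass. A real setup would likely use a fuzzier score than exact match and block the launch if the list is not empty.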
LLMOps
LLMOps uses version monitoring to spot model regression.
Third-party AI evaluation
Third-party AI evaluation can catch regressions the team missed.
AI QA Testing
AI QA Testing reruns old cases before launch to block regressions.
Fine-tuning
Fine-tuning can improve a new skill and hurt an old one.