Failure prediction models: performance, disagreements, and internal rating systems