Draft: rework the management of inconsistent results
When a job runs more than once in the pipeline, the results of all runs are processed and collected together. A test whose result differs between runs is considered a duplicated result. A test that fails in some runs but not in all of them is considered a flake.
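The classification described above could be sketched roughly as follows. This is only an illustration, not the code in this change: the function name `classify_results`, the input shape (one dict per job run, mapping test name to result) and the result strings `"pass"`, `"fail"` and `"skip"` are all assumptions made for the example.

```python
from collections import defaultdict


def classify_results(runs):
    """Group per-run results by test and flag the inconsistent ones.

    `runs` is a hypothetical input shape: a list with one dict per job run,
    each mapping a test name to its result string.
    """
    # Collect the results of every run together, keyed by test name.
    collected = defaultdict(list)
    for run in runs:
        for test, result in run.items():
            collected[test].append(result)

    duplicated = set()
    flakes = set()
    for test, results in collected.items():
        distinct = set(results)
        if len(distinct) > 1:
            # Different results between runs: a duplicated result.
            duplicated.add(test)
            # Assumption: a flake fails in at least one run but not in all.
            if "fail" in distinct:
                flakes.add(test)
    return duplicated, flakes


runs = [
    {"test_a": "pass", "test_b": "fail", "test_c": "fail", "test_d": "pass"},
    {"test_a": "pass", "test_b": "pass", "test_c": "fail", "test_d": "skip"},
]
duplicated, flakes = classify_results(runs)
```

With this sample input, `test_b` and `test_d` have different results between runs, but only `test_b` counts as a flake under the assumption above, because `test_d` never fails. `test_c` fails in every run, so it is a consistent failure and is not flagged.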