As I was previously involved with Red Hats platform (I can't speak for the java ...

cookiengineer · on Sept 22, 2023

Most vendors that I am scraping already have a confidence score, which is approximated on a statistical level. For example, can't trust the fixed states of Ubuntu and Debian, so they got a lower confidence score; compared to say, Arch Linux which has the highest confidence in that regard.

Matching package names overall is what I was using the CPEs for initially, but it's way too much overhead to match those in a separate database/table.