TOP 500 Analysis

This analysis looks at some of the overarching trends and relationships in the Top500 benchmark data. We first retrieve the necessary data from the CSGenome API and run it through a quick cleaning process so that the values are easy to compare and analyze. We then generate a correlation matrix, which holds the correlation between every pair of attributes in our DataFrame. To visualize it, we create a heatmap in which the color of each cell represents the strength of the correlation between the two corresponding attributes. The heatmap confirms several notions that seem fairly intuitive. For example, there is an extremely strong correlation between the number of threads in a system and how fast that system runs.
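
The clean, correlate, and plot steps described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: it skips the CSGenome API retrieval entirely and starts from a small hand-made DataFrame. The rows are made up (they are not real Top500 entries), and the column names `threads`, `rmax_tflops`, and `power_kw` as well as the helpers `clean_numeric` and `plot_heatmap` are hypothetical names chosen for the example. It assumes pandas is installed, plus matplotlib if you want the plot.

```python
import pandas as pd

# Made-up illustrative rows, NOT real Top500 entries.
# Column names are hypothetical stand-ins for the real attributes.
raw = pd.DataFrame({
    "threads": ["1,000", "2,000", "4,000", "8,000", "16,000"],
    "rmax_tflops": ["10.5", "19.8", "41.0", "79.5", "161.2"],
    "power_kw": ["300", "n/a", "900", "1500", "2100"],
})


def clean_numeric(df):
    """Strip thousands separators and coerce every column to a numeric dtype.

    Values that cannot be parsed (such as "n/a") become NaN.
    """
    stripped = df.apply(lambda col: col.astype(str).str.replace(",", "", regex=False))
    return stripped.apply(pd.to_numeric, errors="coerce")


def plot_heatmap(corr):
    """Draw the correlation matrix as a heatmap and return the figure."""
    # Imported here so the correlation step works without matplotlib installed.
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    image = ax.imshow(corr.to_numpy(), cmap="coolwarm", vmin=-1, vmax=1)
    ax.set_xticks(range(len(corr.columns)))
    ax.set_xticklabels(corr.columns, rotation=45, ha="right")
    ax.set_yticks(range(len(corr.index)))
    ax.set_yticklabels(corr.index)
    fig.colorbar(image, ax=ax, label="Pearson correlation")
    fig.tight_layout()
    return fig


df = clean_numeric(raw)

# Pairwise Pearson correlation between every pair of columns;
# missing values are excluded pair by pair.
corr = df.corr()
print(corr.round(3))
```

Calling `plot_heatmap(corr)` then renders the matrix, with each cell colored by the strength and sign of the correlation. Fixing the color scale to the range -1 to 1 keeps the colors comparable across different datasets.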
