
Original code is here:
https://colab.research.google.com/drive/1XFS4Kq5nsYPs5YQIPZrP3q3ehKDrnDgv?usp=sharing
If you’re new to Python, go to the menu bar and click Runtime > Run all.
Then wait for around 20 minutes while this script scrapes data from the VCAA and generates an interactive scatterplot for you. When it’s done, there will be some interesting data files available for download from the file explorer on the left of the screen.
You’ll notice some interesting findings in the scatterplot, including the fact that boys outperform girls in biology, and girls outperform girls in physics! Girls outperform boys in 15 of the 20 most popular VCE subjects with the only exceptions being Chemistry, Biology (only slightly) and all three mathematics subjects.
Feel free to modify this code and repost it. There are some other interesting insights you could glean from the dataset. Enjoy!