This slide deck traces the history of computer vision by following one task through time: classifying and detecting a bird in an image. Starting from keypoint detection, moving through convolutional neural networks and Transformers, and ending with modern multimodal models and their capabilities, it shows a steady improvement in the models' abilities.