Is a picture really worth a thousand words? Recent progress in AI has in fact been driven by learning from both images and text. Computers are increasingly able to extract insights from vast quantities of image and text data. This opens up new opportunities, but it also brings notable risks that need to be considered. Can a thoughtful approach to AI give us systems that seem a little more human and that better guide us in our everyday lives?