it truly is shown that The straightforward pre-teaching task of predicting which caption goes with which picture is surely an productive and scalable way to know SOTA image representations from scratch on the dataset of https://k2spiceshop.com/product/liquid-k2-on-paper-online/