Have I Been Trained? https://haveibeentrained.com/
HaveIBeenTrained is a tool that uses CLIP retrieval to search the largest public text-to-image datasets, LAION-5B and LAION-400M, so that artists can find their images and flag the links to them for removal, opting out of having their work used to train generative AI systems. These datasets are typically shared as files that contain links to images on the internet together with captions that describe them. Stability AI and LAION partner to remove the flagged links, so that future models are not trained on the opted-out work. HaveIBeenTrained incorporates new datasets as they are released and partners with other organizations, with the aim of serving as a single, one-time opt-out tool for every dataset used to train generative AI art tools. The solution builds on retrieval tools created by the LAION community, which enable efficient search through large collections of image-text pairs using kNN indices pre-computed from the embeddings of CLIP models pre-trained by OpenAI and LAION.
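
To make the search-then-remove idea concrete, here is a minimal sketch in Python. It is not the HaveIBeenTrained or LAION implementation: the three-dimensional vectors are made-up stand-ins for CLIP embeddings, the brute-force cosine search stands in for a pre-computed kNN index, and the names `knn_search`, `remove_opted_out`, and the example.com URLs are hypothetical.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def knn_search(query, index, k=2):
    """Return the k (url, caption, score) entries closest to `query`.

    Brute-force cosine similarity; real systems use a pre-computed
    approximate kNN index instead of scanning every row.
    """
    q = normalize(query)
    scored = []
    for url, caption, emb in index:
        score = sum(a * b for a, b in zip(q, normalize(emb)))
        scored.append((url, caption, score))
    scored.sort(key=lambda item: item[2], reverse=True)
    return scored[:k]


def remove_opted_out(rows, opted_out_urls):
    """Drop every dataset row whose image link has been flagged for removal."""
    return [row for row in rows if row[0] not in opted_out_urls]


# Toy dataset: (image link, caption, embedding). All values are invented.
INDEX = [
    ("https://example.com/a.jpg", "a watercolor fox", [0.9, 0.1, 0.0]),
    ("https://example.com/b.jpg", "a city skyline at night", [0.0, 0.2, 0.9]),
    ("https://example.com/c.jpg", "an ink sketch of a fox", [0.8, 0.3, 0.1]),
]

# An artist searches with a query embedding and flags the best match.
hits = knn_search([1.0, 0.2, 0.0], INDEX, k=2)
opted_out = {hits[0][0]}

# The flagged link is filtered out before the dataset is shared again.
remaining = remove_opted_out(INDEX, opted_out)
```

In this toy run the two fox images rank highest for the query, and only the flagged link is dropped from the rows that remain.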