NeoPrompt is a tool designed to make the creation of AI art more accessible and less time-consuming by providing a comprehensive framework for generating unique prompts. Catering to both beginners and expert artists, it features a repository of imaginative words and phrases and offers explanatory insights into its prompt generation methods. It's compatible with popular AI art platforms like Midjourney and Stable Diffusion, and the team is continuously working on enhancing its compatibility and accuracy.
The gpt-prompt-engineer tool is a powerful solution for prompt engineering, enabling users to experiment and find the optimal prompt for GPT-4 and GPT-3.5-Turbo language models. It generates a variety of prompts based on the provided use-case and test cases, and then tests and ranks them using an ELO rating system. Additionally, there is a specific classification version that evaluates test case correctness and provides scores for each prompt. The tool also supports optional logging to Weights & Biases, allowing for tracking of configurations and prompt performance.
The Diffusion Explainer tool is an interactive webpage that allows users to generate an image from a text prompt. Users have control over various hyperparameters, including the seed and guidance scale, to customize the generated image. The text prompt should describe the desired image in detail to generate high-quality images. By changing the random seed, users can obtain different image representations. Moreover, adjusting the guidance scale can improve the adherence of the image to the text prompt but could limit the image's creativity. While the tool offers flexibility in creating images, it does not allow adjustments to other hyperparameters such as the total number of timesteps, image size, and the type of scheduler.
Safe & Stable is a user-friendly tool designed to convert stable diffusion checkpoint files (.ckpt) to the safer and more secure .safetensors format for tensor storage. This new format enhances security by preventing malicious Python code while improving performance during model loading on both CPUs and GPUs. The tool's graphical interface simplifies file selection and monitors conversion progress. Although the initial conversion still requires .ckpt data, future models will be distributed exclusively in the .safetensors format, eliminating the need for scanning or converting from potentially harmful pickle files.
A method for editing images using human instructions, which involves providing an input image and a written instruction that tells the model what to do, and the model follows these instructions to edit the image. To generate training data for this problem, the article combines the knowledge of two large pretrained models, a language model (GPT-3) and a text-to-image model (Stable Diffusion), to generate a large dataset of image editing examples.
The Stable Diffusion WebUI Plugin is a plugin for Photoshop and Krita that interfaces with AUTOMATIC1111's Stable Diffusion WebUI without the need to switch to another WebUI or modify an existing installation. This plugin allows for text to image, image to image, inpainting, and outpainting inside Photoshop and Krita, eliminating the need to fuss around with the inpainter tool in the browser or upload masks. The plugin also allows for better script usage support, standalone face fix, pause and interrupt, render queue, and experiments, with upscaling support planned in the future.
The clip-retrieval package allows for easy computing of clip embeddings and building of a clip retrieval system. It can be used to quickly compute image and text embeddings, build efficient indices, filter data, and host these indices with a simple Flask service. The package also includes a simple UI querying system. The clip-retrieval package has been used by cah-prepro to preprocess 400M image+text for the dataset and by other projects such as autofaiss and antarctic-captions. ClipClient allows remote querying of a clip-retrieval backend via python. The package is installable with pip.
The EveryDream Tools repository contains data engineering tools for Stable Diffusion, an image project, and other image projects. The tools can be used for fine-tuning beyond the initial DreamBooth paper implementations. The repo includes tools for web scraping, auto-captioning, file renaming, image compression, and training. Ground truth Laion data can be mixed with training data to improve training quality. Captioned training and regularization have enabled multi-subject and multi-style training simultaneously.
Deforum is a community of AI image synthesis developers, enthusiasts, and artists. They have created a notebook using Stable Diffusion and continue to improve its functionality daily. It's free, it's amazing, and you can use it to make cool stuff with AI.
OpenOutpaint is an intuitive and convenient outpainting tool that provides queueable, cancelable dreams, arbitrary dream reticle size, and an effectively infinite, resizable, scalable canvas. It has a very functional and familiar layer system, and users can save, load, import, and export workspaces. It also includes an inpainting/touchup mask brush, webUI script support, prompt history panel, and interrogate tool, among others. The tool is available as an extension for webUI and has floating control panels and toolboxes with handy keyboard shortcuts. It supports upscaler for final output images, saves preferences and imported images to browser local storage, and has a reset to defaults button. It also has an optional generate-ahead function to keep generating dreams while users look through the ones that already exist.
The random drawing prompt generator provides users with easy drawing ideas by generating a stream of random prompts. The generator is not based on AI technology, but rather on a list of creative drawing ideas that include universal prompts suitable for easy or detailed observational drawing, as well as cool anime drawing ideas. The purpose of the generator is to help artists overcome creative blocks by providing them with a starting point for their artwork. The prompts are intended to disrupt the artist's mind and help ideas start flowing.
HaveIBeenTrained is a tool that uses clip retrieval to search the largest public text-to-image datasets, Laion-5B and Laion-400M, to remove links to images that artists want to opt-out from being used to train generative AI systems. These datasets are typically shared as files that contain links to images on the internet and captions that describe them. Stability and Laion partner to remove links that have been flagged for removal, ensuring that future models will not be trained with the opted-out work. HaveIBeenTrained incorporates new datasets as they are released and partners with other organizations to serve as a once-only opt-out tool for every dataset used to train generative AI art tools. The solution builds upon retrieval tools created by the LAION community that enable efficient search through large collections of image-text pairs based on kNN indices pre-computed using CLIP models pre-trained by OpenAI and LAION.
Get an approximate text prompt, with style, matching an image. Optimized for stable-diffusion (clip ViT-L/14)). The resource is an adapted version of the CLIP Interrogator notebook by @pharmapsychotic, which uses OpenAI CLIP models to analyze an image's content and suggest text prompts to create more similar images. The results are combined with BLIP caption to provide suggested text prompts.
Example :
a cat wearing a suit and tie with green eyes, a stock photo by Hanns Katz, pexels, furry art, stockphoto, creative commons attribution, quantum wavetracing
A database project that aggregates tools and resources for artists, engineers, curators, and researchers interested in incorporating machine learning (ML) and other forms of artificial intelligence (AI) into their practice. The database contains resources from partners and networks and covers a broad spectrum of possibilities presented by the current advances in ML, enabling users to generate images, create interactive artworks, draft texts, or recognize objects. While most of the tools require coding skills, the database includes entries tagged as courses and encourages beginners to turn to RunwayML. The article notes that the database is not comprehensive and is a growing collection of research commissioned and collected by the Creative AI Lab, with new entries added regularly.
FoldFold allExpandExpand allAre you sure you want to delete this link?Are you sure you want to delete this tag?
The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community