Generative AI Art with Fooocus - Quickstart Guide

Last Updated	Changes
12/13/2023	First version published
1/17/2023	Updated with Advanced options

Fooocus is an interface for creating images (inference) with Stable Diffusion, aimed at beginner/intermediate users who may not want or need all the complexity of Auto1111 or ComfyUI. It’s super simple to set up, easy to use, the system requirements are low, and it produces absolutely beautiful images!

It doesn’t have all the bells and whistles advanced users have come to expect from Auto or Comfy, but that’s not what it’s for – Fooocus allows us to return to our prompting roots; less tweaking and fiddling with advanced settings allowing us to “fooocus” on our prompt design. It’s refreshing!

If you’re a beginner, or looking to get back to basics, you should be super excited by Fooocus! The interface is minimalist, but there are enough features to make it interesting for the seasoned-prompter! Plus, the images are superb quality! It’s 100% offline, it’s open-source, and it’s free!

The following image generations are not cherry picked; they’re based on prompts selected at random from the Civitai community;

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (1)

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (2)

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (3)

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (4)

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (5)

Fooocus Checkpoint/Model Directory:
Fooocus\models\checkpoints

Fooocus LoRA Directory:
Fooocus\models\loras

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (12)

Fooocus is aimed at beginner users, perhaps coming from other image generation services such as Midjourney, but it does include some extremely advanced features and options

Setting/Option/Concept	Fooocus
Text-to-image prompting	One of the most interesting aspects of Fooocus is the text-to-image processing engine. It passes prompts through an offline GPT-2 engine to ensure that the final images are always beautiful! Even short three-world prompts produce excellent results!
Midjourney V1 V2 V3 V4 commands	Variations can be accessed by checking `Input Image`, `Upscale or Variation`, then selecting a Variation type (`Subtle`) or (`Strong`).
Midjourney U1 U2 U3 U4 commands	Upscaling can be accessed by checking `Input Image`, `Upscale or Variation`, then selecting an Upscale option (`1.5x`) or (`2x`)
Inpainting/Outpainting	Inpainting and Outpainting can be accessed via the `Input Image` checkbox. Note that Fooocus uses its own inpainting algorithm and models which are downloaded the first time you try to inpaint! Results are really good!
Image Prompting (img2img)	Image Prompting can be accessed via the `Input Image` checkbox. Note that Fooocus uses its own image prompting (img2img) algorithm and the results are great!
Midjourney –style command	Preset styles can be accessed from the `Advanced`, `Styles` list.
Prompt Weights	Fooocus uses the `(token:N.N)` syntax for weighting. Fooocus uses Auto1111’s reweighting algorithm.
Embeddings (TIs)	Embeddings can be referenced in the prompt with the syntax `(embedding:file_name:1.1)`
Midjourney –no command	The no (negative prompt) is accessed from the `Advanced` checkbox
Midjourney –ar command	The ar (aspect ratio) is accessed from the `Advanced` checkbox
Face Swapping	InsightFace is used for face swapping between images, and can be accessed from the `Input Image` checkbox, `Advanced`, `Face Swap`
Image Describe/Interrogation	Images can be uploaded to the `Describe` box to be interrogated for a prompt.

Fooocus has excellent inpainting, img2img, and Faceswap capabilities, powered by proprietary Fooocus models/engines, accessible from the “Input Image” checkbox below the prompt box.

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (13)

The four input image boxes are a mix of an; “IP-Adapter, and a precomputed negative embedding from Fooocus team, an attention hacking algorithm from Fooocus team, and an adaptive balancing/weighting algorithm from Fooocus team.” per the Fooocus documentation. So what do they actually do? Here are some visual examples of what’s possible;

Single Image Upload – No Prompt

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (14)

Attempts to “replicate” certain concepts of an input image, no prompt required…

Single Image Upload – With a Prompt

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (15)

But if a prompt is supplied, attempts to replicate the input image along with the additional context of the prompt.

Multiple Images – No Prompt

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (16)

Supplying two images results in an image blend.

Multiple Images – With a Prompt

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (17)

Two images, plus a prompt, creates a blend of all three.

Face Swapping

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (18)

By checking the Advanced options below the input images and switching to FaceSwap, we can swap faces from an input image into our generations. It’s not flawless, but it’s passable in many cases. You may need to adjust the Weight slider to get better results.

PyraCanny

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (19)

Canny is a method of edge detection which users of ControlNet will be familiar with. Fooocus implements Canny in a unique way, based on the principles set forth in this research paper.

Essentially, it’s looking for edges and hard lines in a source image, allowing that “template” to be filled in by details supplied in a new prompt – in the case above, a female athlete.

CPDS

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (20)

CPDS is similar to PyraCanny. It’s also similar to the depth ControlNet preprocessor, but is based upon this research paper – Contrast Preserving Decolorization. Note that the implementation doesn’t include the decolorization part.

Fooocus has excellent promptless inpainting and outpainting, powered by an internal Fooocus model. To activate these options, check input image and set the tab to inpaint or outpaint.

Outpainting

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (21)

Outpainting allows us to supply an input image, select an Outpaint Direction in which to extend the canvas, and “fill” that space.

Inpainting

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (22)

Inpainting on the other hand, is the process of selecting a particular object in an input image, and replacing it with another prompted concept. Here, we’re replacing the woman in the image with a fearsome orc, while retaining the rest of the image.

If you don’t have a graphics card you can also run Fooocus via Google Colab (Last tested working 12/13/2023). Most Fooocus features work on the Free Colab tier.

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (23)

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (24)

The minimum requirements for Fooocus are 4GB of GPU Memory (VRAM), and 8GB of system memory (RAM).
Inference (image generation speed) on a relatively lower-end laptop with 16GB System RAM and an NVidia 3060 (6GB VRAM) can take approximately 1.4 seconds.
Fooocus works on Apple M1 and M2, with macOS Catalina or newer!
Fooocus doesn’t currently embed generation metadata into the output images – a feature Auto1111 and ComfyUI users have come to take for granted! This is a limitation, especially for Fooocus users who’d like to post their images on Civitai.

We’ll continue to expand this quickstart guide with more information as it becomes available!

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (2024)