Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (2024)

Last UpdatedChanges
12/13/2023First version published
1/17/2023Updated with Advanced options

Fooocus is an interface for creating images (inference) with Stable Diffusion, aimed at beginner/intermediate users who may not want or need all the complexity of Auto1111 or ComfyUI. It’s super simple to set up, easy to use, the system requirements are low, and it produces absolutely beautiful images!

It doesn’t have all the bells and whistles advanced users have come to expect from Auto or Comfy, but that’s not what it’s for – Fooocus allows us to return to our prompting roots; less tweaking and fiddling with advanced settings allowing us to “fooocus” on our prompt design. It’s refreshing!

If you’re a beginner, or looking to get back to basics, you should be super excited by Fooocus! The interface is minimalist, but there are enough features to make it interesting for the seasoned-prompter! Plus, the images are superb quality! It’s 100% offline, it’s open-source, and it’s free!

The following image generations are not cherry picked; they’re based on prompts selected at random from the Civitai community;

The good news is that installing Fooocus is super simple! On Windows, click here to download the zipped package!

After unzipping into a folder, all you need to do is double click on run.bat to start the installation process. The first time you run Fooocus, models will be downloaded into the fooocus\models\checkpoints directory, so the first time launching the app can take quite some time! Subsequent launches will be a lot faster.

There’s also the option to launch a run_anime.bat and a run_realistic.bat. These will automatically download models tuned to produce anime, and realism results, respectively.

It’s as easy as that! Each time you’d like to start the Fooocus interface, launch run.bat. Updates are automatically downloaded and installed each time you launch the interface via run.bat

We can use any SDXL Model or LoRA from Civitai with Fooocus! It’s as easy as downloading these models into the following directories, then refreshing Fooocus using the “🔃 Refresh all files”.

Model and LoRA selections can be made by checking the “Advanced” checkbox, below the prompt box.

  • Fooocus Checkpoint/Model Directory:
    Fooocus\models\checkpoints
  • Fooocus LoRA Directory:
    Fooocus\models\loras
Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (12)

Fooocus is aimed at beginner users, perhaps coming from other image generation services such as Midjourney, but it does include some extremely advanced features and options

Setting/Option/ConceptFooocus
Text-to-image promptingOne of the most interesting aspects of Fooocus is the text-to-image processing engine. It passes prompts through an offline GPT-2 engine to ensure that the final images are always beautiful! Even short three-world prompts produce excellent results!
Midjourney V1 V2 V3 V4 commandsVariations can be accessed by checking Input Image, Upscale or Variation, then selecting a Variation type (Subtle) or (Strong).
Midjourney U1 U2 U3 U4 commandsUpscaling can be accessed by checking Input Image, Upscale or Variation, then selecting an Upscale option (1.5x) or (2x)
Inpainting/OutpaintingInpainting and Outpainting can be accessed via the Input Image checkbox. Note that Fooocus uses its own inpainting algorithm and models which are downloaded the first time you try to inpaint! Results are really good!
Image Prompting (img2img)Image Prompting can be accessed via the Input Image checkbox. Note that Fooocus uses its own image prompting (img2img) algorithm and the results are great!
Midjourney –style commandPreset styles can be accessed from the Advanced, Styles list.
Prompt WeightsFooocus uses the (token:N.N) syntax for weighting. Fooocus uses Auto1111’s reweighting algorithm.
Embeddings (TIs)Embeddings can be referenced in the prompt with the syntax (embedding:file_name:1.1)
Midjourney –no commandThe no (negative prompt) is accessed from the Advanced checkbox
Midjourney –ar commandThe ar (aspect ratio) is accessed from the Advanced checkbox
Face SwappingInsightFace is used for face swapping between images, and can be accessed from the Input Image checkbox, Advanced, Face Swap
Image Describe/InterrogationImages can be uploaded to the Describe box to be interrogated for a prompt.

Fooocus has excellent inpainting, img2img, and Faceswap capabilities, powered by proprietary Fooocus models/engines, accessible from the “Input Image” checkbox below the prompt box.

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (13)

The four input image boxes are a mix of an; “IP-Adapter, and a precomputed negative embedding from Fooocus team, an attention hacking algorithm from Fooocus team, and an adaptive balancing/weighting algorithm from Fooocus team.” per the Fooocus documentation. So what do they actually do? Here are some visual examples of what’s possible;

Single Image Upload – No Prompt

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (14)

Attempts to “replicate” certain concepts of an input image, no prompt required…

Single Image Upload – With a Prompt

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (15)

But if a prompt is supplied, attempts to replicate the input image along with the additional context of the prompt.

Multiple Images – No Prompt

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (16)

Supplying two images results in an image blend.

Multiple Images – With a Prompt

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (17)

Two images, plus a prompt, creates a blend of all three.

Face Swapping

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (18)

By checking the Advanced options below the input images and switching to FaceSwap, we can swap faces from an input image into our generations. It’s not flawless, but it’s passable in many cases. You may need to adjust the Weight slider to get better results.

PyraCanny

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (19)

Canny is a method of edge detection which users of ControlNet will be familiar with. Fooocus implements Canny in a unique way, based on the principles set forth in this research paper.

Essentially, it’s looking for edges and hard lines in a source image, allowing that “template” to be filled in by details supplied in a new prompt – in the case above, a female athlete.

CPDS

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (20)

CPDS is similar to PyraCanny. It’s also similar to the depth ControlNet preprocessor, but is based upon this research paper – Contrast Preserving Decolorization. Note that the implementation doesn’t include the decolorization part.

Fooocus has excellent promptless inpainting and outpainting, powered by an internal Fooocus model. To activate these options, check input image and set the tab to inpaint or outpaint.

Outpainting

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (21)

Outpainting allows us to supply an input image, select an Outpaint Direction in which to extend the canvas, and “fill” that space.

Inpainting

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (22)

Inpainting on the other hand, is the process of selecting a particular object in an input image, and replacing it with another prompted concept. Here, we’re replacing the woman in the image with a fearsome orc, while retaining the rest of the image.

If you don’t have a graphics card you can also run Fooocus via Google Colab (Last tested working 12/13/2023). Most Fooocus features work on the Free Colab tier.

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (23)
Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (24)
Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (25)
Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (26)
Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (27)
  • The minimum requirements for Fooocus are 4GB of GPU Memory (VRAM), and 8GB of system memory (RAM).
  • Inference (image generation speed) on a relatively lower-end laptop with 16GB System RAM and an NVidia 3060 (6GB VRAM) can take approximately 1.4 seconds.
  • Fooocus works on Apple M1 and M2, with macOS Catalina or newer!
  • Fooocus doesn’t currently embed generation metadata into the output images – a feature Auto1111 and ComfyUI users have come to take for granted! This is a limitation, especially for Fooocus users who’d like to post their images on Civitai.

We’ll continue to expand this quickstart guide with more information as it becomes available!

Generative AI Art with Fooocus - Quickstart Guide - Civitai Education (2024)
Top Articles
Latest Posts
Article information

Author: Francesca Jacobs Ret

Last Updated:

Views: 6271

Rating: 4.8 / 5 (48 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Francesca Jacobs Ret

Birthday: 1996-12-09

Address: Apt. 141 1406 Mitch Summit, New Teganshire, UT 82655-0699

Phone: +2296092334654

Job: Technology Architect

Hobby: Snowboarding, Scouting, Foreign language learning, Dowsing, Baton twirling, Sculpting, Cabaret

Introduction: My name is Francesca Jacobs Ret, I am a innocent, super, beautiful, charming, lucky, gentle, clever person who loves writing and wants to share my knowledge and understanding with you.