File:X-Y plot of algorithmically-generated photorealistic portraits by nationality.png
Original file (9,216 × 4,642 pixels, file size: 33.4 MB, MIME type: image/png)
Captions
The factual accuracy of this image is disputed. The image for "American" is unlikely to reflect the training data |
Summary
[edit]DescriptionX-Y plot of algorithmically-generated photorealistic portraits by nationality.png |
An X/Y plot of algorithmically-generated photorealistic AI images featuring an office worker depicted as various different nationalities, created using a custom merged Stable Diffusion AI diffusion model checkpoint featuring R34_e2 merged with gg1342 at 0.5 weighted sum, then merged with Anything V3.0 at 0.5 weighted sum, and then finally merged with F222 at 0.5 weighted sum. This merged model was also paired with the sd-vae-ft-mse-original VAE. This plot serves to illustrate the most basic use-case for the img2img feature within Stable Diffusion.
These images were generated using an NVIDIA RTX 4090; since Ada Lovelace chipsets (using compute capability 8.9, which requires CUDA 11.8) are not fully supported by the pyTorch dependency libraries currently used by Stable Diffusion, I've used a custom build of xformers, along with pyTorch cu116 and cuDNN v8.6, as a temporary workaround. Front-end used for the entire generation process is Stable Diffusion web UI created by AUTOMATIC1111. An initial 768x1024 image was generated with txt2img using the following prompts:
Then, a batch of 1536x2048 images were generating with img2img, using the image generated earlier, along with the following prompts:
During the generation of this batch, an X/Y plot was generated using the "X/Y plot" script, along with the following settings:
|
Date | |
Source | Own work |
Author | Benlisquare |
Permission (Reusing this file) |
As the creator of the output images, I release this image under the licence displayed within the template below.
The Stable Diffusion AI model is released under the CreativeML OpenRAIL-M License, which "does not impose any restrictions on reuse, distribution, commercialization, adaptation" as long as the model is not being intentionally used to cause harm to individuals, for instance, to deliberately mislead or deceive, and the authors of the AI models claim no rights over any image outputs generated, as stipulated by the license.
R34_e2, gg1342 and F222 are custom-trained derivative models of Stable Diffusion 1.4. The CreativeML OpenRAIL-M License applies to all downstream derivative versions of the model, as stipulated under the preamble. Anything V3.0, created by Furqanil Taqwa, is released under the CreativeML OpenRAIL-M License.
|
Licensing
[edit]- You are free:
- to share – to copy, distribute and transmit the work
- to remix – to adapt the work
- Under the following conditions:
- attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled GNU Free Documentation License.http://www.gnu.org/copyleft/fdl.htmlGFDLGNU Free Documentation Licensetruetrue |
File history
Click on a date/time to view the file as it appeared at that time.
Date/Time | Thumbnail | Dimensions | User | Comment | |
---|---|---|---|---|---|
current | 22:12, 3 December 2022 | 9,216 × 4,642 (33.4 MB) | Benlisquare (talk | contribs) | separate into two rows, for better ease of viewing | |
22:08, 3 December 2022 | 18,432 × 2,321 (35.44 MB) | Benlisquare (talk | contribs) | {{Information |Description=An X/Y plot of algorithmically-generated photorealistic AI images featuring an office worker depicted as various different nationalities, created using a custom merged Stable Diffusion AI diffusion model checkpoint featuring R34_e2 merged with gg1342 at 0.5 weighted sum, then merged with [https://huggingface.co/Linaqruf/anything-v3.0 Anything V3.0] at 0.5 weighted sum, and then finally merged with [https://ai.zeipher.com/ F222] at 0.5 weight... |
You cannot overwrite this file.
File usage on Commons
There are no pages that use this file.
Metadata
This file contains additional information such as Exif metadata which may have been added by the digital camera, scanner, or software program used to create or digitize it. If the file has been modified from its original state, some details such as the timestamp may not fully reflect those of the original file. The timestamp is only as accurate as the clock in the camera, and it may be completely wrong.
Horizontal resolution | 28.35 dpc |
---|---|
Vertical resolution | 28.35 dpc |
File change date and time | 22:07, 3 December 2022 |