A new rabbit hole (AI art & music)

dan.tilley · March 4, 2022, 12:33am

Apologies if I am dragging anyone else down a black hole, but, I recently discovered a new app called Visions of Chaos https://www.softology.com.au/voc.htm

I am interested in this mainly because I was trying to get into generative art using GANs and Style Transfer etc

But having downloaded the app, I now see there is so much more potential in it, for a vast number of creative endeavours, including music

So I thought I’d share it here and see what amazing stuff you all can come up with.

Here are a few art pieces on the theme of modular synthesizers that I generated using the text-to-image AI scripts in the app:

augment · March 4, 2022, 1:36am

I gotta say, Dan… I find those images impressive in their way, but also rather disturbing. And knowing that they were generated by AI, only moreso. One might be inclined to the interpretation that the AI equates synth modules with viscera - like they’re dismembered computers with all their inside bits on the outside. - and yet they still process!

john_rose · March 4, 2022, 4:58am

Huh. Some of those look even more disturbing than some of the Google Deep Dream Generator images.

dan.tilley · March 4, 2022, 11:00am

Hah!

Do not worry, our future machine overlords are not to blame, these are from a text-to-image script, which means that I input a text prompt in order to generate them. Their disturbing nature is due to the actual words I used in the text prompt. It is very possible to create a rainbow fairy land synthesizer art piece, if you use the right words.

For example, part of the text prompt for the 3rd piece was

bloodstained Moog synthesizer

dan.tilley · March 4, 2022, 11:44am

Trying to evolve in a less disturbing direction

dan.tilley · March 4, 2022, 11:59am

Wow, everyone that I generate just seems to get better, I love this, have to share these two new ones:

chrome audio interface with wires

oscilloscope wired up to a robot

Richie · March 4, 2022, 3:05pm

Incredible - every one of them is an awesome album cover awaiting its music!

aetrion-music · March 4, 2022, 3:38pm

These look awesome and super inspiring. Pls don’t apologize! This is exactly the kind of rabbit hole that I‘d want to be dragged into

dan.tilley · March 4, 2022, 6:04pm

If anyone wants some art generated (for an album cover or whatever), feel free to message me and I’ll see what I can create for you, I am really loving this process

FiroLFO · March 4, 2022, 7:28pm

I love it.

baudoinia · March 4, 2022, 8:10pm

Sounds like the impetus for a really fascinating vlog!

REDACT3D · March 4, 2022, 9:56pm

Wonder what my logo icon (avatar) would look like lol

interesting images O.O

dan.tilley · March 4, 2022, 11:28pm

Gigeresque VCV logos anyone?

mosphaere · March 5, 2022, 1:27am

That is so strange but somewhat beautiful. Would you mind sharing how difficult it is to set this up? Installation, options, what to learn?

StochasticTelegraph · March 5, 2022, 2:57am

Fascinating. Like a very far forward version of Wombo Dream.

The main description page doesn’t seem to mention text->image scripts (well, it’s a looong list, so I wasn’t that thorough). What do they call it in the app?

REDACT3D · March 5, 2022, 7:43am

Nice! getting some Giger vibes lol

Richie · March 5, 2022, 10:11am

Wow! How do you express what you want, when it’s as specific as the VCV logo?

dan.tilley · March 5, 2022, 2:59pm

Sure, though fair warning, it is not simple or easy:

Note that text-to-image is just one of many modes available in Visions of Chaos, you can just download the app and play with everything that is not under the machine learning section and still have a great time

Now, specifically for the machine learning modes, there is a lot of pre requisites, you need an Nvidia RTX 20x or 30x GPU and then you must following these instructions for installing everything Softology - Visions of Chaos - Machine Learning Support

Once you have all that stuff installed, you need to activate one of the machine learning modes, then the app will prompt you to download all the models, I think currently there are 51 different models, each one can be multiple GB, these step was the hardest part, because the download can be a little bit flaky.

If you get all the models downloaded, then finally you can start generating images, my advice for that is to start really simple and at small resolutions, change one thing at a time and re-render, this way you will start to understand what each parameter affects

If you get stuck but want to persist, the author of the app has a discord and he is very active and responsive there

An alternative is to use something called Google CoLab, this is a cloud based virtual machine solution for running machine learning notebooks shared by other people, though this is just as complex, if not more so, and to really get the best benefit you would need a subscription to the pro version, but i’ll leave this tutorial in case thats of interest https://youtu.be/FA2MNG8D5x0

dan.tilley · March 5, 2022, 3:00pm

dan.tilley · March 5, 2022, 3:08pm

One of the options for some of the text-to-image scripts is to provide a seed image

What I did for those images was seed the VCV logo, set the Init scale, Skip timesteps & Guidance scale to a larger value (these control how alike to the seed image the generated image is) and then I wrote various text prompts that describe what I think the content of a Giger picture is.

For example, I think one of the prompts I wrote was Bleached biomech bones by H R Giger

In the end you just have to use trial and error until you arrive in a place where you have some idea of what each parameter does, and then tailor your text prompt to the result you are looking for. But even then there is still an element of randomness, so you will still sometimes end up running a workflow multiple times because you don’t like the output.