I find it fascinating that despite all my efforts, no chatbot (e.g. ChatGPT, Gemini, Claude) has been able to provide me with a proper VCV Rack patch.
It’s pretty simple to select all modules and copy the relevant JSON into the chatbox, but whenever I ask for a nice melody or a smart cable arrangement, they return something useless.
They typically recognize the modules and suggest a basic module arrangement (like LFO → ADSR → VCO → VCA → VCF → output), but when I ask for modified JSON, the result is disappointing.
Do any of you have a better experience with this technique?
It’s funny, I was just thinking about how one might write a program to generate a VCV Rack patch, since a patch is really just a compressed file with some JSON text inside it. How to make a useful or interesting patch, or even a patch that works at all, would require a lot more thought. I imagined making some kind of software library for programs that generate patches.
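To illustrate what such a library might look like, here is a minimal sketch of building the patch JSON programmatically. The `modules`/`cables` field names are what Rack writes into `patch.json`, but treat the exact schema as an assumption and double-check it against a patch saved by your own Rack version:

```python
import json

# Sketch: build a patch dict and wire two modules together.
# Assumed schema (verify against a real patch.json): a "modules" list,
# and a "cables" list whose entries reference module ids and port indices.

def add_cable(patch, out_module, out_port, in_module, in_port):
    """Append a cable entry connecting an output port to an input port."""
    cable_id = max((c["id"] for c in patch["cables"]), default=-1) + 1
    patch["cables"].append({
        "id": cable_id,
        "outputModuleId": out_module,
        "outputId": out_port,
        "inputModuleId": in_module,
        "inputId": in_port,
    })

patch = {
    "modules": [{"id": 0, "model": "VCO"}, {"id": 1, "model": "VCF"}],
    "cables": [],
}
add_cable(patch, 0, 0, 1, 0)  # VCO output 0 -> VCF input 0
print(json.dumps(patch, indent=2))
```

The interesting part, of course, is not emitting the JSON but deciding *which* modules and cables make a patch worth listening to.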
As to why LLM systems aren’t good at coming up with patches, I believe it’s a combination of a lack of good training data and the fact that the problem domain doesn’t match the kinds of semantic patterns present in source code or written human language.
Structurally sound module arrangements (for some standard synthesizer topology, say) should be relatively easy; “asking for a nice melody” is something that might require more creative prompting, hehe.
I’ve experimented with this in various ways. One example: I discussed a custom plaintext format for musical notation with Microsoft Copilot, and we established a sort of pseudo-MIDI-event format in CSV. Then I chatted through a Python program that converts that plaintext to actual MIDI files. All of this went perfectly: Copilot could output musical “ideas” and melodies, even harmonies with multiple voices, directly as text, and I just ran those through the Python script and listened to the results in a DAW.
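For anyone curious what such a converter can look like, here is a stdlib-only sketch of the idea. The CSV column names (`delta_ticks,note,velocity,on`) are my own invention, not the format Copilot and I settled on, and the writer emits only the bare minimum of a single-track Standard MIDI File:

```python
import csv
import io
import struct

def varlen(n):
    """Encode n as a MIDI variable-length quantity (7 bits per byte)."""
    out = [n & 0x7F]
    n >>= 7
    while n:
        out.append(0x80 | (n & 0x7F))
        n >>= 7
    return bytes(reversed(out))

def csv_to_midi(text, ticks_per_quarter=480):
    """Convert pseudo-MIDI-event CSV text to Standard MIDI File bytes."""
    track = bytearray()
    for row in csv.DictReader(io.StringIO(text)):
        # Note-on (0x90) or note-off (0x80), channel 0.
        status = 0x90 if int(row["on"]) else 0x80
        track += varlen(int(row["delta_ticks"]))
        track += bytes([status, int(row["note"]), int(row["velocity"])])
    track += b"\x00\xff\x2f\x00"  # end-of-track meta event
    # MThd chunk: length 6, format 0, one track, given time division.
    header = b"MThd" + struct.pack(">IHHH", 6, 0, 1, ticks_per_quarter)
    return header + b"MTrk" + struct.pack(">I", len(track)) + bytes(track)

# One quarter note of middle C (on at tick 0, off 480 ticks later).
song = "delta_ticks,note,velocity,on\n0,60,100,1\n480,60,0,0\n"
midi_bytes = csv_to_midi(song)
```

Writing `midi_bytes` to a `.mid` file gives you something any DAW will open; the point of the plaintext layer is just that the chatbot can emit it directly in conversation.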
HOWEVER, the actual musical content was more… humorous and entertaining than anything at a “wow, hey, that’s nice” level of melody. Even more advanced prompting didn’t help: recognizing how language models work, I figured the custom format might undermine the statistical chains in the context window that would lead to nice musical output, and tried to mitigate this by asking for a more common representation of the musical content first. But nah, the results were funny rather than seriously musical :).