AI writes a pop song in 4 seconds.

Discussion in 'Ai for Music' started by hackerz4life, Aug 25, 2025 at 8:29 AM.

How do you feel about it?

  1. Very angry.

    1 vote(s)
    2.1%
  2. Depressed AF about the future of our world.

    12 vote(s)
    25.5%
  3. This is a war on art and humanity.

    8 vote(s)
    17.0%
  4. Who cares....

    21 vote(s)
    44.7%
  5. I love AI music.

    1 vote(s)
    2.1%
  6. It sounds good, would do well in the radio and people wouldnt care ...

    4 vote(s)
    8.5%
  1. curtified

    curtified Audiosexual

    Joined:
    Feb 3, 2015
    Messages:
    936
    Likes Received:
    538
    It all depends on the genre you want the pads to be in. But here is a minimal UK garage song with said Juno-style pads:

    https://suno.com/s/YLofJQ6gZmObjEc7
     
  2. curtified

    curtified Audiosexual

    Joined:
    Feb 3, 2015
    Messages:
    936
    Likes Received:
    538
    If we're already f*cked, why don't we learn how to use the tools to further our creativity in these times? Vibe code a plugin, generate a stem, game plan a rollout, generate cover art, etc.

    We can shake our canes at AI all day. Or we can learn how to use it and capitalize on this moment to free our creative ideas to see the light of day!
     
  3. jennyblack

    jennyblack Audiosexual

    Joined:
    Nov 24, 2013
    Messages:
    616
    Likes Received:
    664
    Capitalism and the so-called "communist" countries are just different sides of the same coin when productivism, exploitation, alienation, technology, war and growth are controlled by and used on behalf of the power elites (economic, military, technocratic).

    The point is that if these elites are in oligopolistic control of technology, growth, etc., they will try to use all this to increase and maintain their power (political and economic). Capitalism is just a way to achieve this in the West (with the facade of a "free" market).
     
    Last edited: Aug 26, 2025 at 12:57 AM
  4. Mynock

    Mynock Audiosexual

    Joined:
    Jul 14, 2012
    Messages:
    1,422
    Likes Received:
    2,112


    But these AIs don't create something out of nothing... they recombine and vary existing musical and textual patterns that have been "digested" and stored through statistical learning. The real feat lies in the neural network's ability to understand the complex relationships between these elements, combining them in a way that sounds coherent and, sometimes, surprising (thanks to the insertion of randomness and variations!), with audio sample results being generated quickly with increasingly remarkable quality! So...

    [IMG]
     
    Last edited: Aug 26, 2025 at 12:53 AM
  5. curtified

    curtified Audiosexual

    Joined:
    Feb 3, 2015
    Messages:
    936
    Likes Received:
    538
    Kind of. No audio is stored or used in the process. It's just like GPT: it predicts the next token. So it takes in millions of songs and gets an understanding of them based on a tagging system. Then the algo uses a noise profile to start from. Think of it as a block of clay made of white noise.

    It shapes the noise to be one millisecond of audio that is based on the context you give it. Then it continues that process for the duration of the song.

    Kind of how a series of still photos together become a movie scene.

    No audio is being used in generation, just the overall concepts of the audio. It's just white noise being guided by machine learning to generate audio out of noise, based on its understanding of music and guided by a prompt or another piece of audio.
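
    The "start from white noise and keep shaping it" idea can be sketched roughly like this. It is a toy, not how Suno or any real product works: `toy_denoiser` is a hypothetical stand-in for a trained network, the tag names are made up, and this version refines the whole clip at every step rather than one slice at a time.

```python
import math
import random

def toy_denoiser(noisy, tag):
    """Hypothetical stand-in for a trained model.

    A real network would look at the noisy input and the prompt and predict
    a cleaner signal. This toy ignores the noisy input and just returns a
    sine wave whose pitch depends on the (made-up) tag.
    """
    cycles = {"pad": 2.0, "bass": 1.0}[tag]
    n = len(noisy)
    return [math.sin(2 * math.pi * cycles * i / n) for i in range(n)]

def generate(tag, length=256, num_steps=50, seed=0):
    """Start from white noise and nudge it toward the model's guess, step by step."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(length)]  # the "block of clay"
    for t in range(num_steps, 0, -1):
        guess = toy_denoiser(x, tag)
        # Move a fraction 1/t of the way toward the guess; the last step (t=1) lands on it.
        x = [xi + (gi - xi) / t for xi, gi in zip(x, guess)]
    return x

clip = generate("pad")
print(len(clip), "samples generated from noise")
```

    Nothing from a training song is copied in the loop itself; whatever the model learned lives inside the denoiser function, which here is faked.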
     
  6. Mynock

    Mynock Audiosexual

    Joined:
    Jul 14, 2012
    Messages:
    1,422
    Likes Received:
    2,112
    I see, the AI doesn't store the patterns themselves, but rather the relationships and statistical probabilities between all musical elements, in order to perform an intelligent and statistically informed recombination of learned patterns...

    Got it. So AIs are basically masters of imitation and recombination, right? They're still just constructs with no intention, no emotion, no life experience, no culture, and no ability to make unexpected connections between unrelated concepts, like a human would. And this is all just for songs, right? Honestly, I'll be really impressed when an AI creates something on the level of Stravinsky, Villa-Lobos, Thomas Adès, or John Adams... And believe me, I'm not being ironic, 'cos I truly think this will be possible one day! But probably not in my lifetime... or yours... or your kids'... but maybe your grandkids'! :winker:
     
    Last edited: Aug 26, 2025 at 1:43 AM
  7. Demloc

    Demloc Rock Star

    Joined:
    Mar 10, 2020
    Messages:
    369
    Likes Received:
    342
    "It's so amazing when people tell me that … electronic music has not got soul. And they blame the computers. They got the finger pointed at the computers like, "There's no soul here." … You can't blame the computer. If there's not soul in the music, it's because nobody put it there. And it's not the tool's fault."

    I hate spat-out "AI music" as much as the next guy, but c'mon. The realm of diffusion models is a new medium in its infancy. It's like seeing the first Lumière brothers movie and saying: fuck it, I'm going back to the theater, where the true art resides with real people and not these ghost trains splashed over a white blanket. :guru:

    Don't be a Luddite. Just because any moron can take a camera and make a video, it doesn't invalidate audiovisual media as a whole.
     
  8. Demloc

    Demloc Rock Star

    Joined:
    Mar 10, 2020
    Messages:
    369
    Likes Received:
    342
    More than deep "understanding", it's deep learning of methods of diffusion from noise to signal. It's not AI at all; we have been fooled by that.
    Just take a song, tag it, then add white noise in little intervals until all you have is white noise. Do the same thing with millions of songs. Then teach the model how to take that white noise and, based on tags (prompts from the user), diffuse it into something that has similarities with all or some of the songs with that tagging. The model doesn't understand shit. It just diffuses different sets of noises (seeds) to signal (results).
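
    The "add white noise in little intervals until all you have is white noise" half of that can be sketched in a few lines. This is a minimal toy under assumptions: the "song" is a plain sine wave, the step count and noise amount `beta` are arbitrary picks, and the function names are made up for illustration. The training half (teaching a model to undo each step given the tags) is not shown.

```python
import math
import random

def add_noise_step(signal, beta, rng):
    """One small step: shrink the signal a little and mix in Gaussian white noise."""
    keep = math.sqrt(1.0 - beta)
    spread = math.sqrt(beta)
    return [keep * x + spread * rng.gauss(0.0, 1.0) for x in signal]

def forward_diffusion(signal, num_steps=200, beta=0.05, seed=0):
    """Return every intermediate version, from the clean signal to near-pure noise."""
    rng = random.Random(seed)
    trajectory = [list(signal)]
    for _ in range(num_steps):
        trajectory.append(add_noise_step(trajectory[-1], beta, rng))
    return trajectory

def correlation(a, b):
    """Pearson correlation, used here to see how much of the original survives."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

# Toy "song": 5 cycles of a sine wave over 1000 samples.
clean = [math.sin(2 * math.pi * 5 * i / 1000) for i in range(1000)]
steps = forward_diffusion(clean)
print("similarity to original after 1 step:", round(correlation(clean, steps[1]), 3))
print("similarity to original after all steps:", round(correlation(clean, steps[-1]), 3))
```

    Each pair of neighbouring versions in `steps`, together with the song's tags, is the kind of example a model could be trained on to reverse one small step.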
     