Is RX12 Music Rebalance (for Vocal Removal) Superior to LALA?

Discussion in 'Working with Sound' started by Stevie Dude, May 2, 2026.


fuck ai cover song ?

  1. fuck

    16 vote(s)
    69.6%
  2. fuck

    7 vote(s)
    30.4%
  1. m5g

    m5g Kapellmeister

    Joined:
    Jul 19, 2011
    Messages:
    82
    Likes Received:
    53
    MVSEP is the best, IMO :) I just built the app with Nativefier :)
     
  2. tzzsmk

    tzzsmk Audiosexual

    Joined:
    Sep 13, 2016
    Messages:
    4,526
    Likes Received:
    2,835
    Location:
    Heart of Europe
    UVR5 is the answer, and the details have already been covered earlier in this thread ;)
    Also, don't hesitate to experiment with models that split a track into multiple stems, not just vocals + instrumental. You can easily stitch the stems back together, and the results may be better.
     
  3. clone

    clone Audiosexual

    Joined:
    Feb 5, 2021
    Messages:
    10,348
    Likes Received:
    4,454
    If you feel like paying or using credits, MVSEP can give you anything over 16-bit WAV output. Using Nativefier has no impact on the output. For free? No 24-bit WAV output, which running UVR5 locally does give you. MVSEP is way faster for me, but UVR5's 24-bit WAV output versus MVSEP's 16-bit makes the slower separation worthwhile. UVR5 also lets you set values for segment/chunk size, overlap, number of shifts, etc. Tweaking these makes things a lot slower for me, but I just run it on a different Mac and don't really care how long it takes.
     
  4. wizardmoon2

    wizardmoon2 Ultrasonic

    Joined:
    Aug 3, 2024
    Messages:
    52
    Likes Received:
    29
    Your client has shit taste, just saying.
     
    • Dislike x 1
    • Agree x 1
    • Funny x 1
  5. Riddim Machine

    Riddim Machine Audiosexual

    Joined:
    Jul 3, 2021
    Messages:
    946
    Likes Received:
    734
    Location:
    Sierra Fox
    This happens because of the way this music is generated. Since you're asking a bunch of code to perform a human feeling, the best it can do is copy and paste real performances by real musicians in an unnatural way to mimic those feelings. The results are strange instrument portamento, legatos and quantizations. Guitars that blend into polyphonic synths. All sorts of weird stuff. When you split, all those flaws start getting more attention, so it's not that RX can't split it, but the amount of crap under the hood without all the masking is tremendous.
     
  6. Stevie Dude

    Stevie Dude Audiosexual

    Joined:
    Dec 29, 2020
    Messages:
    2,560
    Likes Received:
    2,342
    Location:
    Near Nyquist
    Thanks everyone for all the suggestions. I completed the job with MVSEP (like a week ago, I think). The result is still bad IMO, but bearable and somehow acceptable, and I got paid for it. Thanks. I think I need to spend more time learning about everything that has to do with stem separation and shitty AI music and how it works.

    I never expect anyone ever in this whole world to have better taste than me. So, it's just another day at the office and I got no problem with it. :)
     
  7. curtified

    curtified Audiosexual

    Joined:
    Feb 3, 2015
    Messages:
    1,037
    Likes Received:
    571
    INSTRUMENTAL:
    https://voca.ro/1nRJfPZtrdpj

    PROCESSED v1:
    https://voca.ro/17kkP07jvifi

    PROCESSED v2:
    https://voca.ro/1cqRRm8p7a3S

    Sadly, the original AI song was generated on an older Suno model (maybe 3.5 or 4), which had more "shimmer" in the sound at the time, so it's hard to tame that. But the stems might help you hone it in.


    STEM GROUPS 4 MIXING:
    https://pixeldrain.com/u/1CE19K6V

    Another option is to run the instrumental back through Suno and have v5.5 generate a better-quality version.
     
    Last edited: May 11, 2026
  8. canbi

    canbi Kapellmeister

    Joined:
    Jun 12, 2023
    Messages:
    274
    Likes Received:
    70
    Both are garbage, use scnet or rofo.

    I'm happy that devs of real programs aren't using open models. The more gatekept good things are, the better.
     
  9. curtified

    curtified Audiosexual

    Joined:
    Feb 3, 2015
    Messages:
    1,037
    Likes Received:
    571
    Can you run it through those algos so we can compare?
     
  10. clone

    clone Audiosexual

    Joined:
    Feb 5, 2021
    Messages:
    10,348
    Likes Received:
    4,454
    Anything not open source just gets cracked anyway. If expensive commercial programs aren't cracked, you'll just be one of the people not using them.
     
  11. Djord Emer

    Djord Emer Audiosexual

    Joined:
    Sep 12, 2021
    Messages:
    1,291
    Likes Received:
    1,086
    Location:
    Taured
    If you have a capable GPU and time to tweak, UVR is your best bet.

    Otherwise, go with MVSEP. I wouldn't bother with RX Rebalance, much less LALAL.AI. They're honestly kids' toys compared to the more robust alternatives.
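
For anyone who wants to try the "split into stems, then stitch them back" idea mentioned earlier in the thread, here is a rough sketch of what the stitching amounts to, plus the 16-bit versus 24-bit difference in the exported samples. It is not how UVR5 or MVSEP work internally. The function names (`mix_stems`, `peak_normalize`, `to_pcm`) are made up for illustration, and it assumes the stems are already decoded to equal-length lists of floats in the range -1.0 to 1.0.

```python
def mix_stems(stems):
    """Sum equal-length stems sample by sample (hypothetical helper)."""
    if not stems:
        return []
    length = len(stems[0])
    if any(len(s) != length for s in stems):
        raise ValueError("all stems must have the same number of samples")
    return [sum(samples) for samples in zip(*stems)]


def peak_normalize(samples, ceiling=1.0):
    """Scale the mix down only if its peak exceeds the ceiling."""
    peak = max((abs(x) for x in samples), default=0.0)
    if peak <= ceiling:
        return list(samples)
    gain = ceiling / peak
    return [x * gain for x in samples]


def to_pcm(samples, bits=24):
    """Encode floats as little-endian signed PCM at 16 or 24 bits."""
    if bits not in (16, 24):
        raise ValueError("this sketch only handles 16-bit and 24-bit PCM")
    full_scale = 2 ** (bits - 1) - 1  # 32767 for 16-bit, 8388607 for 24-bit
    out = bytearray()
    for x in samples:
        x = max(-1.0, min(1.0, x))  # hard clip anything still out of range
        value = int(round(x * full_scale))
        out += value.to_bytes(bits // 8, "little", signed=True)
    return bytes(out)


# Assumed example data: two tiny "stems" (e.g. drums and bass).
drums = [0.5, -0.25]
bass = [0.25, 0.25]
instrumental = peak_normalize(mix_stems([drums, bass]))
pcm24 = to_pcm(instrumental, bits=24)  # 3 bytes per sample
pcm16 = to_pcm(instrumental, bits=16)  # 2 bytes per sample
```

The 24-bit export keeps 256 times as many quantization levels per sample as the 16-bit one, which is the difference being weighed against separation speed above. Summing only reconstructs the instrumental well when the stems come from the same separation run and are sample-aligned.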
     