Is RX12 Music Rebalance (for Vocal Removal) Superior to LALAL.AI?

m5g · May 4, 2026

mvsep is the best imo ) just built the app with Nativefier )

tzzsmk · May 4, 2026

Stevie Dude said: ↑

I’ve been tasked with removing vocals from an AI-generated track for a karaoke project.

Stevie Dude said: ↑

I’ve spent the last two hours wrestling with RX12’s newly improved Music Rebalance and SpectraLayers 12’s Unmix Song module. Neither is cutting it

UVR5 is the answer; the details have already been covered earlier in this thread.
Also, don't hesitate to experiment with models that split the track into multiple stems rather than just vocals + instrumental. You can stitch the stems back together easily, and the results may be better.
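
For anyone wondering what "stitching them back" amounts to: the instrumental is just the sample-by-sample sum of the non-vocal stems. A minimal sketch in plain Python, assuming equal-length mono stems already loaded as float lists in the -1.0..1.0 range. The function name `mix_stems`, the `ceiling` parameter, and the toy sample values are made up for illustration and are not part of UVR5 or any other tool.

```python
def mix_stems(stems, ceiling=1.0):
    """Sum equal-length mono stems sample by sample.

    The mix is scaled down only if its peak would exceed `ceiling`,
    so a sum that already fits is returned untouched.
    """
    if not stems:
        raise ValueError("need at least one stem")
    length = len(stems[0])
    if any(len(stem) != length for stem in stems):
        raise ValueError("stems must all have the same length")

    # zip(*stems) yields one tuple per sample position across all stems
    mixed = [sum(samples) for samples in zip(*stems)]

    peak = max((abs(x) for x in mixed), default=0.0)
    if peak > ceiling:
        gain = ceiling / peak
        mixed = [x * gain for x in mixed]
    return mixed


# Toy data standing in for drums / bass / other stems (hypothetical values)
drums = [0.5, -0.25, 0.0, 0.25]
bass = [0.25, 0.25, -0.5, 0.0]
other = [0.0, 0.25, 0.25, -0.125]

instrumental = mix_stems([drums, bass, other])
print(instrumental)
```

In practice you would do this on sample arrays read from the stem WAV files, or simply drop the stems into a DAW and bounce them; the arithmetic is the same.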

clone · May 4, 2026

m5g said: ↑

mvsep is the best imo ) just built the app with Nativefier )

Anything above 16-bit WAV output is possible on MVSEP if you're willing to pay or spend credits; wrapping the site with Nativefier has no impact on the output. On the free tier there is no 24-bit WAV output, which UVR5 gives you when run locally. MVSEP is way faster for me, but getting 24-bit WAV from UVR5 instead of 16-bit WAV from MVSEP makes the slower separation worthwhile. UVR5 also lets you set values for segment/chunk size, overlap, number of shifts, etc. Raising those makes things a lot slower for me, but I just run it on a different Mac and don't really care how long it takes.
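
To put the 16-bit vs 24-bit difference in numbers: the theoretical dynamic range of linear PCM is 20*log10(2^bits), which works out to roughly 6.02 dB per bit. A quick sketch of that textbook formula follows. The function name is made up here, and the figure says nothing about whether the difference is audible on a given separation.

```python
import math


def dynamic_range_db(bits):
    """Theoretical dynamic range of linear PCM at the given bit depth, in dB."""
    return 20 * math.log10(2 ** bits)


for bits in (16, 24):
    # 16-bit comes out at about 96.3 dB, 24-bit at about 144.5 dB
    print(f"{bits}-bit: {dynamic_range_db(bits):.1f} dB")
```

The extra headroom matters most if the stems are going to be processed further (gain changes, EQ, remixing) before the final export.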

wizardmoon2 · May 4, 2026

Your client has shit taste, just saying.

Riddim Machine · May 5, 2026

1176f said: ↑

Ironically, splitting AI-generated music is harder than splitting real music, as the elements sometimes aren't recognized as what they are, if that makes sense (sorta). There are in-depth explanations on YT somewhere.

This happens because of the way this music is generated. Since you're compiling a bunch of code into the task of performing a human feeling, the best it can do is copy and paste real performances of real musicians in an unnatural way to mimic those feelings. The results are strange instrument portamento, legatos and quantizations. Guitars that blend into polyphonic synths. All sorts of weird stuff. When you split, all those flaws start getting more attention, so it's not that RX can't split it, but the amount of crap under the hood once the masking is gone is tremendous.

Stevie Dude · May 10, 2026

Thanks everyone for all the suggestions. I completed the job with MVSEP (about a week ago, I think). The result is still bad IMO, but bearable and somehow acceptable, and I got paid for it. Thanks. I think I need to spend more time learning about everything to do with stem separation and shitty AI music and how it works.

wizardmoon2 said: ↑

Your client has shit taste, just saying.

I never expect anyone ever in this whole world to have better taste than me. So, it's just another day at the office and I got no problem with it.

curtified · May 11, 2026

Stevie Dude said: ↑

Thanks everyone for all the suggestions. I completed the job with MVSEP (about a week ago, I think). The result is still bad IMO, but bearable and somehow acceptable, and I got paid for it. Thanks. I think I need to spend more time learning about everything to do with stem separation and shitty AI music and how it works.

I never expect anyone ever in this whole world to have better taste than me. So, it's just another day at the office and I got no problem with it.

INSTRUMENTAL:
https://voca.ro/1nRJfPZtrdpj

PROCESSED v1:
https://voca.ro/17kkP07jvifi

PROCESSED v2:
https://voca.ro/1cqRRm8p7a3S

Sadly, the original AI song was generated on an older Suno model (maybe 3.5 or 4), from a time when there was more "shimmer" to the sound, so it's hard to tame that. But the stems might help you dial it in.

STEM GROUPS 4 MIXING:
https://pixeldrain.com/u/1CE19K6V

Another option is to run the instrumental back through Suno and have v5.5 generate a better-quality version.

Last edited: May 11, 2026

canbi · May 11, 2026

Both are garbage; use SCNet or a RoFormer ("rofo") model.

I'm happy that devs of real programs aren't using open models. The more gatekept good things are, the better.

curtified · May 11, 2026

canbi said: ↑

Both are garbage; use SCNet or a RoFormer ("rofo") model.

I'm happy that devs of real programs aren't using open models. The more gatekept good things are, the better.

Can you run it through those algos so we can compare?

clone · May 11, 2026

canbi said: ↑

the more gatekept good things are the better

Anything not open source just gets cracked anyway. If expensive commercial programs aren't cracked, you'll just be one of the people not using them.

Djord Emer · May 12, 2026

If you have a capable GPU and time to tweak, UVR is your best bet.

Otherwise, go with MVSEP. I wouldn't bother with RX Rebalance, much less LALAL.AI. They're honestly kids' toys compared to the more robust alternatives.
