-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Post your model #1
Comments
Model Type: bs_roformer How to run [example for vocals]: Download config and checkpoint, save to folder with @ZFTurbo training code, run this command:
Input files in input folder, results in separation_results folder DEMO: Vocals: Drums: Bass: Last update: |
Model Type: mel_band_roformer SDR: 6.86 Updatet on: 14.06.2024 18:20:00 UTC+3 |
Model Type: mel_roformer |
Model Type: mel_band_roformer |
@alexclarke236 Would you like to share the checkpoints you've trained ? Best way is to host them on a file-sharing site and post the link here like previous users have done. |
Yes
…On Tue, Jun 11, 2024 at 8:15 PM Jarredou ***@***.***> wrote:
@alexclarke236 <https://github.com/alexclarke236> Would you like to share
the checkpoints you've trained ? Best way is to host them on a file-sharing
site and post the link here like previous users have done.
—
Reply to this email directly, view it on GitHub
<#1 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/BJBMRZEKBP32EHV2GY5IYCDZG6HIVAVCNFSM6AAAAAA67RO3LWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCNRRHAZDSOBRGM>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Architecture: MDX23c |
Same model as above, but trained for a further 54 epochs. Sounds better to the ears than the older model in quite a few cases, but scores a lower SDR on the validation set. Picks up wind instruments better than older model in my testing and has the chance to pick up string sections better too. Edit: 3/08/2024, currently retraining this model with a larger dataset but using same machine, so will take a while. Results will be posted here if it gets anywhere decent |
Description: Instruments: Dataset: Metrics: Config & checkpoint : https://github.com/jarredou/models/releases/tag/aufr33-jarredou_MDX23C_DrumSep_model_v0.1 |
@anvuew thank you for great model. I have a question about your training. Did you apply reverb for full tracks or for vocal part only? Can you share your validation? I'd like to compare your model with older ones. |
noreverb is vocal only. valid is too inadequate to share. the MDX Reverb-HQ SDR is 6.5 for my valid. |
For those who have issues running Roformers from this thread in UVR, you must delete the following line from the YAML file: |
Description:bs_roformer dereverb model Metrics:SDR noreverb: 8.0770(small valid) Config link: config Although this is a dereverb model, it will also remove harmonies or vocal effects that are not in the center channel. If you want to add this model to UVR5, first place the config file and weights in the corresponding directory (weights in |
Congratulations on these fine dereverb models! |
i need model separation include sdr vocals after separation higher than sdr vocals 12.97 pleasse |
If you mean 2024.03 model, t's the same. 12.97 metric comes from private
validation dataset which wasn't multisong dataset.
All newer Roformers with better metrics are currently not public, so cannot
be downloaded.
niedz., 28 lip 2024 o 17:00 lgkt ***@***.***> napisał(a):
… BS Roformer (viperx) is 12.9755?it is old.The latest one is BS Roformer
(finetuned) shown in the image,but where to download? who can tell me?
image.png (view on web)
<https://github.com/user-attachments/assets/81202ec1-e2da-4206-be19-5b216533a0f8>
—
Reply to this email directly, view it on GitHub
<#1 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AIJ3EHDRXHB5XXSJTCARKUDZOUBP5AVCNFSM6AAAAAA67RO3LWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDENJUGU2DONZXGE>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***
com>
|
Aspiration_Mel_Band_Roformermodel repo: https://huggingface.co/Sucial/Aspiration_Mel_Band_Roformer Description: The model is used to separate aspiration, which will be useful for mixing to some mixrs. Configs: config_aspiration_mel_band_roformer.yaml Model: aspiration_mel_band_roformer_sdr_18.9845.ckpt Model: aspiration_mel_band_roformer_less_aggr_sdr_18.1201.ckpt |
To use it in UVR5 GUI, use this config (thx Essid)
https://drive.google.com/file/d/1IrzeCliVNS8zuSJ51IbACMN9FNuza-hE/view?usp=sharing |
Description: Instruments: Dataset: Metrics: Model_2 (ep_271): Downloads: |
@wesleyr36 can you explain what this model do? What are the use cases? |
It extracts the phantom centre from stereo audio i.e. the content that is the same between the two channels and is percieved to be in the middle. It's main intended usecase was inside of a similarity extractor recreation which can be done in the following way:
|
Description: SYH99999/MelBandRoformerSYHFTV2 Please Check the quality on your ears. |
Dereverb-Echo_Mel_Band_RoformerHuggingface link: https://huggingface.co/Sucial/Dereverb-Echo_Mel_Band_Roformer
Metrics: Based on the sdr value of 30 songs for validation.
Training logs: train.log, Tensorboard image: tensorboard.png Thanks |
Huggingface model link: https://huggingface.co/Sucial/Chorus_Male_Female_BS_Roformer Recently, I attempted to train a model for separating male and female voices in choir singing, and the results were quite good, far exceeding my expectations. However, due to the lack of a certain degree of universality in the training and validation data (all the training and validation data used were Chinese songs), I personally classify this model as an experimental model. The model can separate the male and female voices in a chorus. However, if male and female are singing at intervals (one by one), they cannot be separated. The model separation effect can be heard here! I used a total of 750 songs for training, of which 700 were used as the training set and 50 as the validation set. All the songs are from opencpop and m4singer datasets. Fine tuning training from Of these,
Thanks to CN17161 for the GPU math support! |
I used more data (1000+ songs) and the same reverb&delay creation script and fine-tuned this model. Now, the value of |
Thank you for sharing the great dereverb model. The model works fine on most of the audio but I found some bad cases when the reverb is strong. The original model considers most of the sound waves as 'other', and 'dry' is silent in many time. The fine-tuned v2 model improves the performance but the problem still exists. You can find the files before and after the model here. Please have a look at them. I think increasing the reverberation intensity during the training phase may help. You may need to add a penalty term to the loss function when the target stem (dry) is small enough (silent) because we can assume that the reverb audio is totally generated from the dry audio by convolution with the impulse response. |
@happyTonakai @deton24 First of all, I must acknowledge that my v2 model is still somewhat behind the anvuew's v2 model in certain aspects. Over the past few days, I have continued to fine-tune my model, focusing on handling large reverb, an area where the anvuew's v2 model has limitations. I also took note of the issue raised by happyTonakai: "some bad cases when the reverb is strong. The original model considers most of the sound waves as 'other,' and 'dry' is silent in many cases." In response to this, I made some adjustments to the validation code and trained two new models specifically targeting large reverb removal. After training, I combined these two models with my v2 model through a blending process, to better handle all scenarios. At this stage, I am still unsure whether my new models outperform the anvuew's v2 model overall, but I can confidently say that they are more effective in removing large reverb. You can listen to sample outputs here. For more detailed information about these two models and the blended version, please refer to the README.md in the HuggingFace model repository. Here are the three new models: Update: |
To post your model, please, fill the form:
The text was updated successfully, but these errors were encountered: