
Add inference API of AMR #40

Merged · 20 commits · Oct 22, 2024

Conversation

h-munakata (Contributor)

Summary

  1. Add inference API of AMR
  • Add msclap to the dependencies
  • Add configuration for AMR in lighthouse/models.py
  • Implement encode_audio() separately from encode_video():

    model.encode_audio("api_example/1a-ODBWMUAE.wav")

  2. Modularize PANNs and CLAP
  • The AudioEncoder class in lighthouse/feature_extractor/audio_encoder.py only holds the model selector, while lighthouse/feature_extractor/audio_encoders/{clap_a | pann}.py implement the individual models.
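
A minimal sketch of this selector pattern, assuming hypothetical class names (ClapAudioEncoder and PannAudioEncoder stand in for the real implementations in clap_a.py and pann.py; this is not the merged code):

class ClapAudioEncoder:  # stand-in for audio_encoders/clap_a.py
    def encode(self, audio_path):
        ...  # CLAP-specific feature extraction lives here

class PannAudioEncoder:  # stand-in for audio_encoders/pann.py
    def encode(self, audio_path):
        ...  # PANNs-specific feature extraction lives here

class AudioEncoder:
    # Only a selector: no model-specific logic in this class.
    _ENCODERS = {'clap': ClapAudioEncoder, 'pann': PannAudioEncoder}

    def __init__(self, feature_name):
        if feature_name not in self._ENCODERS:
            raise ValueError(f'Unsupported audio feature: {feature_name}')
        self._encoder = self._ENCODERS[feature_name]()

    def encode(self, audio_path):
        return self._encoder.encode(audio_path)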

Future work

  • Add Gradio demo

@h-munakata (Contributor, Author)

The test has failed... I'm checking.

setup.py (outdated)

@@ -5,6 +5,6 @@
     version='0.1',
     install_requires=['easydict', 'pandas', 'tqdm', 'pyyaml', 'scikit-learn', 'ffmpeg-python',
                       'ftfy', 'regex', 'einops', 'fvcore', 'gradio', 'torchlibrosa', 'librosa',
-                      'clip@git+https://github.com/openai/CLIP.git'],
+                      'clip@git+https://github.com/openai/CLIP.git', 'msclap'],
Contributor:

Could you add msclap before 'clip@git+https://github.com/openai/CLIP.git'?
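
Applying that suggestion, the dependency list in setup.py would read:

install_requires=['easydict', 'pandas', 'tqdm', 'pyyaml', 'scikit-learn', 'ffmpeg-python',
                  'ftfy', 'regex', 'einops', 'fvcore', 'gradio', 'torchlibrosa', 'librosa',
                  'msclap', 'clip@git+https://github.com/openai/CLIP.git'],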

@@ -439,7 +439,7 @@ def check_valid_combination(dataset, feature, domain):
     is_valid = check_valid_combination(args.dataset, args.feature, args.domain)

     if is_valid:
-        option_manager = BaseOptions(args.model, args.dataset, args.feature, args.domain)
+        option_manager = BaseOptions(args.model, args.dataset, args.feature, False, args.domain)
Contributor:

Could you remove the magic number False? Instead, please define a variable X = False and then use it here for readability.
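
One way to apply this suggestion; the descriptive name use_audio is purely hypothetical, since the reviewer's X is only a placeholder for whatever the fourth BaseOptions argument actually controls:

# 'use_audio' is a hypothetical name for the fourth BaseOptions argument;
# naming the boolean documents the call site better than a bare False.
use_audio = False
option_manager = BaseOptions(args.model, args.dataset, args.feature, use_audio, args.domain)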

@awkrail (Contributor)

awkrail commented Oct 21, 2024

@h-munakata In addition, no new tests are provided. Hence, could you add some tests for the CLAP feature extractor and the AMR inference API?

@h-munakata (Contributor, Author)

> @h-munakata In addition, no new tests are provided. Hence, could you add some tests for the CLAP feature extractor and the AMR inference API?

I see. I will add some tests in tests/test_models.py later.
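
A sketch of what such a test could look like; the predictor class, weight filename, and predict() call are assumptions rather than the tests that were actually merged:

# tests/test_models.py (illustrative sketch, not the merged tests)
import torch
from lighthouse.models import CGDETRPredictor  # assumed predictor class

def test_amr_inference_api():
    device = 'cuda' if torch.cuda.is_available() else 'cpu'
    # Hypothetical weight filename; the CLAP-trained weights are to be hosted on Zenodo.
    model = CGDETRPredictor('weights/clap_cg_detr_clotho_moment.ckpt',
                            device=device, feature_name='clap')
    model.encode_audio('api_example/1a-ODBWMUAE.wav')  # inference API added in this PR
    prediction = model.predict('water is flowing')     # free-form text query
    assert prediction is not None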

@h-munakata (Contributor, Author)

For the test and the Gradio demo, I think {moment | qd | cg | ...}-detr models with CLAP features would be useful.
Should I prepare the pre-trained weights of all seven models trained with Clotho-Moment?

@awkrail (Contributor)

awkrail commented Oct 21, 2024

> For the test and the Gradio demo, I think {moment | qd | cg | ...}-detr models with CLAP features would be useful.
> Should I prepare the pre-trained weights of all seven models trained with Clotho-Moment?

Yes, I agree. Could you prepare the pre-trained weights on your Zenodo?

@h-munakata (Contributor, Author)

h-munakata commented Oct 21, 2024

> Yes, I agree. Could you prepare the pre-trained weights on your Zenodo?

Sure. I will upload them after training.

@awkrail (Contributor)

awkrail commented Oct 21, 2024

@h-munakata Sorry, I misunderstood your question. I think that in your paper, CG-DETR (or QD-DETR) achieved the highest performance, so there is no need for training. All you need to do is upload the currently trained models to Zenodo and make them accessible from Lighthouse.

@h-munakata (Contributor, Author)

My intention in training all models was to add CLAP as a feature to the double for loop over FEATURE and MODEL used in the demo and tests, to make them easier to handle.
As you said, I will stop training all models and instead define an AUDIO_FEATURE variable in the demo and tests.
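
A sketch of the loop structure being discussed; the list contents and the run_demo_test helper are illustrative, not the actual demo or test code:

MODELS = ['moment_detr', 'qd_detr', 'cg_detr']               # illustrative subset of the seven models
FEATURES = ['clip', 'clip_slowfast', 'clip_slowfast_pann']   # visual features
AUDIO_FEATURES = ['clap']                                     # kept separate instead of folded into FEATURES

def run_demo_test(model, feature):
    print(f'{model} + {feature}')  # stub standing in for the per-combination demo/test body

for feature in FEATURES:
    for model in MODELS:
        run_demo_test(model, feature)

for feature in AUDIO_FEATURES:  # the separate AUDIO_FEATURE loop
    for model in MODELS:
        run_demo_test(model, feature)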

> All you need to do is upload the currently trained models to Zenodo and make them accessible from Lighthouse.

I see. I'll upload the model in the next commit for the test.

@awkrail (Contributor)

awkrail commented Oct 21, 2024

@h-munakata BTW, could you finish implementing the web demo by tomorrow? I will tag the current version as v1.0, and I am wondering whether you can finish this implementation by then.

@h-munakata (Contributor, Author)

h-munakata commented Oct 21, 2024

> BTW, could you finish implementing the web demo by tomorrow?

Yes, I want to make it in time for the DCASE workshop the day after tomorrow.
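
A minimal sketch of what such a Gradio demo could look like; the predictor class, weight filename, and predict() call are the same assumptions as in the test sketch above, not the demo that was shipped:

import gradio as gr
from lighthouse.models import CGDETRPredictor  # assumed predictor class

# Hypothetical weight filename; the actual weights are the Zenodo ones discussed above.
model = CGDETRPredictor('weights/clap_cg_detr_clotho_moment.ckpt',
                        device='cpu', feature_name='clap')

def predict_moments(audio_path, query):
    model.encode_audio(audio_path)      # audio-only encoding added in this PR
    return str(model.predict(query))    # assumed moment-retrieval call

demo = gr.Interface(
    fn=predict_moments,
    inputs=[gr.Audio(type='filepath', label='Audio'), gr.Textbox(label='Text query')],
    outputs=gr.Textbox(label='Retrieved moments'),
    title='Audio Moment Retrieval (AMR)',
)
demo.launch()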

@awkrail merged commit c2d1d3e into line:main on Oct 22, 2024
2 checks passed