Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A little question about the thesis #3

Open
lijain opened this issue Mar 1, 2023 · 2 comments
Open

A little question about the thesis #3

lijain opened this issue Mar 1, 2023 · 2 comments

Comments

@lijain
Copy link

lijain commented Mar 1, 2023

Thanks for your work, when reading your article, I have a bit of a problem with page 4(AP grows from
44.5 AP to 51.7 (+7.2 AP)), replacing the last layer with intermediate results has greatly improved, but the experimental results have not improved much than felt. Does that mean that the optimal result may be distributed differently in different stages
image
In addition, can TP F Rate be more detailed?

@Fangyi-Chen
Copy link
Owner

Fangyi-Chen commented Mar 1, 2023

Hi,

Replacing P6 with the optimum from P1~6 is an idealized investigation - We do this to estimate the upper bound of potential when the problem is fully resolved. But clearly, it is not easy to fully solve. SQR only partially solved this, as you can see from the decrease in TP F rating and FP E rating in Table 6. So the AP improvement of SQR is still far away from the upper bound. However, as one of the reviewers claimed, "a first step to tackling this problem is worthwhile."

Regarding the distribution, maybe Table 3 in the paper can give you the expected information?

In addition, TP Fading Rate studies if a query could predict TP in early stages, but in the final stage, for some reason, the TP is worse than the former ones. As shown in fig 2 traffic light example, The TP is 'fading'. Its scientific description is detailed in the Sec.3. We also observe that more than half of the occurred cases are marginally triggered, i.e. the predictions from the triggered 1∼5 are only marginally better than the sixth. This is a further reason why the deformable DETR has those high rates - the results from the 5th and 6th stages are extremely visually close. In a dominated number of cases, the final stage is (one of) the best, after all, its mAP is the highest.

Best regards

@lijain
Copy link
Author

lijain commented Mar 3, 2023

Ok, thanks for the answer

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants