GBH Series Searches not accurate #188

Open
ekemeyer opened this issue Jan 31, 2025 · 1 comment
Labels
bug 🐛 Something isn't working

Comments

@ekemeyer

Description

In the series list (https://openvault.wgbh.org/series), clicking a series title opens an AAPB search. The relevant results appear first, but there are clearly too many results overall. Can we improve the search query, or cut off the results at a certain point? Is this a Solr issue?

Reproduction steps

  1. Go to the GBH series list (https://openvault.wgbh.org/series)
  2. Click on one of the series titles
  3. See the AAPB results

Expected behavior

Only relevant results should appear.

Screenshots

![DESCRIPTION](LINK.png)

Browsers

No response

OS

No response

Additional context

No response

@mrharpo mrharpo added the bug 🐛 Something isn't working label Feb 4, 2025

mrharpo commented Feb 4, 2025

Example

Using the first series example: The AIDS Quarterly

Existing link

https://americanarchive.org/catalog?q=(+(series_titles:%20The%20AIDS%20Quarterly%20OR%20program_titles:%20The%20AIDS%20Quarterly)%20AND%20(contributing_organizations:%20WGBH(MA)%20OR%20producing_organizations:%20WGBH%20Educational%20Foundation))&f[access_types][]=all
( (series_titles: The AIDS Quarterly OR program_titles: The AIDS Quarterly) AND (contributing_organizations: WGBH(MA) OR producing_organizations: WGBH Educational Foundation) )

☝ This yields 51,440 results, most of which are not relevant.
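
For anyone reproducing this, the existing link can be decoded back into its query parameters to confirm exactly what is sent to the catalog. This is a minimal sketch using only the Python standard library; `EXISTING_LINK` and `decode_params` are illustrative names, not part of the Open Vault or AAPB code.

```python
from urllib.parse import urlsplit, parse_qs

# The existing link, copied verbatim from the comment above.
EXISTING_LINK = (
    "https://americanarchive.org/catalog?q=(+(series_titles:%20The%20AIDS%20Quarterly"
    "%20OR%20program_titles:%20The%20AIDS%20Quarterly)%20AND%20"
    "(contributing_organizations:%20WGBH(MA)%20OR%20producing_organizations:"
    "%20WGBH%20Educational%20Foundation))&f[access_types][]=all"
)


def decode_params(url: str) -> dict:
    """Return the URL's query parameters, percent-decoded, keeping the first value of each."""
    return {key: values[0] for key, values in parse_qs(urlsplit(url).query).items()}


params = decode_params(EXISTING_LINK)
print(params["q"])                  # the decoded search query; titles are not quoted
print(params["f[access_types][]"])  # the access-type facet
```

The decoded `q` value shows that neither the titles nor the organization names are wrapped in double quotes.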

Improvements

Quote titles

( (series_titles: "The AIDS Quarterly" OR program_titles: "The AIDS Quarterly") AND (contributing_organizations: WGBH(MA) OR producing_organizations: WGBH Educational Foundation) )

=> 1,229 results

  • Almost exclusively "The AIDS Quarterly" content

  • Final page includes 3 Fresh Air episodes that include the exact phrase "The AIDS Quarterly" in the descriptions

  • Adding quotes to the contributing/producing organizations seems to explode the results to 400k+
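
One way the link generation could apply the quoted-title improvement is sketched below. This is an assumption-laden illustration, not the current Open Vault implementation: `quote_phrase`, `series_query`, and `series_url` are hypothetical helpers, and the organization clause is deliberately left unquoted because quoting it reportedly inflates the result count.

```python
from urllib.parse import quote

AAPB_CATALOG = "https://americanarchive.org/catalog"

# Organization clause copied unquoted from the existing query (assumption: keep it as-is).
ORG_CLAUSE = (
    "(contributing_organizations: WGBH(MA) "
    "OR producing_organizations: WGBH Educational Foundation)"
)


def quote_phrase(title: str) -> str:
    """Wrap a title in double quotes, escaping backslashes and embedded quotes."""
    escaped = title.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def series_query(title: str) -> str:
    """Build the search query with the title matched as an exact phrase."""
    phrase = quote_phrase(title)
    return f"( (series_titles: {phrase} OR program_titles: {phrase}) AND {ORG_CLAUSE} )"


def series_url(title: str) -> str:
    """Build a catalog URL for a series title (hypothetical helper)."""
    encoded = quote(series_query(title), safe="():")
    return f"{AAPB_CATALOG}?q={encoded}&f[access_types][]=all"


print(series_url("The AIDS Quarterly"))
```

Escaping embedded quotes matters for any series title that itself contains a double quote, which would otherwise end the phrase early.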
