Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lyrics: Use distance threshold for Genius search matches #5453

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

snejus
Copy link
Member

@snejus snejus commented Oct 8, 2024

Description

This PR introduces a distance threshold mechanism to the Genius backend and unifies its implementation across the rest of backends that perform searching and matching artists and titles.

  • Create a new SearchBackend base class with a method check_match that performs matching.
  • Start using undocumented dist_thresh configuration option for good, and mention it in the docs. This controls the maximum allowable distance for matching artist and title names.

These changes aim to improve the accuracy of lyrics matching, especially when there are slight variations in artist or title names.

Fixes #4791

To Do

  • Documentation. (If you've added a new command-line flag, for example, find the appropriate page under docs/ to describe it.)
  • Changelog. (Add an entry to docs/changelog.rst to the bottom of one of the lists near the top of the document.)
  • Tests. (Very much encouraged but not strictly required.)

@snejus snejus self-assigned this Oct 8, 2024
@snejus snejus force-pushed the lyrics-fix-genius-artist-matching branch 4 times, most recently from adc2d20 to ad04330 Compare October 9, 2024 12:29
@snejus
Copy link
Member Author

snejus commented Oct 11, 2024

Have just discovered that it is not anymore possible to search lyrics on Tekstowo since search results are now shown within a Google container which is not parseable.

@snejus
Copy link
Member Author

snejus commented Oct 11, 2024

Going to move the last commit with tekstowo change to a separate PR and handle this issue there.

@snejus snejus changed the base branch from master to lyrics-fix-tekstowo October 12, 2024 01:26
@snejus snejus force-pushed the lyrics-fix-genius-artist-matching branch from ad04330 to f479cfa Compare October 12, 2024 01:26
Base automatically changed from lyrics-fix-tekstowo to master October 12, 2024 21:52
@snejus snejus force-pushed the lyrics-fix-genius-artist-matching branch from f479cfa to f305c89 Compare October 12, 2024 22:21
This commit introduces a distance threshold mechanism for the Genius and
Google backends.

- Create a new `SearchBackend` base class with a method `check_match`
  that performs checking.
- Start using undocumented `dist_thresh` configuration option for good,
  and mention it in the docs. This controls the maximum allowable
  distance for matching artist and title names.

These changes aim to improve the accuracy of lyrics matching, especially
when there are slight variations in artist or title names, see #4791.
@snejus snejus force-pushed the lyrics-fix-genius-artist-matching branch from f305c89 to 4bf1061 Compare October 13, 2024 15:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

lyrics: In Genius backend, tolerate artist disambiguation markers
1 participant