Backup_Repos/beets

mirror of https://github.com/beetbox/beets.git synced 2025-12-06 16:42:42 +01:00

Author	SHA1	Message	Date
Šarūnas Nejus	44fda7ca0a	lyrics: use another beatles song for Lyricsmode Lady Madonna apparently is gone from this website. ¯\_(ツ)_/¯	2025-08-30 23:10:22 +01:00
Šarūnas Nejus	b3d434f58f	Delegate attribute access to logging	2025-08-30 23:10:21 +01:00
Šarūnas Nejus	d93ddf8dd4	Do not use explicit indices for logging args when not needed	2025-08-30 23:10:21 +01:00
Šarūnas Nejus	1c16b2b308	Replace string concatenation (' + ') - Join hardcoded strings - Replace concatenated variables with f-strings	2025-08-30 23:10:15 +01:00
Šarūnas Nejus	4a361bd501	Replace format calls with f-strings	2025-08-30 18:42:26 +01:00
Šarūnas Nejus	adbd50b237	Move distance to a separate module	2025-05-31 19:17:43 +01:00
Šarūnas Nejus	509cbdcbe4	Move sanitize_pairs/choices from plugins to util module	2025-05-31 17:55:41 +01:00
Šarūnas Nejus	c490ac5810	Fix formatting	2025-05-07 10:41:01 +01:00
Šarūnas Nejus	b713d72612	translations: use a more distinctive separator I found that the translator would sometimes replace the pipe character with another symbol (maybe it got confused thinking the character is part of the text?). Added spaces around the pipe to make it more clear that it's definitely the separator.	2025-02-20 03:47:04 +00:00
Šarūnas Nejus	43032f7bc7	translations: make sure we do not re-translate	2025-02-20 03:47:04 +00:00
Šarūnas Nejus	7893766e4c	Improve flags structure and add tests	2025-02-20 03:47:04 +00:00
Šarūnas Nejus	c95156adcd	Refactor writing rest files	2025-02-20 03:47:04 +00:00
Šarūnas Nejus	d7201062a8	Resurrect translation functionality	2025-02-20 03:47:04 +00:00
Šarūnas Nejus	dab9a0d7c4	Bring back Tekstowo search It was my mistake to remove search earlier - I found that in many cases it works fine.	2025-01-27 10:56:54 +00:00
Šarūnas Nejus	7389f241f4	Do not search for Various Artists, split titles by ' / '	2025-01-27 10:56:53 +00:00
Šarūnas Nejus	39c479fcab	Google: add support for dainuzodziai.lt	2025-01-27 10:56:53 +00:00
Šarūnas Nejus	734bcc28a8	Append source to the lyrics	2025-01-27 10:56:53 +00:00
Šarūnas Nejus	bdc564a573	Tidy up handling of backends	2025-01-27 10:56:53 +00:00
Šarūnas Nejus	04054cac5c	Remove dependency existence checks I think we can make our life easier by removing these checks assuming that users follow the instructions in the docs.	2025-01-27 10:56:53 +00:00
Šarūnas Nejus	b2402b1634	Google: make sure we do not return the captcha text If we get caught by Cloudfare, it forwards our request somewhere else and returns some validation text response. To make sure that this text does not get assumed for lyrics, we can disable redirects for the Google backend, check the response code and raise if there's a redirect attempt. This source will then be skipped and the backend continues with the next one.	2025-01-27 10:56:53 +00:00
Šarūnas Nejus	07d372c13d	Google: prioritise Songlyrics and AZlyrics sources	2025-01-27 10:56:53 +00:00
Šarūnas Nejus	70554640e5	Create Html class for cleaning up the html text Additionally, improve HTML pre-processing: * Ensure a new line between blocks of lyrics text from letras.mus.br. * Parse a missing last block of lyrics text from lacocinelle.net. * Parse a missing last block of lyrics text from paroles.net. * Fix encoding issues with AZLyrics by setting response encoding to None, allowing `requests` to handle it.	2025-01-27 10:56:52 +00:00
Šarūnas Nejus	c5c4138d66	Google: Refactor and improve * Type the response data that Google Custom Search API return. * Exclude some 'letras.mus.br' pages that do not contain lyric. * Exclude results from Musixmatch as we cannot access their pages. * Improve parsing of the URL title: - Handle long URL titles that get truncated (end with ellipsis) for long searches - Remove domains starting with 'www' - Parse the title AND the artist. Previously this would only parse the title, and fetch lyrics even when the artist did not match. * Remove now redundant credits cleanup and checks for valid lyrics.	2025-01-27 10:56:52 +00:00
Šarūnas Nejus	12c5eaae5e	Unite Genius, Tekstowo and Google backends under the same interface	2025-01-27 10:56:52 +00:00
Šarūnas Nejus	745c5eb9f0	Genius: refactor and simplify	2025-01-27 10:56:52 +00:00
Šarūnas Nejus	54fc67b30a	Remove extract_text_between	2025-01-27 08:50:50 +00:00
Šarūnas Nejus	55b7824948	Replace custom unescape implementation by html.unescape	2025-01-27 08:50:50 +00:00
Šarūnas Nejus	8a1ce27421	lyrics: Do not write item unless lyrics have changed	2025-01-27 08:50:50 +00:00
Šarūnas Nejus	8bdc2c6cf0	lyrics: Add symbols for better visual feedback in the logs	2025-01-27 08:50:50 +00:00
Šarūnas Nejus	f94d2767f9	Use a single slug implementation Tidy up 'Google.is_page_candidate' method and remove 'Google.sluggify' method which was a duplicate of 'slug'. Since 'GeniusFetchTest' only tested whether the artist name is cleaned up (the rest of the functionality is patched), remove it and move its test cases to the 'test_slug' test.	2025-01-27 08:50:50 +00:00
Šarūnas Nejus	dd9f178fff	Do not try to strip cruft from the parsed lyrics text. Having removed it I fuond that only the Genius lyrics changed: it had en extra new line. Thus I defined a function 'collapse_newlines' which now gets called for the Genius lyrics.	2025-01-27 08:50:50 +00:00
Šarūnas Nejus	7c2fb31136	Leave a single chef in the kitchen	2025-01-27 08:50:50 +00:00
Šarūnas Nejus	cb29605bfd	Include class name in the log messages	2025-01-27 08:50:50 +00:00
Šarūnas Nejus	283c513c72	Centralise request error handling	2025-01-27 08:50:49 +00:00
Šarūnas Nejus	06eac79c0d	Centralize requests setup with requests.Session Improve requests performance with requests.Session which uses connection pooling for repeated requests to the same host. Additionally, this centralizes request configuration, making sure that we use the same timeout and provide beets user agent for all requests.	2025-01-27 08:50:49 +00:00
Šarūnas Nejus	c40db1034a	Make lyrics plugin documentation slightly more clear	2025-01-27 08:50:49 +00:00
Šarūnas Nejus	2ff57505d8	Apply dist_thresh to Genius and Google backends This commit introduces a distance threshold mechanism for the Genius and Google backends. - Create a new `SearchBackend` base class with a method `check_match` that performs checking. - Start using undocumented `dist_thresh` configuration option for good, and mention it in the docs. This controls the maximum allowable distance for matching artist and title names. These changes aim to improve the accuracy of lyrics matching, especially when there are slight variations in artist or title names, see #4791.	2025-01-27 08:50:48 +00:00
Šarūnas Nejus	bb5f3e0593	lyrics: sort lrclib lyrics by synced field and query search first I found that the `/get` endpoint often returns incorrect or unsynced lyrics, while results returned by the `/search` more accurate options. Thus I reversed the change in the previous commit to prioritize searching first.	2025-01-20 13:14:37 +00:00
Šarūnas Nejus	33aafdd50b	Remove trailing spaces in synced lyrics lines without text	2025-01-19 18:39:56 +00:00
Šarūnas Nejus	618c3a21a6	Try to GET LRCLib lyrics before searching	2025-01-19 18:39:54 +00:00
Šarūnas Nejus	2fb72c65a5	lyrics/LRCLib: handle instrumental lyrics	2025-01-19 15:19:44 +00:00
Šarūnas Nejus	30379bca38	Update lyrics.sources configuration to prioritize lrclib	2025-01-19 15:19:44 +00:00
Šarūnas Nejus	a398fbe62d	LRCLib: Improve exception handling	2025-01-19 15:19:44 +00:00
Šarūnas Nejus	8d4a569291	Fix fetching lyrics from lrclib Adjust the base URL to perform a '/search' instead of attempting to '/get' specific lyrics where we're unlikely to find lyrics for the specific combination of album, artist, track names and the duration (see https://lrclib.net/docs). Since we receive an array of matching lyrics candidates, rank them by their duration similarity to the item's duration, and whether they contain synced lyrics.	2025-01-19 15:19:41 +00:00
Šarūnas Nejus	c250bfa724	Google: test the entire fetch method	2025-01-19 01:48:04 +00:00
Šarūnas Nejus	334bbde826	Make album, duration required for LyricsPlugin.fetch Since at least one Backend requires album` and `duration` arguments (`LRCLib`), the caller (`LyricsPlugin.fetch_item_lyrics`) must always provide them. Since they need to provided, we need to enforce this by defining them as positional arguments. Why is this important? I found that integrated `LRCLib` tests have been passing, but they called `LRCLib.fetch` with values for `artist` and `title` fields only, while the actual functionality always provides values for `album` and `duration` fields too. When I adjusted the test to provide values for the missing fields, I found that it failed. This makes sense: Lib `album` and `duration` filters are strict on LRCLib, so I was not surprised the lyrics could not be found. Thus I adjusted `LRCLib` backend implementation to only filter by each of these fields when their values are truthy.	2025-01-19 01:48:04 +00:00
Šarūnas Nejus	0a12d07a94	Do not attempt to fetch lyrics with empty data Modified `search_pairs` function in `lyrics.py` to: * Firstly strip each of `artist`, `artist_sort` and `title` fields * Only generate alternatives if both `artist` and `title` are not empty * Ensure that `artist_sort` is not empty and not equal to artist (ignoring case) before appending it to the artists Extended tests to cover the changes.	2025-01-19 01:48:04 +00:00
Šarūnas Nejus	3b73a26002	Address failing google sources tests Two google sources failed to return the expected output. I looked into each case why parsing failed: - lyrics on musica.com contain <aside> Google Ads - each lyrics line on lacoccinelle.net is wrapped within alternating <em> and <strong> tags Thus remove these tags as part of the HTML cleanup logic.	2025-01-19 01:32:17 +00:00
Edgars Supe	09360259cc	lyrics: Fallback to plain lyrics if synced not available	2024-12-07 19:08:37 +02:00
Šarūnas Nejus	d3955bac65	Update Tekstowo backend to fetch lyrics directly - Refactored Tekstowo backend to fetch lyrics directly from song pages. - Added `encode` method to convert artist and title to their URL format, where non-alphanumeric characters are replaced with underscores. - Removed the now redundant search functionality and associated tests. - Simplified `extract_lyrics` method to directly parse lyrics without any checks.	2024-10-12 02:14:18 +01:00

1 2 3 4 5 ...

352 commits