Skip to content

Feature request: deduplicate URLs when checking #1678

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
scarf005 opened this issue Apr 15, 2025 · 3 comments
Open

Feature request: deduplicate URLs when checking #1678

scarf005 opened this issue Apr 15, 2025 · 3 comments

Comments

@scarf005
Copy link

related: #1452

  • I'm using lychee to validate links in my blog's build artifacts.
  • however, this means links get called multiple times due to duplicate elements such as headers or footers.

[_site/etc/hm-session-vars.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/kde/kde5-lockscreen.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/www/css.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/index.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/kde/kinoite-sddm.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/kde/kde6-lockscreen.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/www/index.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/www/relay-devtools.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/kde/index.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/etc/hangul.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/scarf005.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/etc/index.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/etc/binary-search.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

[_site/404.html]:
     [429] https://www.gnu.org/licenses/agpl-3.0.en.html | Network error: Too Many Requests

🔍 237 Total (in 8s) ✅ 222 OK 🚫 14 Errors 👻 1 Excluded

in this case, the link to license (https://www.gnu.org/licenses/agpl-3.0.en.html) in footer is duplicated to all pages in build artifact, and sending 15 or them gives 429 Too Many Requests error.

@wkaisertexas
Copy link

I am also looking for this feature. If the user interface will not change to declutter duplicate requests, at least a cache of each request made type would help speed things up

@wkaisertexas
Copy link

Also, by cache I do not mean the --cache which persists between runs but one which would always be enabled and would be an in-memory hashmap for the results

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants