GitHub serves an ugly 'unsupported browser' banner if requested with the default AB UA.
and so on...
While annoying, this can be handled for jobs explicitly targeting those sites using --useragent. However, outlinks to those sites from jobs on other sites would still be fetched with the default UA and get the degraded page. I propose adding a mechanism that overrides the user agent on a per-request basis: if the request URL matches a regular expression, the UA is overridden for that request. As an optimisation, the hostname could be matched first (possibly with wildcards covering any subdomain) so that the expensive regex matching is skipped for most requests. This may require changes in wpull.
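To make the proposal concrete, here is a rough sketch of how such a lookup could work. It is not wpull or ArchiveBot code: the names UARule, RULES, host_matches and pick_user_agent, the two UA strings and the example.org rule are all made up for illustration, and how the result would actually be hooked into a request is left open.

```python
import re
from typing import NamedTuple, Optional
from urllib.parse import urlsplit

# Placeholder values, not the real default or a real browser UA.
DEFAULT_UA = "ExampleBot/1.0"
BROWSER_UA = "Mozilla/5.0 (placeholder browser UA)"


class UARule(NamedTuple):
    """Hypothetical override rule: cheap host check first, optional regex second."""
    host_pattern: str                    # "github.com" or "*.example.org"
    url_regex: Optional[re.Pattern]      # None means "any URL on that host"
    user_agent: str


RULES = [
    UARule("github.com", None, BROWSER_UA),
    UARule("*.example.org", re.compile(r"^https?://[^/]+/app/"), BROWSER_UA),
]


def host_matches(pattern: str, host: str) -> bool:
    """Exact match, or '*.domain' matching any subdomain (but not the bare domain)."""
    if pattern.startswith("*."):
        return host.endswith(pattern[1:])
    return host == pattern


def pick_user_agent(url: str, rules=RULES, default: str = DEFAULT_UA) -> str:
    """Return the UA of the first matching rule, else the default UA."""
    host = urlsplit(url).hostname or ""  # hostname is already lowercased
    for rule in rules:
        # Cheap string comparison; the regex only runs if the host matched.
        if not host_matches(rule.host_pattern, host):
            continue
        if rule.url_regex is None or rule.url_regex.search(url):
            return rule.user_agent
    return default
```

With these example rules, pick_user_agent("https://github.com/ArchiveTeam") returns the override UA, while a URL on any host without a rule falls through to the default. Checking the host before the regex keeps the per-request cost close to a few string comparisons for the vast majority of URLs.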