33 pointsby bitbashera year ago3 comments
  • alphan0na year ago
    Have you verified that they are actually Amazon crawlers as outlined here:

    https://developer.amazon.com/amazonbot

    • xenaa year ago
      Yes.
      • alphan0na year ago
        Can you share the list of offending ip’s, here or your website so we can use them in block lists?

        Also, there is an email to contact:

        amazonbot@amazon.com

        • xenaa year ago
          The website that you were pummeling is in the article: git.xeserv.us. I sent an email earlier today and have gotten no response.

          Right now your crawler bots are getting the bee movie script, so you may want to delete all the data that's being scraped from that domain. Unless you like jazz that is.

          It'd be a gesture of good faith to remunerate me for the egress fees your bot incurred, but I'm not gonna die on that hill.

          • alphan0na year ago
            Apologies, I’m not affiliated with Amazon in any way.

            I meant the Amazon ip addresses that are causing you trouble so I can preemptively block them.

  • JSTradinga year ago
    How much of a problem is it?
    • xenaa year ago
      3Ti of egress and climbing, I'm in the hole financially and it's making my personal infra that relies on it unstable.
      • jshearda year ago
        Damn that's rude. At least you appear to be using Vultr, imagine if it was running on one of those newfangled cloud providers which mark bandwidth up by a few orders of magnitude...
        • xenaa year ago
          It's actually slightly worse. That vultr node is a reverse proxy over wireguard to my homelab.
          • bitbashera year ago
            Remove the gittea instance for now until it's sorted out? Respond to all git.* traffic with a 420 until it's sorted out.
  • a year ago
    undefined