13 pointsby markhaslam5 hours ago4 comments
  • skybrian5 hours ago
    There's a bot summary in the Reddit thread that claims that a hallucination was mistaken for a leak. It does seem likely.
  • nis0s5 hours ago
    From reading about it, it seems to me Claude is a very limited agent, but it’s optimizing how to get a high score on leaderboards. I suspect most people don’t realize their interactions with Claude, both via web app and API, are used for training, regardless of your account subscription status. But yes, you can opt out.
    • bastawhiz5 hours ago
      I think it's equally likely that the property management company here has an incorrectly configured S3 bucket (or something like it) that has unintentionally exposed a bunch of leases. It makes more sense to me that a directory of hundreds or thousands of nearly-identical leases would be exposed online and scraped than the possibility that someone uploaded enough lease documents to Claude for them to all be included in training data. I'd be really surprised, actually, if any major AI company was taking uploaded documents and using them for training, since they're very, very likely to contain extremely sensitive data.
  • bobomonkey4 hours ago
    An LLM generating a bunch of text is to be expected.
  • boxingdog5 hours ago
    [dead]