2 points by badmonster 7 months ago | 1 comment
  • badmonster, 7 months ago
    A subtle but powerful insight: large multimodal models like CLIP don’t just learn individual concepts; they also depend heavily on how often those concepts appear together in the training data.
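
To make "how often concepts appear together" concrete, here is a minimal sketch that counts concept pairs over a handful of captions and scores each pair with pointwise mutual information (PMI). The captions, the concept list, and the helper names `cooccurrence_stats` and `pmi` are all made up for illustration. This is not how CLIP's training data was actually analyzed, just one plausible way to look at co-occurrence.

```python
from collections import Counter
from itertools import combinations
import math


def cooccurrence_stats(captions, concepts):
    """Count captions containing each concept and each pair of concepts."""
    single = Counter()
    pair = Counter()
    for caption in captions:
        words = set(caption.lower().split())
        # Sort so each pair is always stored under the same (a, b) key.
        present = sorted(c for c in concepts if c in words)
        single.update(present)
        pair.update(combinations(present, 2))
    return single, pair


def pmi(a, b, single, pair, n):
    """PMI in bits for concepts a and b over n captions; None if they never co-occur."""
    joint = pair[tuple(sorted((a, b)))]
    if joint == 0:
        return None
    return math.log2((joint * n) / (single[a] * single[b]))


# Toy data, purely hypothetical.
captions = [
    "a dog on the grass",
    "a dog running on grass",
    "a cat on a sofa",
    "a dog on a sofa",
]
concepts = {"dog", "cat", "grass", "sofa"}

single, pair = cooccurrence_stats(captions, concepts)
for a, b in combinations(sorted(concepts), 2):
    print(a, b, pair[(a, b)], pmi(a, b, single, pair, len(captions)))
```

In this toy set "dog" and "grass" show up together more often than chance would predict (positive PMI), while "cat" and "grass" never co-occur at all. A model trained on data like this has seen every concept individually, but not every combination, which is the gap the comment is pointing at.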