47 points by RIshabh235 an hour ago | 13 comments
  • jiehong 3 minutes ago
    For those who haven't tried it: this lets DeepSeek understand a picture (instead of just extracting text from it) and describe what's in it. It is not an image generation system, though, so you can't ask it to modify an image.

    Personally, I'm a bit surprised the DS chat app still doesn't offer its own text-to-speech and speech-to-text features (I know DS doesn't have an ASR model of its own, for example, but there are quite a few open ones).

  • rcMgD2BwE72F 43 minutes ago
    It points to https://chat.deepseek.com/sign_in for me, which is just a login screen. Is there any page with some info?
  • tornikeo 15 minutes ago
    I really need this as an API.

    It turns out that to use the Claude Agent SDK, you need a vision-enabled API. If the DeepSeek API could see, it could fully drive Claude Code and the Claude Agent SDK. A project I'm working on relies on a Claude-in-CloudflareWorker setup, and I've been relying on Qwen and Gemini Flash Lite, both more expensive than DeepSeek.

    Can't wait to have it available on DeepSeek.

  • arjie 21 minutes ago
    If they'd do one of those little extraneous additions like Qwen does, so that I can have DS4 Flash with Vision, that would be great. Right now I have to run a separate model entirely to get vision, and I'd prefer to put it all in one place.
  • bjoli 41 minutes ago
    What has been going on with DeepSeek recently? I have gotten lots of replies in Chinese and, even more frequently, reasoning in Chinese as well.

    Is it a new silent update?

    • alfiedotwtf 3 minutes ago
      Are you running out of context? I’ve found that tooling issues and gibberish mostly happen when I’m butting up against the high-water mark of my context window. One other thing it could be: I’ve read that lower quants like Q1 and Q2 of smaller models can leak Chinese.
    • abyssin 37 minutes ago
      It doesn’t seem that recent to me; it has been like that for at least six months.
    • Shank 38 minutes ago
      Well, it is a Chinese model, maybe it thinks better in Chinese?
    • RIshabh235 35 minutes ago
      Yes, it's probably some kind of silent update. They might also have better Chinese datasets and user data for training, which could be leading to the Chinese preference.
    • epolanski 24 minutes ago
      It never happened to me with Deepseek, but it happened multiple times with Kimi 2.6.

      It also happened a handful of times with Anthropic models.

  • earth2mars 39 minutes ago
    And it's really good and fast. I have tested it with a bunch of odd photos, asking what is happening in them. Overall, the training set seems large enough for it to know what's what and where.
    • RIshabh235 34 minutes ago
      Yes, and I hope their rate of shipping increases after the recent funding.
  • crvdgc an hour ago
    Vision has been in A/B testing for a while now (at least in China). Is there an official announcement that this will be available for everyone?
    • RIshabh235 37 minutes ago
      I haven't seen any official announcement yet. It works for me, though.
  • innis226 an hour ago
    Nice, is this available in the API now as well?
    • naseemali925 27 minutes ago
      I am also waiting on vision support in the API. It's the only thing blocking me from buying their subscription.
    • RIshabh235 40 minutes ago
      Not in the API yet.
  • hklohani 16 minutes ago
    [flagged]
  • ValveFan6666 an hour ago
    [dead]
  • andrewstuart 30 minutes ago
    OpenAI and Anthropic need to get this free foreign competition banned.
    • epolanski 23 minutes ago
      Care to expand on why? Or did you forget the /s at the end?
      • dudisubekti 13 minutes ago
        I feel like '/s' has ruined irony on the internet. Irony is at its best if left ambiguous, lol.
        • Weryj a minute ago
          Wait, did that need a /s?