3 pointsby hectormalot3 hours ago1 comment
  • hectormalot3 hours ago
    We're doing a lot with the realtime models. Happy to see a new release.

    Initial feel from a few calls is that it seems to perform better with alphanumeric inputs. Voice seems consistent. Recognition on a few tests seems to be somewhat better, especially did much better on the two 8-bit 8-kHz mulaw calls I tried.

    It does still struggle a bit with some specifics in other languages (e.g., that the Dutch/German pronunciation of 53 'fifty-three' is effectively 'three-and-fifty').