Skip to content

v0.4.2

Latest

Choose a tag to compare

@li-plus li-plus released this 31 Jul 06:12
60c89b7
  • Apply flash attention on vision encoder for lower first-token latency.
  • Fix metal compilation error on Apple silicon chips.