Vily's submissions | Hacker News

1.		LLaVA-Mini: Efficient Image and Video Large Multimodal Models (github.com/ictnlp)
		2 points by Vily on Jan 13, 2025 | 2 comments
2.		StreamSpeech: "All in One" model for simultaneous ASR, translation and TTS (github.com/ictnlp)
		2 points by Vily on June 17, 2024