Hacker News: Vily's submissions
1. LLaVA-Mini: Efficient Image and Video Large Multimodal Models (github.com/ictnlp)
2 points by Vily on Jan 13, 2025 | 2 comments
2. StreamSpeech: "All in One" model for simultaneous ASR, translation and TTS (github.com/ictnlp)
2 points by Vily on June 17, 2024
