| | LLaVA-Mini: Efficient Image and Video Large Multimodal Models (github.com/ictnlp) |
| 2 points by Vily on Jan 13, 2025 | past | 2 comments |
|
| | Auto-RAG (github.com/ictnlp) |
| 2 points by taikon on Dec 6, 2024 | past |
|
| | Llama 3.1 Omni Model (github.com/ictnlp) |
| 304 points by taikon on Sept 18, 2024 | past | 41 comments |
|
| | StreamSpeech: "All in One" model for simultaneous ASR, translation and TTS (github.com/ictnlp) |
| 2 points by Vily on June 17, 2024 | past |
|
| | StreamSpeech (github.com/ictnlp) |
| 7 points by eddieweng on June 8, 2024 | past |
|