-
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Paper • 2306.02858 • Published • 18 -
openbmb/MiniCPM-Llama3-V-2_5
Visual Question Answering • Updated • 86.1k • 1.33k -
2Noise/ChatTTS
Text-to-Audio • Updated • 7.33k • 1.29k -
meta-llama/Meta-Llama-3-8B
Text Generation • Updated • 2.16M • 5.67k
josephgu
josephgu
AI & ML interests
None yet
Organizations
None yet
Collections
1
datasets
None public yet