Running a multimodal LLaVA model, camera, and speech synthesis
Originally appeared here:
A Weekend AI Project: Making a Visual Assistant for People with Vision Impairments