Experience FastVLM's real-time vision capabilities with live camera input. This demo provides instant visual understanding and captioning.
This demo runs FastVLM directly in your browser with live camera access for real-time visual understanding and captioning.
This interactive demo showcases FastVLM's real-time vision-language capabilities using your device's camera. The model processes live video feed to provide instant visual understanding and captioning. Camera access is required for the demo to function properly. The demo uses WebGPU for accelerated inference, ensuring smooth real-time performance.
This demo requires access to your device's camera for live video captioning. Please allow camera permissions when prompted.