Interactive Demo

Experience FastVLM's real-time vision capabilities with live camera input. This demo provides instant visual understanding and captioning.

FastVLM Live Camera Demo

This demo runs FastVLM directly in your browser with live camera access for real-time visual understanding and captioning.

About This Demo

This interactive demo showcases FastVLM's real-time vision-language capabilities using your device's camera. The model processes live video feed to provide instant visual understanding and captioning. Camera access is required for the demo to function properly. The demo uses WebGPU for accelerated inference, ensuring smooth real-time performance.

Camera Access Required

This demo requires access to your device's camera for live video captioning. Please allow camera permissions when prompted.

Troubleshooting Camera Issues

  • Check if camera permissions are blocked in your browser settings
  • Try refreshing the page and allowing access when prompted
  • Ensure no other apps are currently using your camera
  • Try using a different browser or device if issues persist
  • Make sure your device has a working camera