VScan

Verified safeOpen sourceExclusive
No reviews reviews
48
downloads
7.0
android

AI summary

VScan uses vision LLMs to help blind users perceive their surroundings through a smartphone camera. Create custom cognitive functions by pairing camera settings with AI prompts for tasks like object detection, scene description, or sign reading. Open-source project; requires camera, microphone, storage, and internet permissions.

Generated by AI. May contain inaccuracies.

About this app

This is a little project of mine aiming to research how vision LLMs could help out blind people on travel and in their every-day life by substituting eyesight for various visual tasks. VScan turns your smartphone's camera into a device for visual perception. You can define various optical cognitive functions, like looking for objects, signs, evaluating a scene or simply mediating visual impressions. You can afterwards use these functions on the camera view, just like a sighted person would use their eyes to achieve a specific goal in the physical world.

Each cognitive tool consists of two major parts:

The camera to be used - front / back, as well as camera parameters - resolution, flashlight etc.

The prompts used for LLM processing. LLM is the bridge between raw pixel data and your interpretation of it, and in the user/system prompt, you can specify what exactly are you interested in for the particular function and how should it be communicated, as well as the LLM model that should be used.

Camera input in combination with an LLM processing prompt forms a cognitive function, which can be used to serve various visual tasks.

About this version

Version
0.2.3 (23)
Size
6.64 MB
Requires Android
7.0
Target SDK
24
Architecture
arm64-v8a, armeabi-v7a, x86, x86_64
Downloads
48
Updated
Jun 11, 2026
Package
com.rastislavkish.vscan

Ratings & reviews

0 ratings
  • 5
    0
  • 4
    0
  • 3
    0
  • 2
    0
  • 1
    0

Write a review

Tap a star to rate this app