Application part 2

 Disclosure: Application for Video Playback and Scene-Based Image Analysis for Visually Impaired Users


Title


“Video Playback for Visually Impaired Users with Scene-Based Image Analysis and Tactile Interaction”

Abstract


This disclosure describes a second feature of an application that helps visually impaired users interpret videos. The application includes a video playback feature in which videos from a source such as YouTube are opened and processed into sequential still images. The software detects significant scene changes (e.g., new angles, wider or closer shots) and presents each new scene as a still image. Users explore each still image by touch to find highlighted features, guided by audio feedback. The app lets users listen to the video content while interactively exploring the images by touch, providing a multisensory experience.

Technical Description

1. Video Source Integration:

The application allows users to import videos directly from platforms like YouTube or upload locally stored videos.

A playback option within the app either processes videos in real time during playback or pre-processes them for scene analysis before playback begins.

2. Scene-Based Image Analysis:

The software analyzes the video stream and detects significant scene changes:

New Camera Angles: Identifies a shift in perspective or angle.

Zoom Changes: Detects transitions to wider or closer shots.

Visual Variations: Recognizes substantial differences in lighting, objects, or composition.

Upon detecting a scene change, the app freezes the video and converts the frame into a still image.
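
The detection step above can be sketched as follows. This is a minimal illustration, not the method specified by this disclosure: it assumes frames are already decoded into grayscale grids (rows of 0-255 values) and flags a new scene when a frame's brightness histogram differs enough from the last kept still. The function names, the 16-bin histogram, and the 0.4 threshold are all hypothetical choices. A plain histogram comparison responds to lighting and composition changes but can miss a change of camera angle that leaves overall brightness unchanged.

```python
from typing import List, Sequence

Frame = Sequence[Sequence[int]]  # grayscale frame: rows of 0-255 pixel values


def histogram(frame: Frame, bins: int = 16) -> List[float]:
    """Return a normalized brightness histogram of the frame."""
    counts = [0] * bins
    total = 0
    for row in frame:
        for pixel in row:
            counts[min(pixel * bins // 256, bins - 1)] += 1
            total += 1
    return [count / total for count in counts]


def histogram_distance(a: Frame, b: Frame, bins: int = 16) -> float:
    """Half the L1 distance between histograms: 0.0 (identical) to 1.0 (disjoint)."""
    hist_a, hist_b = histogram(a, bins), histogram(b, bins)
    return 0.5 * sum(abs(x - y) for x, y in zip(hist_a, hist_b))


def detect_scene_changes(frames: Sequence[Frame], threshold: float = 0.4) -> List[int]:
    """Return indices of frames that start a new scene (frame 0 always does)."""
    if not frames:
        return []
    still_indices = [0]
    reference = frames[0]  # the last frame that was kept as a still
    for index in range(1, len(frames)):
        if histogram_distance(reference, frames[index]) > threshold:
            still_indices.append(index)
            reference = frames[index]
    return still_indices
```

Each returned index marks a frame that would be converted into a still image. Comparing against the last kept still, rather than the immediately preceding frame, is a design choice made here so that slow fades eventually register as a new scene.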

3. Stills Exploration:

Each still image is processed to extract highlighted lines (e.g., edges or contours) using image processing tools such as the OpenCV library or the Google Cloud Vision API.

Users can explore the still image by moving their finger on the screen. The app provides:

Clear beeps for following lines or edges.

Noisy feedback when deviating from the lines, with intensity increasing as the user moves further away.

The app automatically moves to the next still image when a new scene begins.
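
The beep-and-noise behavior described above can be sketched as a function from finger position to a feedback signal. This is a hedged sketch under stated assumptions: the edge pixels are taken as already extracted into (x, y) coordinates, and the names, the 2-pixel tolerance, the 50-pixel saturation distance, and the linear ramp are hypothetical values chosen for illustration.

```python
import math
from typing import Iterable, Tuple

Point = Tuple[int, int]


def distance_to_nearest_edge(touch: Point, edge_pixels: Iterable[Point]) -> float:
    """Euclidean distance from the touch point to the closest edge pixel."""
    distances = [math.hypot(touch[0] - x, touch[1] - y) for x, y in edge_pixels]
    # With no edges in the image, treat every touch as infinitely far away
    return min(distances) if distances else math.inf


def feedback_for_touch(
    touch: Point,
    edge_pixels: Iterable[Point],
    tolerance: float = 2.0,      # assumed: how close counts as "on the line"
    max_distance: float = 50.0,  # assumed: distance at which noise saturates
) -> Tuple[str, float]:
    """Return ("beep", 1.0) on a line, otherwise ("noise", intensity up to 1.0)."""
    distance = distance_to_nearest_edge(touch, edge_pixels)
    if distance <= tolerance:
        return ("beep", 1.0)
    # Noise grows linearly with distance beyond the tolerance, capped at 1.0
    intensity = min((distance - tolerance) / (max_distance - tolerance), 1.0)
    return ("noise", intensity)
```

A real app would call something like this on every touch-move event and pass the result to its audio engine. The brute-force nearest-pixel search is adequate for a sketch; a precomputed distance map would be the usual choice for large images.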

4. Multisensory Feedback:

The app delivers two separate audio streams, one to each ear:

One Ear: Plays the original audio content of the video (e.g., dialogue, music).

Other Ear: Provides tactile guidance sounds (beeps and noise) corresponding to the user’s touch navigation on the image.
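
One way to picture the ear separation is as two mono streams combined into stereo frames, with one source per channel. The sketch below assumes both streams are lists of float samples at the same sample rate; the function name, the `video_ear` parameter, and the default of sending video audio to the left ear are hypothetical.

```python
from typing import List, Sequence, Tuple


def mix_to_stereo(
    video_audio: Sequence[float],
    guidance_audio: Sequence[float],
    video_ear: str = "left",  # assumed default; which ear gets the video is arbitrary
) -> List[Tuple[float, float]]:
    """Pair mono samples into (left, right) frames, one source per ear."""
    if video_ear not in ("left", "right"):
        raise ValueError("video_ear must be 'left' or 'right'")
    length = max(len(video_audio), len(guidance_audio))
    frames = []
    for i in range(length):
        # Pad the shorter stream with silence so the two stay aligned
        video = video_audio[i] if i < len(video_audio) else 0.0
        guidance = guidance_audio[i] if i < len(guidance_audio) else 0.0
        frames.append((video, guidance) if video_ear == "left" else (guidance, video))
    return frames
```

For example, `mix_to_stereo([0.1, 0.2], [0.9])` yields `[(0.1, 0.9), (0.2, 0.0)]`: the video samples stay in the left channel and the single guidance sample is followed by silence on the right.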

5. Zoom Functionality:

Users can zoom in and out on still images to explore finer details.

Upon moving to the next scene, the app resets the image to its default size for consistency.
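
The zoom-and-reset behavior can be sketched as a small state object. This is an illustrative assumption of how the state might be held, not the disclosed implementation: the class name, the pan offsets, and the 1x to 8x zoom limits are hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class ZoomState:
    """Zoom level and pan offset for the still currently on screen."""
    scale: float = 1.0
    offset_x: float = 0.0  # image x-coordinate shown at the screen's left edge
    offset_y: float = 0.0  # image y-coordinate shown at the screen's top edge
    min_scale: float = 1.0  # assumed limits, chosen for illustration
    max_scale: float = 8.0

    def zoom(self, factor: float) -> None:
        """Multiply the zoom level by factor, clamped to the allowed range."""
        self.scale = max(self.min_scale, min(self.scale * factor, self.max_scale))

    def pan_to(self, image_x: float, image_y: float) -> None:
        """Place the given image coordinate at the screen's top-left corner."""
        self.offset_x, self.offset_y = image_x, image_y

    def screen_to_image(self, screen_x: float, screen_y: float) -> Tuple[float, float]:
        """Map a touch position on screen to a coordinate in the still image."""
        return (self.offset_x + screen_x / self.scale,
                self.offset_y + screen_y / self.scale)

    def on_new_scene(self) -> None:
        """Reset to the default size and position when the next still appears."""
        self.scale = 1.0
        self.offset_x = 0.0
        self.offset_y = 0.0
```

Mapping touches through `screen_to_image` before looking up nearby edges would keep the audio guidance consistent at any zoom level, and calling `on_new_scene` on each scene change restores the default view.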

6. Interface and Navigation:

Accessible interface for visually impaired users.

Simple controls to pause, play, and skip scenes or navigate to previous stills.


Applications

1. Education:

Helps visually impaired users interpret educational videos or presentations.

2. Entertainment:

Enables interaction with movies, series, and other video content.

3. Accessibility:

Provides a novel way for visually impaired individuals to “watch” videos by exploring visual content and listening simultaneously.


Advantages

1. Enhanced Accessibility:

Bridges the gap between video content and visually impaired users by converting videos into interpretable still images.

2. Dual Feedback:

Combines auditory and tactile feedback for a multisensory experience.

3. Scene-Based Analysis:

Intelligent detection of significant scene changes for smoother and more meaningful interaction.

4. Scalability:

Can be integrated with multiple video platforms and formats.

Claims

1. A software feature that allows users to open and process video files or streams, dividing them into still images based on detected scene changes.

2. An image analysis system that identifies significant visual variations in video scenes and processes frames into stills that can be explored by touch.

3. A dual-audio system that provides synchronized playback of video audio content in one ear and tactile feedback sounds in the other.

4. Zoom functionality for detailed image exploration, resetting to default size upon a new scene.

5. A user interface designed for visually impaired individuals, integrating touch and sound interaction.

