Personal Project
3 Weeks
Ananay Gupta
The goal of my project is to make creative expression and collaboration more accessible to Blind and Low Vision (BLV) users by developing new tools for artistic creation and effective communication. This project aims to empower BLV users with greater agency and opportunities to create, modify, and deeply engage with generated digital art, while also acting as a medium for visual communication. By exploring how I can leverage current AI models to create these tools, I seek to bridge the gap between visual and non-visual experiences, enabling inclusive and meaningful artistic collaboration.
The project is aimed at creating accessible tools for Blind and Low Vision (BLV) users to express creativity and collaborate through digital art. It features a digital canvas with real-time feedback and AI-powered tools that transform sketches into detailed, realistic visualizations.
BLV users face significant barriers to artistic creation and communication due to the lack of tactile feedback and inclusive tools in digital art platforms. This project bridges that gap, offering spatial, emotional, and contextual support to empower BLV users in both creating and sharing their artwork.
The Accessible Canvas promotes inclusivity in creative expression, enabling BLV users to communicate artistic ideas to sighted audiences and fostering collaboration. By combining AI with accessibility, it creates opportunities for BLV users to engage deeply with art and share their unique perspectives.
I am developing the digital canvas and designing its feedback system, ensuring it provides spatial guidance for BLV users. I am also integrating AI tools for generating detailed images and descriptive feedback, as well as building an interactive system for follow-up questions to enhance user trust and engagement.
The user can sketch with spatial feedback and enter a text prompt describing the intended image.
An AI feature that transforms user sketches and prompts into detailed, realistic visualizations.
A follow-up question feature that lets users confirm the result aligns with their vision, enhancing trust.
How can we leverage AI to assist BLV individuals in creating and communicating visual ideas?
Themes
Digital drawing platforms.
AI image generation tools.
Preferences for image descriptions.
Activity
Sketch an image of "a dog playing with a ball" and describe it.
[Image: a sketch of "a dog playing with a ball," drawn by a Blind participant.]
Digital Drawing Platforms
Combine locating coordinates on the canvas with auditory feedback.
Provide feedback on the relationships between objects, such as their relative positioning and distances (the sketch after this list illustrates one approach).
Explore methods to include tactile inputs using the keyboard and mouse.
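To make the relative-positioning idea concrete, here is a minimal sketch in TypeScript of how two named shapes on the canvas could be turned into spoken-style feedback. The shape labels, the pixel-to-step conversion, and the exact wording are my own illustrative assumptions rather than the final design.

```typescript
// Minimal sketch: describe the relationship between two named shapes on the canvas.
// Shape names, the pixel-to-step conversion, and the wording are illustrative assumptions.

interface NamedShape {
  name: string; // label the user gave the shape, e.g. "dog"
  x: number;    // centre x in canvas pixels
  y: number;    // centre y in canvas pixels
}

function describeRelation(a: NamedShape, b: NamedShape): string {
  const dx = b.x - a.x;
  const dy = b.y - a.y;

  // Convert pixel offsets into coarse directional language.
  const horizontal = dx === 0 ? "" : dx > 0 ? "to the right of" : "to the left of";
  const vertical = dy === 0 ? "" : dy > 0 ? "below" : "above";
  const direction = [vertical, horizontal].filter(Boolean).join(" and ") || "on top of";

  // Express distance in rough, non-visual units (here: 50 px is treated as one "step").
  const steps = Math.round(Math.hypot(dx, dy) / 50);
  const distance = steps === 0 ? "touching" : `about ${steps} step${steps === 1 ? "" : "s"} away`;

  return `The ${b.name} is ${direction} the ${a.name}, ${distance}.`;
}

// Example: feedback for the "dog playing with a ball" activity.
console.log(describeRelation({ name: "dog", x: 200, y: 150 }, { name: "ball", x: 320, y: 260 }));
// -> "The ball is below and to the right of the dog, about 3 steps away."
```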
AI Image Generation
Provide a way for BLV users to validate whether the generated image matches their prompts and sketches.
Enable users to include additional context or details to improve the quality of the generated image.
AI Image Description
Implement a feature that allows for follow-up questions, enabling users to gain more detailed insights.
Ensure that descriptions include spatial relationships and contextual details.
Keep descriptions flexible to allow for individual interpretation.
Initially, I assumed it would be enough to describe the image by locating subjects and objects within four quadrants and by framing the description around the emotions the user intended. However, after conducting preliminary research, I realized the need to include more detailed spatial relationships and contextual descriptions to better suit BLV users' needs. A key insight was the importance of letting users validate the generated image and ask follow-up questions about it; I plan to explore this using a visual question answering (VQA) model. I will also experiment with refining my image-to-text prompts to ensure they describe the image effectively.
p5.js
Used to create an accessible canvas that gives auditory feedback.
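Below is a minimal sketch of the kind of canvas feedback I am building with p5.js: while the user draws, the browser's Web Speech API announces which quadrant of the canvas the stroke is in. The quadrant wording and the simple repetition guard are illustrative choices, not the final feedback design.

```typescript
// Minimal p5.js sketch (instance mode) with spoken spatial feedback.
// Quadrant wording and the repetition guard are illustrative choices.
import p5 from "p5";

new p5((p: p5) => {
  let lastSpoken = "";

  const speak = (text: string) => {
    if (text === lastSpoken) return;  // avoid repeating the same announcement
    lastSpoken = text;
    window.speechSynthesis.cancel();  // interrupt stale feedback
    window.speechSynthesis.speak(new SpeechSynthesisUtterance(text));
  };

  p.setup = () => {
    p.createCanvas(400, 400);
    p.background(255);
  };

  p.draw = () => {
    if (p.mouseIsPressed) {
      // Draw a stroke and announce where on the canvas it is happening.
      p.line(p.pmouseX, p.pmouseY, p.mouseX, p.mouseY);
      const horizontal = p.mouseX < p.width / 2 ? "left" : "right";
      const vertical = p.mouseY < p.height / 2 ? "top" : "bottom";
      speak(`${vertical} ${horizontal}`);
    }
  };
});
```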
Image to Image Model (Sketch to Image)
TencentARC/t2i-adapter-sketch-sdxl-1.0 (Hugging Face)
A sketch-conditioned adapter for a diffusion-based text-to-image model (SDXL).
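The snippet below sketches how the canvas and prompt could be sent to the sketch-to-image model from the browser. The endpoint URL and payload schema are placeholders, since they depend on how the model is actually deployed (for example behind a Hugging Face Inference Endpoint or a small custom server).

```typescript
// Minimal client-side sketch for sending the user's drawing and prompt to the
// sketch-to-image model. The endpoint URL and payload schema are placeholders:
// they depend on how TencentARC/t2i-adapter-sketch-sdxl-1.0 is deployed.
const SKETCH_TO_IMAGE_URL = "https://example.com/generate"; // hypothetical endpoint

async function generateImage(canvas: HTMLCanvasElement, prompt: string): Promise<string> {
  const sketchDataUrl = canvas.toDataURL("image/png"); // the drawing canvas as a base64 PNG

  const response = await fetch(SKETCH_TO_IMAGE_URL, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ prompt, sketch: sketchDataUrl }),
  });
  if (!response.ok) {
    throw new Error(`Sketch-to-image request failed: ${response.status}`);
  }

  // Assume the deployment returns the generated image as a base64 data URL.
  const { image } = (await response.json()) as { image: string };
  return image;
}
```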
Image to Text Model
OpenAI GPT-4o
Provides a detailed description of the subject and background of the generated image.
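A minimal server-side sketch of the description step, using the OpenAI Node SDK. The instruction text is my own wording and reflects the guidelines above about spatial relationships and leaving room for interpretation.

```typescript
// Minimal server-side sketch: ask GPT-4o for a description of the generated image.
// The instruction wording is my own and mirrors the description guidelines above.
import OpenAI from "openai";

const client = new OpenAI(); // reads OPENAI_API_KEY from the environment

async function describeImage(imageDataUrl: string): Promise<string> {
  const response = await client.chat.completions.create({
    model: "gpt-4o",
    messages: [
      {
        role: "user",
        content: [
          {
            type: "text",
            text:
              "Describe this image for a blind artist. Cover the main subject, the background, " +
              "and the spatial relationships between objects. Avoid guessing at emotions or " +
              "intent so the listener can form their own interpretation.",
          },
          { type: "image_url", image_url: { url: imageDataUrl } },
        ],
      },
    ],
  });
  return response.choices[0].message.content ?? "";
}
```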
Visual Question Answering Model
Salesforce/blip-vqa-capfilt-large (Hugging Face)
Allows the user to ask additional questions about the image to get a more detailed understanding.
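A minimal sketch of the follow-up question step, assuming the BLIP VQA model is reachable through Hugging Face's hosted Inference API; the request follows the visual-question-answering task format of a question plus a base64-encoded image. The token parameter is a placeholder for a Hugging Face access token.

```typescript
// Minimal sketch of a follow-up question call, assuming the model is reachable
// through Hugging Face's hosted Inference API. The payload follows the
// visual-question-answering task format (a question plus a base64-encoded image).
const VQA_URL =
  "https://api-inference.huggingface.co/models/Salesforce/blip-vqa-capfilt-large";

async function askFollowUp(
  imageBase64: string,
  question: string,
  hfToken: string // Hugging Face access token (placeholder)
): Promise<string> {
  const response = await fetch(VQA_URL, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${hfToken}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ inputs: { question, image: imageBase64 } }),
  });
  if (!response.ok) {
    throw new Error(`VQA request failed: ${response.status}`);
  }

  // The VQA task returns a list of candidate answers with confidence scores.
  const answers = (await response.json()) as { answer: string; score?: number }[];
  return answers[0]?.answer ?? "No answer returned.";
}

// Example: after generation, the user can probe details the description left out, e.g.
// askFollowUp(generatedImageBase64, "Is the ball in front of the dog or behind it?", token)
```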
Unlearning:
This project taught me to unlearn my assumptions about how Blind and Low Vision users approach tasks. By listening and observing, I realized their strategies are far more diverse and nuanced than I initially thought, reinforcing the value of open-minded collaboration.
Communication:
Working with Blind and Low Vision individuals taught me the value of precise and spatially aware communication. Describing concepts required me to think beyond visual references, focusing on clear, positional, and tactile language.
Universal Design:
I learned the value of universal design, and I aim to take that forward by applying it in my future projects. By taking deliberate steps to create inclusive solutions, I can ensure my designs are accessible and meaningful for everyone.