Data science and machine learning resources for visually impaired

Project

In recent years, AI-related technologies, such as data science and machine learning, have been used in various fields. In the future, there will be more and more opportunities to learn about such technologies in school education. Moreover, an essential part of learning data science is the ability to analyze vast amounts of data using graphs and other tools. For the visually impaired, the necessity of using such methods that rely on sight is a serious barrier to learning data science.

Therefore, in this project, I would like to organize useful information and develop tools necessary to learn data science and machine learning techniques without relying on sight. At the moment, I have developed a library that notifies visual state changes with sound on Google Colaboratory, and an alternative visualization tool that can convert graphs into audio. I have also created tutorials on basic machine learning and advanced deep learning using these libraries.

The goal is that visually impaired people with some programming knowledge and an interest in data science will be able to run and understand a few typical machine learning tutorials. Eventually, I hope that someone will be able to apply the various tools to other fields, such as scientific research. The detailed background of the project can be found in the following blog, if you would like to read it.

What would it take to learn data science without relying on sight?

Libraries

I am developing various Python libraries mainly for use on Google Colaboratory, which is a relatively accessible notebook environment often used in data science. I have also tried to use local environment scripting for the graph sonification library, but it strongly depends on the running environment, so if you have any problems, please contact me.

audio-plot-lib

This library converts graph data into sound and playback it to enable users to understand the characteristics of the data. It also provides functions that are essential for data analysis, such as the interactive check of values by tracing them with the mouse cursor. The following is an example of a graph that allows you to interactively check the data. Two graphs, one sine wave and one cosine wave, are overlapped and sonified. First of all, please click on the following button to unmute the graph audio. Note that you do not need to click the button on Google Colab.

After pressing the button, next move the mouse cursor appropriately to find if the sound "Enter graph" is played out. Then, while you are on the graph image, you can check the values plotted on the graph as pitches by moving the mouse left or right. When you don't have a mouse, move the cursor to the slider and try moving the slider left or right.

In the initial state, you can hear the sine wave values. By double-clicking, you can now switch to the cosine value. You can also single-click to read out the x and y values for that point. When you don't have a mouse, multiple sliders are available for each graph.

In addition to checking interactively the graph data, it is also possible to create a graph that simply plays sounds. The following is an example of a graph of the same data in audio format. Play it back and check how the sound changes.

You can also access the Google Colab example below to try out all the functions while modifying them by yourself. Please check out the basic usage and how to specify options.

colab-a11y-utils

This library adds the following features to improve usability on Google Colab. Together with the browser extension options described next, I think you will be able to use Google Colab more smoothly without having to navigate around a lot with a screen reader.

One beep sound when executed
Two beep sounds when normal completion
Two buzzer sounds when abnormal completion
Sound notification of progress bar by tqdm
Simple audio output function that can be used instead of print

Open example of colab accessibility utilities in Colab

Browser Extention Script (Optional)

This is a script that partially rewrites HTML to make it easier to operate the screen reader on Google Colab. It is not required to run the tutorial described below, so please install it if you have time.

1. Install Tampermonkey

Tampermonkey is a popular browser extension for managing user scripts. It allows you to easily import JavaScript that will affect only specific pages in a generic manner. For more information, please do a google search.

2. Import the forrowing script

If you already have Tampermonkey installed, just click on the following user script link and you will be taken to a page to import it.

colab_a11y_utils.user.js

Tutorials

Linear Regression

In this tutorial, you will learn about the basics of machine learning, the linear regression algorithm, which is also introduced in the Python Data Science Handbook, a learning resource described below.

Speech Command Recognition

In this tutorial, you will learn how to train a voice command recognizer using deep learning, as described on the official PyTorch website.

Learning Materials

Introduction to Data Science for Screen Reader Users

This website is designed for screen reader users to learn data science. It covers the basics of NVDA, Google Colaboratory, Python, and foundational concepts in data science. The content is carefully written to cater to students.

Python Data Science Handbook

This website contains the full text of the Python Data Science Handbook by Jake VanderPlas. A link to Colab is provided each page so that you can probably learn all about the basics of machine learning by using the library in this project. Please feel free to request me any additional features that are essential for reading along with the screen reader.

Google Colab Simple Usage

Press Ctrl+Shift+P to display the command palette. Most of the operations can be performed from the command palette, so you just need to remember this.
Press Ctrl+M followed by H to display a list of keyboard shortcuts.
The basic operation is to write the python code on a cell and press Ctrl+Enter to execute the code. The result of the execution will be displayed directly below the cell, but it may be displayed as an iframe, which may be easily overlooked by some screen readers as it only reads "frame". If you move the focus to the child element, you should be able to access the result.
Press Ctrl+M followed by B to add cells
Press Ctrl+M followed by D to remove cells
Press Ctrl+M followed by I to interrupt runnning
Press Ctrl+Q to check running status of the cell if you installed the tampermonkey script

Related Projects

Desmos

Desmos is a free online mathematics tool that includes a functional calculator and graphing capabilities. It is also focused on accessibility, and has provided sonified graphs, such as the one developed in this project. Actually, it has been used in many educational fields and exams, and I think it is the best tool to learn mathematics as a basis for data science.

Accessible Graphs

This is a free online tool that specializes in the conversion of graph data to sound. It is designed to be simple and easy to use, so you can easily experience the audio graph. The Braille display is also supported, which I think is really excellent.

Contact

For more information and collaboration, please let me know on disucussion page on Github or send me an email directly. In particular, it would be great if you could tell me what you would like to see added.

GitHub Discussion Page
E-mail

Acknowledgments

Please share this project. Tweet

I would be happy to have more Stars on the GitHub page.