Home |
Software |
Research |
Try HFVE! |
About |
The HFVE (Heard & Felt Vision Effects) audiotactile vision information system enables totally blind people to access aspects of visual images. HFVE software outputs sounds that convey key shapes, and can present corresponding tactile effects on a force-feedback joystick or mouse (or tactile array). Click here for demo videos. Click here to try HFVE! Heard & Felt EffectsThe shapes of items within images are presented via apparently-moving speech-like sounds (termed "tracers") which trace out the paths of outlines, center lines, symbolic shapes, etc., with corners emphasized. The speech describes features such as color, layout, etc. Coded speech sounds may be less intrusive than standard speech. An optional non-speech "buzz track" can clarify shapes. The moving sounds are positioned in stereo soundspace; according to location, and pitched according to the height of the feature they are representing. In the tactile modality, HFVE software can command a standard force-feedback device (such as Microsoft's Sidewinder® Force Feedback 2 joystick or Logitech's Wingman® Force Feedback Mouse) to move the user's hand along the paths of shapes. (Alternatively apparently-moving tracers can be presented on a tactile array e.g. forehead-, abdomen-, or tongue-placed electro-tactile display.) Braille can present regular regions, and Morse code-like taps can present visual information.
In both modalities the paths followed by the tracers describe the shape, size and location of items. As the system outputs both audio and tactile effects, users can choose which modality to follow; or both modalities can be used simultaneously. What can be presented ?HFVE software can exhibit visual representations such as abstract shapes; still or moving filed images (.GIF, .AVI, etc.); live images; data that can be presented visually; media player items (DVDs etc.); clipboard contents; drag & drop items; or sections of a computer desktop. Other apps, which produce shapes and images (e.g. map applications, educational software, certain games, etc.), can send visual representations for HFVE software to exhibit, via a simple interface. How are Images Exhibited ?Tracers, corners, and buzz tracksWhen exhibiting items, the paths followed by the moving tracers are typically the outlines of the items, but alternatives include medial (center) lines, enclosing rectangles, and standard easily-recognised symbolic paths for common "items" such as faces and people. Corners are emphasized with special effects which help to give a clearer impression of shape. A "buzz track" gives additional non-speech cues about location and shape, and plays at the same time as the speech-like sounds. Its buzzing sounds are easier to mentally position in soundspace, and allow more accurate perception of shape. Location and direction cues, and timbre-conveyed information, can be included on the buzz track. "Matrix" effects are produced by dividing the image several equal-width columns and/or rows, so that special effects can be triggered whenever the tracer moves from one such column or row to another, allowing the shape of lines to be perceived more clearly – if the tracer travels at a constant speed, the rate at which the border effects are presented will correspond to the angle of slope.
Polytracers & RegionsAdditionally, optophone-like multiple-tracer effects (known as "polytracers") sweep through an area to give an intuitive impression of the layout of item content. Furthermore, user-controlled regular rectangular areas (known as "regions") can also be exhibited. These map well to braille. (Item shape and content can also be displayed via braille.)
Imprints"Imprints" rapidly and intuitively summarize an image, by conveying the approximate extent of items via multiple stationary voices, or via non-speech effects. The effect of the voices presenting successive items can give the impression of the items being "stamped out" or "printed". Imprints can be presented in conjunction with other effects, such as shape-conveying buzz-track tracers, and optophone-like polytracer effects. Blind users may not need to know the exact size, shape and location of each item - the approximate size and extent presented by the Imprint is sufficient. However users can command the system to lock on to an item when it is presented, in order to obtain the exact shape etc. of the item. Multi-talker Focus effectsMulti-level multi-talker "Focus" effects allow several properties and items to be presented and investigated at the same time. The system presents the items that are currently the primary focus of attention via crisp non-modified sounds, for example via speech sounds. At the same time the system presents the speech sounds for items that are not at the focus of attention, but applies a distinct differentiating effect on them, for example by changing the type of voice (e.g. monotone, or with intonation), or by applying echo or reverberation effects.
This helps users to focus their auditory attention on the item emphasized by the system, or switch their attention to another item that is also presented but not emphasized. They can then cause the system to highlight that other item instead. For the illustrated spreadsheet, the pointer is over a particular cell, but is also over a column of cells, a row of cells, a block of cells, and the spreadsheet. The user’s focus of attention can be drawn towards any one of these spreadsheet items (cell, column, row, block etc.) while at the same time the user can be made aware of the other co-located items, which are at different levels of view.A blind user can rapidly navigate between such levels, for example by using a mouse wheel or a dial device, while hearing the focus effects speaking the level (e.g. cell, column, row, or block) that is currently emphasized, and at the same time being made aware of the levels above and below the current level of view, which have distinguishing effects applied. Navigating with locked-on items, and freely exploringAt any moment the user can lock on the item being presented. When an item is locked on, and the user moves the pointer within the area of the item, typically the items at lower- (and/or higher-) levels than the locked item are presented, so that the user can be aware of items in adjacent levels (or items nearby on the same level), and can switch to being locked on one of them instead. Alternatively the system can step around the lower-level items within the locked-on higher-level item, and the user can at any time lock on the item being presented. The method of user interaction can also be “exploring” in style, using a moving pointer. Click here for demo videos. One option is for a different voice to start when a new item is to be announced (optionally with a different persona), but with the earlier voice continuing on at a reduced volume level, being reduced further with each subsequent item (the previous voices can also be moved to the side to keep them distinct from the new main voice). This may produce a less abrupt effect on change of announced item, with the previous voices gradually fading away.
Computer VisionHFVE software includes facilities for identifying objects within images. Optical processing techniques are used to present areas containing particular colors (known as "blobs"), and computer vision object-recognition techniques work well for certain items (such as people's faces). HFVE software can also highlight areas and directions of motion in a live or moving image - the shape of the area of movement can be detected and exhibited; or a special tracking mode can be selected, whereupon the area of motion is followed, with polytracer-like effects continuously conveying the movement. Other types of item e.g. faces, or selected areas, can also be tracked. Open source Tesseract OCR is used for text recognition, and open source OpenCV is used for face and motion detection.
The illustration shows this process in action, with 5 faces detected, and with the resultant assumed figures overlapped appropriately. The resultant figures can then be presented using the effects, for example as imprints, or symbolic tracers.
Creating Audiotactile ImagesBlind people can draw shapes and items onto an image or blank background via keyboard, joystick, touch-screen, or speech - these can then be replayed using standard HFVE conventions. A regular computer mouse can also be used - stereophonic buzz track-like sounds give continuous feedback about the location of the mouse pointer, whose path is mapped onto the drawing canvas. When users move the mouse (or joystick) in a certain path, the sounds they hear will be similar to those produced by the moving tracers when the shape is replayed. Prepared GuidesAlthough HFVE software can automatically identify and highlight some items in images, for certain applications images can be prepared by a designer (e.g. a teacher), who can produce a guide which specifies the items to be presented, their extent, importance, etc. Presenting Data, Graphs and ChartsData can be presented in the form of audiotactile shapes that are similar to standard graphs and charts, such as line graphs or bar charts. Line graphs have previously been presented by using optophone-like audio mapping. However by using audiotactile tracers that can move in any direction, shapes that resemble other graph and chart types (e.g. pie charts) can also be traced out with audiotactile effects. One difference from general image presentation is that the special effects, which normally highlight corners, can be used to represent data points. The apparent speed of travel and sound timbre can change on change of section. Several rows of data can be presented. HFVE software can also present audiotactile waveform shapes that are defined in a spreadsheet. Several waves can be combined, and the resultant waveform presented. Blind people can create audiotactile graphs, charts and waveforms, and can experience the effect on the waveform shape of changing parameters. System RequirementsHFVE software runs on Microsoft Windows® and can run virtualized on other OSs. HFVE software development is nearing completion. Contact HFVE for more information. Demo videos
|