Institutions
|
About Us
|
Help
|
Gaeilge
0
1000
Home
Browse
Advanced Search
Search History
Marked List
Statistics
A
A
A
Author(s)
Institution
Publication types
Funder
Year
Limited By:
Subject = speech;
4 items found
Sort by
Title
Author
Item type
Date
Institution
Peer review status
Language
Order
Ascending
Descending
25
50
100
per page
Bibtex
CSV
EndNote
RefWorks
RIS
XML
Displaying Results 1 - 4 of 4 on page 1 of 1
Marked
Mark
Color-to-speech sensory substitution device for the visually impaired
(1997)
McMorrow, Gabriel; Wang, Xiaojun; Whelan, Paul F.
Color-to-speech sensory substitution device for the visually impaired
(1997)
McMorrow, Gabriel; Wang, Xiaojun; Whelan, Paul F.
Abstract:
A hardware device is presented that converts color to speech for use by the blind and visually impaired. The use of audio tones for transferring knowledge of colors identified to individuals was investigated but was discarded in favor of the use of direct speech. A unique color-clustering algorithm was implemented using a hardware description language (VHDL), which in-turn was used to program an Altera Corporation's programmable logic device (PLD). The PLD maps all possible incoming colors into one of 24 color names, and outputs an address to a speech device, which in-turn plays back one of 24 voice recorded color names. To the author's knowledge, there are only two such color to speech systems available on the market. However, both are designed to operate at a distance of less than an inch from the surface whose color is to be checked. The device presented here uses original front-end optics to increase the range of operation from less than an inch to sixteen feet and gre...
http://doras.dcu.ie/4667/
Marked
Mark
Investigating segment-based query expansion for user-generated spoken content retrieval
(2016)
Khwileh, Ahmad; Jones, Gareth J.F.
Investigating segment-based query expansion for user-generated spoken content retrieval
(2016)
Khwileh, Ahmad; Jones, Gareth J.F.
Abstract:
The very rapid growth in user-generated social multimedia content on online platforms is creating new challenges for search technologies. A significant issue for search of this type of content is its highly variable form and quality. This is compounded by the standard information retrieval (IR) problem of mismatch between search queries and target items. Query Expansion (QE) has been shown to be an effect technique to improve IR effectiveness for multiple search tasks. In QE, words from a number of relevant or assumed relevant top ranked documents from an initial search are added to the initial search query to enrich it before carrying out a further search operation. In this work, we investigate the application of QE methods for searching social multimedia content. In particular we focus on social multimedia content where the information is primarily in the audio stream. To address the challenge of content variability, we introduce three speech segment-based methods for QE using: Se...
http://doras.dcu.ie/23386/
Marked
Mark
MPEG-1 bitstreams processing for audio content analysis
(2002)
Jarina, Roman; Duffner, Orla; Marlow, Seán; O'Connor, Noel E.; Murphy, Noel
MPEG-1 bitstreams processing for audio content analysis
(2002)
Jarina, Roman; Duffner, Orla; Marlow, Seán; O'Connor, Noel E.; Murphy, Noel
Abstract:
In this paper, we present the MPEG-1 Audio bitstreams processing work which our research group is involved in. This work is primarily based on the processing of the encoded bitstream, and the extraction of useful audio features for the purposes of analysis and browsing. In order to prepare for the discussion of these features, the MPEG-1 audio bitstream format is first described. The Application Interface Protocol (API) which we have been developing in C++ is then introduced, before completing the paper with a discussion on audio feature extraction.
http://doras.dcu.ie/327/
Marked
Mark
Speech-music discrimination from MPEG-1 bitstream
(2001)
Jarina, Roman; Murphy, Noel; O'Connor, Noel E.; Marlow, Seán
Speech-music discrimination from MPEG-1 bitstream
(2001)
Jarina, Roman; Murphy, Noel; O'Connor, Noel E.; Marlow, Seán
Abstract:
This paper describes a proposed algorithm for speech/music discrimination, which works on data directly taken from MPEG encoded bitstream thus avoiding the computationally difficult decoding-encoding process. The method is based on thresholding of features derived from the modulation envelope of the frequency-limited audio signal. The discriminator is tested on more than 2 hours of audio data, which contain clean and noisy speech from several speakers and a variety of music content. The discriminator is able to work in real time and despite its simplicity, results are very promising.
http://doras.dcu.ie/332/
Displaying Results 1 - 4 of 4 on page 1 of 1
Bibtex
CSV
EndNote
RefWorks
RIS
XML
Year
2016 (1)
2002 (1)
2001 (1)
1997 (1)
built by Enovation Solutions