Dmitry Zotkin
Adjunct Associate Professor
4218 Iribe Center
(301) 405-1049
Education:
Ph.D., University of Maryland (Computer Science)
Biography:
Dmitry Zotkin is an adjunct associate professor in the University of Maryland Institute for Advanced Computer Studies.
Zotkin works in audio and acoustic signal processing. His main research interests are spatial audio capture and reproduction. He also works in related areas, such as microphone arrays, auditory scene analysis, and fast numerical methods for the acoustic wave equation.
Publications
2011
2011. A partial least squares framework for speaker recognition. Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. :5276-5279.
2011. Method and System for Dereverberation of Signals Propagating in .... 13/047,311
2011. Partial least squares based speaker recognition system. Snowbird Learning Workshop.
2011. Kernel partial least squares for speaker recognition. Twelfth Annual Conference of the International Speech Communication Association.
2010
2010. Plane-Wave Decomposition of Acoustical Scenes Via Spherical and Cylindrical Microphone Arrays. IEEE Transactions on Audio, Speech, and Language Processing. 18(1):2-16.
2010. Kernelized Rényi distance for speaker recognition. Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. :4506-4509.
2010. Automatic matched filter recovery via the audio camera. Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. :2826-2829.
2010. Loudspeaker and Microphone Array Signal Processing-Plane-Wave Decomposition of Acoustical Scenes Via Spherical and Cylindrical Microphone Arrays. IEEE transactions on audio, speech, and language processing. 20(1):2-2.
2010. Signal Processing for Audio HCI. Handbook of Signal Processing Systems. :243-265.
2010. Audio visual scene analysis using spherical arrays and cameras. The Journal of the Acoustical Society of America. 127(3):1979-1979.
2010. Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation. The Journal of the Acoustical Society of America. 127(1):370-386.
2009
2009. Computation of the head-related transfer function via the boundary element method and representation via the spherical harmonic spectrum. Technical Reports from UMIACS UMIACS-TR-2009-06.
2009. Plane-wave decomposition of a sound scene using a cylindrical microphone array. Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. :85-88.
2009. Imaging room acoustics with the audio camera. The Journal of the Acoustical Society of America. 125(4):2544-2544.
2009. Regularized HRTF fitting using spherical harmonics. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. :257-260.
2008
2008. Spherical microphone array based immersive audio scene rendering. Proc. ICAD.
2008. Imaging concert hall acoustics using visual and audio cameras. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. :5284-5287.
2008. Sound field decomposition using spherical microphone arrays. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. :277-280.
2008. A spherical microphone array based system for immersive audio scene rendering. UMIACS-TR-2008-09
2007
2007. Multimodal Tracking for Smart Videoconferencing and Video Surveillance. Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on. :1-2.
2007. Fast Evaluation of the Room Transfer Function Using Multipole Expansion. Audio, Speech, and Language Processing, IEEE Transactions on. 15(2):565-576.
2007. Fast Multipole Accelerated Boundary Elements for Numerical Computation of the Head Related Transfer Function. IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. 1:I-165-I-168.
2007. Efficient Conversion of X.Y Surround Sound Content to Binaural Head-Tracked Form for HRTF-Enabled Playback. IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. 1:I-21-I-24.
2006
2006. Fast head-related transfer function measurement via reciprocity. The Journal of the Acoustical Society of America. 120(4):2202-2215.
2006. Head-related transfer functions via the fast multipole accelerated boundary element method. The Journal of the Acoustical Society of America. 120(5):3342-3343.
2006. Capture and rendering of spatial sound over headphones. The Journal of the Acoustical Society of America. 120(5):3094-3094.
2006. Spherical and hemispherical microphone arrays for capture and analysis of sound fields. The Journal of the Acoustical Society of America. 120(5):3225-3225.
2006. Frequency Independent Flexible Spherical Beamforming Via RBF Fitting. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 5:V-V-V-V.
2005
2005. High Order Spatial Audio Capture and Its Binaural Head-Tracked Playback Over Headphones with HRTF Cues. Audio Engineering Society Convention 119.
2005. Processing of reverberant speech for time-delay estimation. IEEE Transactions on Speech and Audio Processing. 13(6):1110-1118.
2005. Neuromimetic sound representation for percept detection and manipulation. EURASIP Journal on Applied Signal Processing. 9:1350-1350.
2005. High Order Spatial Audio Capture and Binaural Head-Tracked Playback over Headphones with HRTF Cues. Proceedings 119th convention of AES.
2005. Plane-wave decomposition analysis for spherical microphone arrays. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005. :150-153.
2004
2004. Rendering localized spatial audio in a virtual auditory space. IEEE Transactions on Multimedia. 6(4):553-564.
2004. Accelerated speech source localization via a hierarchical search of steered response power. IEEE Transactions on Speech and Audio Processing. 12(5):499-508.
2004. Interpolation and range extrapolation of head related transfer functions. IEEE International Conference on Acoustics, Speech and Signal Processing. 4
2003
2003. Pitch and timbre manipulations using cortical representation of sound. Multimedia and Expo, IEEE International Conference on. 3:381-384.
2003. Using computer vision to generate customized spatial audio. Multimedia and Expo, IEEE International Conference on. 3:57-60.
2003. HRTF personalization using anthropometric measurements. Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on. :157-160.
2003. Pitch and timbre manipulations using cortical representation of sound. IEEE International Conference on Acoustics, Speech and Signal Processing. 5
2002
2002. Virtual audio system customization using visual matching of ear parameters. 16th International Conference on Pattern Recognition, 2002. Proceedings. 3:1003-1006.
2002. Creation of virtual auditory spaces. 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2
2002. Customizable auditory displays. Proceedings of the International Conference on Auditory Display. :167-176.
2002. Joint audio-visual tracking using particle filters. EURASIP Journal on Applied Signal Processing. 2002(1):1154-1164.
2001
2001. Attentive toys. International Conference on Multimedia and Expo. 22:25-25.
2001. Efficient evaluation of reverberant sound fields. Applications of Signal Processing to Audio and Acoustics, 2001 IEEE Workshop on the. :203-206.
2001. Multimodal tracking for smart videoconferencing. Second International Conference on Multimedia and Expo, Tokyo, Japan.
2001. Multimodal localization of a flying bat. Acoustics, Speech, and Signal Processing, IEEE International Conference on. 5:3057-3060.
2001. Active speech source localization by a dual coarse-to-fine search. 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 5:3309-3312.
2001. Multimodal 3-D tracking and event detection via the particle filter. IEEE Workshop on Detection and Recognition of Events in Video, 2001. Proceedings. :20-27.
2000
2000. Active source location and beamforming. The Journal of the Acoustical Society of America. 107:2790-2790.
2000. Attacking the bottlenecks of backfilling schedulers. Cluster Computing.
2000. An audio-video front-end for multimedia applications. 2000 IEEE International Conference on Systems, Man, and Cybernetics. 2:786-791.
2000. Smart videoconferencing. 2000 IEEE International Conference on Multimedia and Expo, 2000. ICME 2000. 3:1597-1600.
1999
1999. A real-time audio–video front-end for multimedia applications. The Journal of the Acoustical Society of America. 106:2271-2271.
1999. Exact solutions for the problem of source location from measured time differences of arrival. The Journal of the Acoustical Society of America. 106:2277-2277.
1998
1998. Pictorial query trees for query specification in image databases. Fourteenth International Conference on Pattern Recognition, 1998. Proceedings. 1:919-921.