Joseph Keshet

Publications

This is a listing of most of my publications, arranged in reverse chronological order. Where available, publications include links to implementation code and demonstration pages. Additional publications can be found on my Google Scholar profile.

Downloaded papers are for personal use only, including educational and research purposes; they may not be used for commercial redistribution or promotion.

A Front-End Adaptation Network for Improving Speech Recognition Performance in Packet Loss and Noisy Environments
Yehoshua Dissen, Shiry Yonash, Israel Cohen, and Joseph Keshet

IEEE Transactions on Audio, Speech and Language Processing, Volume 33, pp. 2175-2188, 2025

This work presents a novel method to improve automatic speech recognition (ASR) performance in noisy and packet loss conditions without retraining or fine-tuning large pretrained models. By attaching a lightweight front-end adaptation network to a frozen ASR model, the system learns to correct…
Spectral Analysis of Diffusion Models with Application to Schedule Design
Roi Benita, Michael Elad, Joseph Keshet

Preprint, 2025

We propose a spectral analysis of the diffusion model’s inference, treating it as a transfer function that maps initial noise to the generated signal. This analysis leads to a mechanism for designing noise schedules.

2025

A Front-End Adaptation Network for Improving Speech Recognition Performance in Packet Loss and Noisy Environments
Yehoshua Dissen, Shiry Yonash, Israel Cohen, and Joseph Keshet
IEEE Transactions on Audio, Speech and Language Processing, Volume 33, pp. 2175-2188, 2025
Spectral Analysis of Diffusion Models with Application to Schedule Design
Roi Benita, Michael Elad, Joseph Keshet
Preprint, 2025
We propose a spectral analysis of the diffusion model’s inference, treating it as a transfer function that maps initial noise to the generated signal. This analysis leads to a mechanism for designing noise schedules.
FlowTSE: Target Speaker Extraction with Flow Matching
Aviv Navon, Aviv Shamsian, Yael Segal-Feldman, Neta Glazer, Gill Hetz, Joseph Keshet
The 26th Annual Conference of the International Speech Communication Association (Interspeech), 2025.
Whisper in Medusa’s Ear: Multi-head Efficient Decoding for Transformer-based ASR
Yael Segal-Feldman, Aviv Shamsian, Aviv Navon, Gill Hetz, Joseph Keshet
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
Whisper is a powerful encoder-decoder model for speech transcription and translation. To accelerate its inference, we propose two architectures that extend Whisper by enabling multi-token prediction per iteration.
Enhancing analysis of diadochokinetic speech using deep neural networks
Yael Segal-Feldman, Kasia Hitczenko, Matthew Goldrick, Adam Buchwald, Angela Roberts, Joseph Keshet
Computer Speech & Language, Volume 90, 101715, March 2025
Predicting relative intelligibility from inter-talker distances in a perceptual similarity space for speech
Seung-Eun Kim, Bronya R Chernyak, Joseph Keshet, Matthew Goldrick, Ann R Bradlow
Psychonomic Bulletin & Review, 2025

2024

WhisperNER: Unified Open Named Entity and Speech Recognition
Gil Ayache, Menachem Pirchi, Aviv Navon, Aviv Shamsian, Gill Hetz, Joseph Keshet
Preprint, 2024.
HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing
Arnon Turetzky, Or Tal, Yael Segal-Feldman, Yehoshua Dissen, Ella Zeldes, Amit Roth, Eyal Cohen, Yosi Shrem, Bronya R. Chernyak, Olga Seleznova, Joseph Keshet, Yossi Adi
The 25th Annual Conference of the International Speech Communication Association (Interspeech), 2024
Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network
Yehoshua Dissen, Shiry Yonash, Israel Cohen, Joseph Keshet
The 25th Annual Conference of the International Speech Communication Association (Interspeech), 2024
Tradition or Innovation: A Comparison of Modern ASR Methods for Forced Alignment
Rotem Rousso, Eyal Cohen, Joseph Keshet, Eleanor Chodroff
The 25th Annual Conference of the International Speech Communication Association (Interspeech), 2024
Keyword-Guided Adaptation of Automatic Speech Recognition
Aviv Shamsian, Aviv Navon, Neta Glazer, Gill Hetz, Joseph Keshet
The 25th Annual Conference of the International Speech Communication Association (Interspeech), 2024
A Perceptual Similarity Space For Speech Based On Self-Supervised Speech Representations
Bronya R. Chernyak, Ann R. Bradlow, Joseph Keshet, and Matthew Goldrick
Journal of the Acoustical Society of America, Vol. 155, Issue 6, pp. 3915–3929, 2024
Open Vocabulary Keyword-Spotting with Adaptive Instance Normalization
Aviv Navon, Aviv Shamsian, Neta Glazer, Gill Hetz, Joseph Keshet
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 11656-11660, 2024
Open vocabulary keyword spotting is a crucial and challenging task in automatic speech recognition (ASR) that focuses on detecting user-defined keywords within a spoken utterance.
Automatic Recognition of Second Language Speech-in-Noise
Seung-Eun Kim, Bronya R. Chernyak, Olga Seleznova, Joseph Keshet, Matthew Goldrick, Ann R. Bradlow
JASA Express Letters, Volume 4, Issue 2, 2024
DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation
Roi Benita, Michael Elad, Joseph Keshet
International Conference on Learning Representations (ICLR), 2024

2023

Speech Characteristics Yield Important Clues About Motor Function: Speech Variability in Individuals at Clinical High-Risk for Psychosis
Kasia Hitczenko, Yael Segal, Joseph Keshet, Matthew Goldrick, Vijay A Mittal
Nature Schizophrenia, Volume 9, Article number: 60, 2023
Using Automatic Acoustic Analysis to Reveal Disruptions to Speech Articulation in Individuals at Risk for Psychosis
Kasia Hitczenko, Yael Segal, Joseph Keshet, Vijay Mittal, Matthew Goldrick
Journal of the Acoustical Society of America, Volume 153, A290, 2023

2022

A Baseline for Detecting Out-of-Distribution Examples in Image Captioning
Gal Shalev, Gabi Shalev, Joseph Keshet
The 30th ACM International Conference on Multimedia, pp. 4175 – 4184, 2022
THOR: Threshold-Based Ranking Loss for Ordinal Regression
Tzeviya Sylvia Fuchs, Joseph Keshet
Formant Estimation and Tracking using Probabilistic Heat-maps
Yosi Shrem, Felix Kreuk, Joseph Keshet
The 23rd Annual Conference of the International Speech Communication Association (Interspeech), 2022
DDKtor: Automatic Diadochokinetic Speech Analysis
Yael Segal, Kasia Hitczenko, Matthew Goldrick, Adam Buchwald, Angela Roberts, Joseph Keshet
The 23rd Annual Conference of the International Speech Communication Association (Interspeech), 2022
Unsupervised Word Segmentation Using K Nearest Neighbors
Tzeviya Sylvia Fuchs, Yedid Hoshen, Joseph Keshet
The 23rd Annual Conference of the International Speech Communication Association (Interspeech), 2022
Self-Supervised Speaker Diarization
Yehoshua Dissen, Felix Kreuk, Joseph Keshet
The 23rd Annual Conference of the International Speech Communication Association (Interspeech), 2022
Correcting Mispronunciations In Speech Using Spectrogram Inpainting
Talia Ben-Simon, Felix Kreuk, Faten Awwad, Jacob T Cohen, Joseph Keshet
The 23rd Annual Conference of the International Speech Communication Association (Interspeech), 2022
DeepFry: Identifying Vocal Fry using Deep Neural Networks
Bronya R Chernyak, Talia Ben Simon, Yael Segal, Jeremy Steffman, Eleanor Chodroff, Jennifer S Cole, Joseph Keshet
The 23rd Annual Conference of the International Speech Communication Association (Interspeech), 2022
Speech Time-Scale Modification with GANs
Eyal Cohen, Felix Kreuk, Joseph Keshet
IEEE Signal Processing Letters, Volume 29, pp. 1067 – 1071, 2022

2021

Using Automated Acoustic Analysis to Explore the Link Between Planning and Articulation in Second Language Speech Production
Matthew Goldrick, Yosi Shrem, Oriana Kilbourn-Ceron, Cristina Baus, Joseph Keshet
Language, Cognition and Neuroscience, Volume 36, Issue 7, pp. 824-839, 2021
Pitch Estimation by Multiple Octave Decoders
Yael Segal, May Arama-Chayoth, Joseph Keshet
IEEE Signal Processing Letters, Volume 28, pp. 1610-1614, 2021
Fairness in the Eyes of the Data: Certifying Machine-Learning Models
Shahar Segal, Yossi Adi, Benny Pinkas, Carsten Baum, Chaya Ganesh, and Joseph Keshet
AAAI/ACM Conference on AI, Ethics, and Society, pp. 926-935, 2021
CNN-based Spoken Term Detection and Localization without Dynamic Programming
Tzeviya Sylvia Fuchs, Yael Segal, and Joseph Keshet
The 46th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 6853-6857, 2021
On Randomized Classification Layers and Their Implications in Natural Language Generation
Gal-Lev Shalev, Gabi Shalev, Joseph Keshet
The 3rd Workshop on Multimodal Artificial Intelligence, pp. 6-11, 2021
Constant Random Perturbations Provide Adversarial Robustness with Minimal Effect on Accuracy
Bronya Roni Chernyak, Bhiksha Raj, Tamir Hazan, Joseph Keshet
RobustML Workshop, 9th International Conference on Learning Representations (ICLR), 2021
Intrapersonal and Interpersonal Vocal Affect Dynamics During Psychotherapy
Adar Paz, Eshkol Rafaeli, Eran Bar-Kalifa, Eva Gilboa-Schechtman, Sharon Gannot, Bracha Laufer-Goldshtein, Shrikanth Narayanan, Joseph Keshet, Dana Atzil-Slonim
Journal of Consulting and Clinical Psychology, Volume 89, Issue 3, pp. 227, ASA, March 2021
Adversarial Robustness for Face Recognition: How to Introduce Ensemble Diversity among Feature Extractors?
Takuma Amada, Kazuya Kakizaki, Seng Pei Liew, Toshinori Araki, Joseph Keshet, Jun Furukawa
Workshop on Artificial Intelligence Safety (SafeAI), AAAI, 2021

2020

Redesigning The Classification Layer by Randomizing the Class Representation Vectors
Gabi Shalev, Gal-Lev Shalev, Joseph Keshet
Preprint, 2020
Diadochokinetic Speech in Individuals at Clinical High Risk for Schizophrenia
Kasia Hitczenko, Yael Segal, Tzeviya Sylvia Fuchs, Matthew Goldrick, Joseph Keshet, Vijay Mittal
Journal of the Acoustical Society of America, Vol. 148, Issue 4, pp. 2584, Oct 2020
Hide and Speak: Towards Deep Neural Networks for Speech Steganography
Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet
The 21st Annual Conference of the International Speech Communication Association (Interspeech), Shanghai, China, 2020
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation
Felix Kreuk, Joseph Keshet and Yossi Adi
The 21st Annual Conference of the International Speech Communication Association (Interspeech), 2020
Minimal Modifications of Deep Neural Networks using Verification
Ben Goldberger, Guy Katz, Yossi Adi, Joseph Keshet
23rd International Conference on Logic for Programming, Artificial Intelligence and Reasoning (LPAR), 2020
Phoneme Boundary Detection using Learnable Segmental Features
Felix Kreuk, Yaniv Sheena, Joseph Keshet, Yossi Adi
The 45th International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020
Online Prediction of Time Series with Assumed Behavior
Ariel Rosenfeld, Moshe Cohen, Sarit Kraus, and Joseph Keshet
Engineering Applications of Artificial Intelligence, Volume 88, 103358, February 2020

2019

SpeechYOLO: Detection and Localization of Speech Objects
Yael Segal, Tzeviya Sylvia Fuchs, Joseph Keshet
The 20th Annual Conference of the International Speech Communication Association (Interspeech), 2019
Dr.VOT: Measuring Positive and Negative Voice Onset Time in the Wild
Yosi Shrem, Matthew Goldrick, Joseph Keshet
The 20th Annual Conference of the International Speech Communication Association (Interspeech), 2019
The influence of lexical selection disruptions on articulation
Matthew Goldrick, Rhonda McClain, Emily Cibelli, Yossi Adi, Erin Gustafson, Cornelia Moers, Joseph Keshet
Journal of Experimental Psychology: Learning, Memory, and Cognition, Volume 45, Issue 6, pp. 1107-1141, 2019
Bio-sensor based on multiclass support vector machine with a reject option
Stav Buchsbaum, Yossi Keshet, Nisan Ozana, Zeev Zalevsky
Proceedings Volume 10871, Multimodal Biomedical Imaging XIV; 108710O, 2019
Predicting glottal closure insufficiency using fundamental frequency contour analysis
Jacob T Cohen, Alma Cohen, Limor Benyamini, Yossi Adi, Joseph Keshet
Head & Neck, Volume 41, Issue 7, pp. 2324-2331, 2019
Formant estimation and tracking: A deep learning approach
Yehoshua Dissen, Jacob Goldberger, Joseph Keshet
The Journal of the Acoustical Society of America, Volume 145, Issue 2, pp. 642-653, 2019

2018

Automatic speech recognition: A primer for speech-language pathology researchers
Joseph Keshet
International Journal of Speech-Language Pathology, Volume 20, Issue 6, pp. 599-609, 2018
Assessing automatic VOT annotation using unimpaired and impaired speech
Esteban Buz, Adam Buchwald, Tzeviya Fuchs, Joseph Keshet
International Journal of Speech-Language Pathology, Volume 20, Issue 6, 2018
Out-of-Distribution Detection using Multiple Semantic Label Representations
Gabi Shalev, Yossi Adi, Joseph Keshet
Advances in Neural Information Processing Systems 31 (NeurIPS), 2018
Fooling End-To-End Speaker Verification with Adversarial Examples
Felix Kreuk, Yossi Adi, Moustapha Cisse, Joseph Keshet
IEEE international conference on acoustics, speech and signal processing (ICASSP), pp. 1962-1966, 2018
Deceiving End-to-End Deep Learning Malware Detectors using Adversarial Examples
Felix Kreuk, Assi Barak, Shir Aviv-Reuven, Moran Baruch, Benny Pinkas, Joseph Keshet
Preprint, 2018
Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring
Yossi Adi, Carsten Baum, Moustapha Cisse, Benny Pinkas, Joseph Keshet
The 27th USENIX Security Symposium, pp. 1615-1631, 2018

2017

Spoken Term Detection Automatically Adjusted for a Given Threshold
Tzeviya Fuchs, Joseph Keshet
IEEE Journal of Selected Topics in Signal Processing, Volume 11, Issue 8, pp. 1310-1317, December 2017
Learning Similarity Functions for Pronunciation Variations
Einat Naaman, Yossi Adi, Joseph Keshet
The 18th Annual Conference of the International Speech Communication Association (Interspeech), Stockholm, 2017
Automatic Measurement of Pre-aspiration
Yaniv Sheena, Míša Hejná, Yossi Adi, Joseph Keshet
The 18th Annual Conference of the International Speech Communication Association (Interspeech), Stockholm, 2017
Sequence Segmentation Using Joint RNN and Structured Prediction Models
Yossi Adi, Joseph Keshet, Emily Cibelli, Matthew Goldrick
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2017
Houdini: Fooling Deep Structured Visual and Speech Recognition Models with Adversarial Examples
Moustapha M Cisse, Yossi Adi, Natalia Neverova, Joseph Keshet
Advances in Neural Information Processing Systems 30 (NIPS), 2017

2016

Automatic measurement of vowel duration via structured prediction
Yossi Adi, Joseph Keshet, Emily Cibelli, Erin Gustafson, Cynthia Clopper, Matthew Goldrick
The Journal of the Acoustical Society of America, Volume 140, Issue 6, pp. 4517-4527, 2016
Formant Estimation and Tracking Using Deep Learning
Yehoshua Dissen, Joseph Keshet
The 17th Annual Conference of the International Speech Communication Association (Interspeech), 2016
Perturbation Models and PAC-Bayesian Generalization Bounds
Joseph Keshet, Subhransu Maji, Tamir Hazan, and Tommi Jaakkola
A book chapter in Perturbations, Optimization, and Statistics, Tamir Hazan, George Papandreou, and Daniel Tarlow, Eds., The MIT Press, 2016
Profiling hoax callers
Rita Singh, Joseph Keshet, Eduard Hovy
IEEE Symposium on Technologies for Homeland Security (HST), 2016
StructED: Risk Minimization in Structured Prediction
Yossi Adi, Joseph Keshet
Journal of Machine Learning Research, Volume 17, Issue 64, pp. 1−5, 2016.
Automatic analysis of slips of the tongue: Insights into the cognitive architecture of speech production
Matthew Goldrick, Joseph Keshet, Erin Gustafson, Jordana Heller, Jeremy Needle
Cognition, Volume 149, Pages 31-39, April 2016
The Relationship of Voice Onset Time and Voice Offset Time to Physical Age
Rita Singh, Joseph Keshet, Deniz Gencaga, Bhiksha Raj
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5390-5394, 2016
Automatic Measurement of Voice Onset Time and Prevoicing Using Recurrent Neural Networks
Yossi Adi, Joseph Keshet, Olga Dmitrieva, Matthew Goldrick
The 17th Annual Conference of the International Speech Communication Association (Interspeech), pp. 3152-3155, 2016
Online Prediction of Exponential Decay Time Series with Human-Agent Application
Ariel Rosenfeld, Joseph Keshet, Claudia V Goldman, Sarit Kraus
Frontiers in Artificial Intelligence and Applications, ECAI, Volume 285, pp. 595 – 603, 2016

2015

Context-Based Prediction of App Usage
Joseph Keshet, Adam Kariv, Arnon Dagan, Dvir Volk, Joey Simhon
Preprint, 2015
Risk Minimization in Structured Prediction using Orbit Loss
Danny Karmon, Joseph Keshet
Preprint, 2015
Vowel duration measurement using deep neural networks
Yossi Adi, Joseph Keshet, Matthew Goldrick
IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP), 2015

2014

Optimizing the Measure of Performance in Structured Prediction
Joseph Keshet
Book chapter in Advanced Structured Prediction, Sebastian Nowozin, Peter V. Gehler, Jeremy Jancsary, and Christoph H. Lampert, Eds., The MIT Press, 2014
Automatic Tools for Analyzing Spoken Hebrew
Adiel Ben-Shalom, Joseph Keshet, Doron Modan, Asher Laufer
Afeka Conference for Speech Processing, 2014
Automatic analysis of music: Performance of cantillation signs in Yemenite Jewish traditional cantillation
Adiel Ben-Shalom, Joseph Keshet, Roni Yeger-Granot
The 9th Conference on Interdisciplinary Musicology (CIM14), Berlin, Germany, 2014

2013

Discriminative Articulatory Models for Spoken Term Detection in Low-Resource Conversational Settings
Rohit Prabhavalkar, Karen Livescu, Eric Fosler-Lussier, Joseph Keshet
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, Canada, 2013
Discriminative learning with latent articulatory variables
Eric Fosler-Lussier, Preethi Jyothi, Joseph Keshet, Karen Livescu, Rohit Prabhavalkar, Hao Tang
Speech Production in Automatic Speech Recognition (SPASR), 58-59, 2013
Predicting Human Strategic Decisions Using Facial Expressions
Noam Peled, Moshe Bitan, Joseph Keshet, Sarit Kraus
23rd International Joint Conference on Artificial Intelligence (IJCAI), Beijing, China, 2013
Learning Efficient Random Maximum A-Posteriori Predictors with Non-Decomposable Loss Functions
Tamir Hazan, Subhransu Maji, Joseph Keshet, Tommi Jaakkola
Advances in Neural Information Processing Systems 26 (NIPS), 2013

2012

Automatic measurement of voice onset time using discriminative structured prediction
Morgan Sonderegger, Joseph Keshet
The Journal of the Acoustical Society of America, Volume 132, Issue 6, pp. 3965-3979, 2012
Discriminative Spoken Term Detection with Limited Data
Rohit Prabhavalkar, Joseph Keshet, Karen Livescu, Eric Fosler-Lussier
Symposium on Machine Learning in Speech and Language Processing (MLSLP), pp. 22-25, 2012
Automatic Measurement of Positive and Negative Voice Onset Time
Katharine Henry, Morgan Sonderegger, Joseph Keshet
The 13th Annual Conference of the International Speech Communication Association (Interspeech), pp. 871-874, 2012
Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach
Hao Tang, Joseph Keshet, Karen Livescu
The 50th Annual Meeting of the Association for Computational Linguistics (ACL), 2012

2011

Explicit Approximations of the Gaussian Kernel
Andrew Cotter, Joseph Keshet, Nathan Srebro
Preprint, 2011
Direct Error Rate Minimization of Hidden Markov Models
Joseph Keshet, Chih-Chieh Cheng, Mark Stoehr, David A McAllester
The Annual Conference of the International Speech Communication Association (Interspeech), pp. 449-452, 2011
A GPU-tailored approach for training kernelized SVMs
Andrew Cotter, Nathan Srebro, Joseph Keshet
The 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2011
PAC-Bayesian approach for minimization of phoneme error rate
Joseph Keshet, David McAllester, Tamir Hazan
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011
Generalization Bounds and Consistency for Latent Structural Probit and Ramp Loss
Joseph Keshet, David A. McAllester
Advances in Neural Information Processing Systems (NIPS), 2011

2010

Automatic Discriminative Measurement of Voice Onset Time
Morgan Sonderegger, Joseph Keshet
The Annual Conference of the International Speech Communication Association (Interspeech), pp. 2242-2245, 2010
Direct Loss Minimization for Structured Prediction
Tamir Hazan, Joseph Keshet, David McAllester
Advances in Neural Information Processing Systems (NIPS), 2010

2009

Bounded Kernel-Based Online Learning
Francesco Orabona, Joseph Keshet, Barbara Caputo
Journal of Machine Learning Research, Volume 10, Issue 11, 2009
Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods
Joseph Keshet, Samy Bengio
Edited book, John Wiley & Sons, Ltd., 2009
Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks
Martin Wollmer, Florian Eyben, Joseph Keshet, Alex Graves, Bjorn Schuller, Gerhard Rigoll
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3949-3952, 2009
Discriminative keyword spotting
Joseph Keshet, David Grangier, Samy Bengio
Speech Communication, Volume 51, Issue 4, pp. 317-329, 2009

2008

The projectron: a bounded kernel-based Perceptron
Francesco Orabona, Joseph Keshet, Barbara Caputo
The 25th International Conference on Machine Learning (ICML), pp. 720-727, 2008
Support Vector Machines with a Reject Option
Yves Grandvalet, Alain Rakotomamonjy, Joseph Keshet, Stéphane Canu
Advances in Neural Information Processing Systems 21 (NIPS), 2008

2007

A Large Margin Algorithm for Speech-to-Phoneme and Music-to-Score Alignment
Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Dan Chazan
IEEE Trans. on Audio, Speech and Language Processing, Volume 15, Issue 8, pp. 2373–2382, Nov. 2007.
Large Margin Algorithms for Discriminative Continuous Speech Recognition
Joseph Keshet
Ph.D. dissertation, The Hebrew University, August 2007.

2006

Discriminative Kernel-Based Phoneme Sequence Recognition
Joseph Keshet, Shai Shalev-Shwartz, Samy Bengio, Yoram Singer, Dan Chazan
The International Conference on Spoken Language Processing (Interspeech), Pittsburgh, PA, 2006.
Online Passive-Aggressive Algorithms
Koby Crammer, Ofer Dekel, Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer
Journal of Machine Learning Research, Volume 7, pp. 551−585, 2006.

2005

Phoneme Alignment Based on Discriminative Learning
Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer, Dan Chazan
The European Conference on Speech Communication and Technology (INTERSPEECH), Lisbon, 2005.

2004

Learning to Align Polyphonic Music
Shai Shalev-Shwartz, Joseph Keshet, Yoram Singer
The 5th International Conference on Music Information Retrieval (ISMIR), Barcelona, Spain, 2004
An Online Algorithm for Hierarchical Phoneme Classification
Ofer Dekel, Joseph Keshet, Yoram Singer
Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), LNCS, Vol. 3361/2005, p.146, Martigny, Switzerland, 2004.
Large Margin Hierarchical Classification
Ofer Dekel, Joseph Keshet, Yoram Singer
The 21st International Conference on Machine Learning (ICML), Banff, Canada, 2004

2002

Kernel Design Using Boosting
Koby Crammer, Joseph Keshet, Yoram Singer
The 15th Annual Conference on Neural Information Processing Systems (NIPS), 2002.

2001

Plosive Spotting with Margin Classifiers
Joseph Keshet, Dan Chazan, Ben-Zion Bobrovsky
The 7th European Conference on Speech Communication and Technology (EUROSPEECH), Aalborg, Denmark, 2001