Journals/Book Chapters (74)
Please note that the
main site is now at
https://www.telecom-paris.fr/gael-richard
K. Schulze-Forster, C. Doire, G. Richard, R. Badeau Phoneme
Level Lyrics Alignment and Text-Informed Singing Voice
Separation, in IEEE/ACM Transactions on Audio, Speech, and
Language Processing, vol. 29, pp. 2382-2395, 2021
G. Peeters, G. Richard, Deep Learning for Audio and Music,
published in Multi-faceted Deep Learning: Models and Data,
edited by J. Benois-Pineau, A. Zemmari, 2021, Springer
Ondrej Cifka, Umut Simsekli, Gaël Richard, “Groove2Groove:
One-Shot Music Style Transfer with Supervision from Synthetic
Data”, IEEE/ACM Transactions on Audio, Speech, and Language
Processing, vol. 28, pp. 2638-2650, 2020
Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong,
Patrick Perez, Gaël Richard, “Weakly Supervised Representation
Learning for Audio-Visual Scene Analysis”, IEEE/ACM Transactions
on Audio, Speech, and Language Processing, dec. 2019
Simon Henriet, Umut Simsekli, Sergio Dos Santos, Benoit Fuentes,
Gaël Richard, “Independent-Variation Matrix Factorization With
Application to Energy Disaggregation”, IEEE Signal Processing
Letters, Vol. 26, no. 11, November 2019
Zhiyao Duan, Slim Essid, Cynthia C. S. Liem, Gaël Richard,
Gaurav Sharma, “Audio-Visual Analysis of Music Performances”,
IEEE Signal Processing Magazine, vol. 36, no. 1, pp. 63-73, Jan.
2019.
Gaël Richard, Sébastien Fenêt, Yves Grenier, “De Fourier à
reconnaissance musicale”, Revue Interstices, Fev. 2019, online
at:
https://interstices.info/de-fourier-a-la-reconnaissance-musicale/
(in French)
Thanh
Huy Nguyen, Umut Şimşekli, Gaël Richard, Ali Taylan Cemgil,
“Efficient Bayesian Model Selection in PARAFAC via Stochastic
Thermodynamic Integration”, IEEE Signal Processing Letters,
April 2018
Simon Henriet, Umut Simsekli, Benoit Fuentes, Gaël Richard,
(2018), "A Generative Model for Non-Intrusive Load Monitoring in
Commercial Buildings", Volume 177, Oct. 2018, Pages 268-278
(a former version on arxiv)
Clément Laroche, Matthieu Kowalski, Hélène Papadopoulos and Gaël
Richard, (2018), "Hybrid Projective Nonnegative Matrix
Factorization with Drum Dictionaries for Harmonic/Percussive
Source Separation", ACM/IEEE Transactions on Audio, Speech and
Language Processing.Simon Leglaive, Vol.
26 (9), pp.
1499-1511, Sept. 2018
Simon Leglaive, Roland Badeau and Gaël Richard, (2018),
"Student's t Source and Mixing Models for Multichannel Audio
Source Separation", IEEE Transactions on Audio, Speech and
Language Processing, vol. 26, no. 6, pp. 1154-1168, June 2018.
B. Pardo, A. Liutkus, Z. Duan, G. Richard, Applying source
separation to music; in Audio Source Separation and Speech
Enhancement, E. Vincent, T. Virtanen, S. Gannot, Eds., Wiley
International, 2018.
R.
Serizel, V. Bisot, S. Essid, G.Richard, Acoustic Features for
Environmental sound Analysis, in Computational Analysis of Sound
Scenes and Events, T. Virtanen, D. Ellis, M. Plumbley Eds.,
Springer International Publishing AG, pp 71-101, 2018
K. Nathwani, G. Richard, B. David, P.
Prablanc, V. Roussarie, Speech Intelligibility Improvement in
Car Noise Environment by Voice Transformation, Speech
Communication, May 2017.
T. Janssoone, C. Clavel, K. Bailly et G.
Richard,."Règles d’associations temporelles de signaux sociaux
pour la synthèse d’un Agent Conversationnel Animé: Application
aux attitudes sociales", Revue d'Intelligence Artificielle (in
French). 2017.
V. Bisot, R. Serizel, S. Essid, G.
Richard, "Feature Learning with Matrix Factorization Applied to
Acoustic Scene Classification", IEEE/ACM Transactions on Audio,
Speech, and Language Processing, (2017), Special Issue on Sound
Scene and Event Analysis.
S. Fenet, R. Badeau, G. Richard,
"Reassigned Time-Frequency Representations of Discrete Time
Signals and Application to the Constant-Q Transform", Signal
Processing, Signal Processing 132 (2017) 170–176
S Durand, J. Bello, S. Leglaive, B.
David, G. Richard, "Robust Downbeat Tracking Using an Ensemble
of Convolutional Networks", IEEE/ACM Transactions on Audio,
Speech, and Language Processing, Vol 25, N°1, 2017
S. Leglaive, R. Badeau, G. Richard,
"Multichannel Audio Source Separation with Probabilistic
Reverberation Priors", IEEE/ACM Transactions on Audio, Speech,
and Language Processing, Vol. 24, no. 12, December 2016
X. Jaureguiberry, E. Vincent, G. Richard, «Fusion methods for
speech enhancement and audio source separation», IEEE/ACM
Transactions on Audio, Speech, and Language Processing, vol. 24,
no. 7, pp. 1266-1279, July 2016.
A. Masurelle, A. Rida Sekkat, S. Essid, G. Richard, «
TPT-Dance&Actions : un corpus multimodal d’activités
humaines », Revue Traitement du signal – no 4/2015, pp. 443-475.
H. Bai, G. Richard, L. Daudet "Late
Reverberation Synthesis: From Radiance Transfer to Feedback
Delay Networks", IEEE/ACM Transactions on Audio, Speech, and
Language Processing, 2015 vol. 23, n° 12, pp. 2260 ‑ 2271.
J. Salomon, E. Gomez, D. Ellis, G.
Richard, "Melody Extraction from Polyphonic Music Signals:
Approaches, Applications and Challenges", IEEE Signal Processing
magazine, Vol 31, Issue 2, pp 118-134, March 2014
M. Moussallam, A. Gramfort, L. Daudet, G.
Richard, "Blind Denoising with Random Greedy Pursuits", IEEE
Signal Processing letters, Vol. 21, N° 11, Nov. 2014
C. Joder, S. Essid, G. Richard, "Learning
Optimal Features for Polyphonic Audio-to-Score Alignment,"
Audio, Speech, and Language Processing, IEEE Transactions on ,
vol.21, no.10, pp.2118,2128, Oct. 2013
G.
Richard, S. Sundaram, S. Narayanan "An overview on Perceptually
Motivated Audio Indexing and Classification", Proceedings of the
IEEE, 2013.
B. Fuentes, R. Badeau et G. Richard, (2013),
"Harmonic Adaptive Latent Component Analysis of Audio and
Application to Music Transcription", EEE Transactions on Audio,
Speech and Language Processing, Vol 21, N°9, Sept. 2013
O.Derrien, R. Badeau, G. Richard, "A
Parametric Audio Coding with Exponentially Damped Sinusoids,
IEEE Transactions on Audio, Speech and Language Processing, Vol
21, N° 7, July 2013.
A.
Ozerov, A. Liutkus, R. Badeau et G. Richard, "Coding-based
Informed Source Separation: Nonnegative Tensor Factorization
Approach. IEEE Transactions on Audio, Speech and Language
Processing, Vol 21, N°8, Aout 2013.
M. Ramona, G. Richard, B. David "Multiclass
Feature Selection with Kernel Gram-matrix-based criteria", IEEE
Transactions on Neural Networks and Learning Systems, vol.PP,
no.99, pp.1, 0 doi: 10.1109/TNNLS.2012.2201748
A. Liutkus, J. Pinel, R. Badeau, L. Girin and G.
Richard, Informed source separation through spectrogram coding
and data embedding, Signal Processing, August 2012, vol. 92, n°
8, pp. 1937-1949
S. Essid, G. Richard, "Fusion of Multimodal
Information in Music Content Analysis", in Multimodal Music
Processing, Dagstuhl Follow-Ups, Ed. M. Muller, M. Goto, M.
Schedl, Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, 2012
F. Vallet, S. Essid, J. Carrive and G.
Richard,High-level TV talk show structuring centered on
speakers' interventions, In book: TV Content Analysis:
Techniques and Applications, Y. Kompatsiaris, B. Merialdo and S.
Lian (Eds.), CRC Press, Taylor Francis LLC, 2012.
M.
Moussallam, L. Daudet, G. Richard, "Matching pursuits with
random sequential subdictionaries", Signal Processing, 2012,
http://dx.doi.org/10.1016/j.sigpro.2012.03.019
S. Essid, X. Lin, M. Gowing, G. Kordelas,
A. Aksay, P. Kelly, T. Fillon, Q. Zhang, A. Dielmann, V.
Kitanovski, R. Tournemenne, A. Masurelle, E. Izquierdo, N. E.
O'Connor, P. Daras, G. Richard, "A multi modal dance corpus for
research into interaction between humans in virtual
environments", Journal on Multimodal User Interfaces, 2012,7
(1-2). pp. 157-170. ISSN 1783-7677.
A. Liutkus, J. Pinel, R. Badeau, L. Girin,
G. Richard, Informed source separation through spectrogram
coding and data embedding, Signal Processing, September 2011.
P. Dymarski, N. Moreau, G. Richard, Greedy
sparse decompositions: A comparative study, EURASIP Journal on
Advances in Signal Processing, 2011:34
J-L Durrieu, B. David, G. Richard, A musically
motivated mid-level representation for pitch estimation and
musical audio source separation, IEEE Journal on Selected Topics
in Signal Processing, October 2011.
M.
Mueller, D. Ellis, A. Klapuri, G. Richard, Signal Processing
for Music Analysis", IEEE Journal on Selected Topics in Signal
Processing, October 2011.
C. Joder, S. Essid, G. Richard, A
Conditional Random Field Framework for Robust and Scalable
Audio-to-Score Matching, IEEE Transactions on Audio, Speech and
Language Processing, vol.19, no.8, pp.2385-2397, Nov. 2011.
A. Liutkus, R. Badeau, G. Richard,
Gaussian Processes for Underdetermined Source Separation, IEEE
Transactions on Signal Processing, vol.59, no.7, pp.3155-3167,
July 2011. R. Benmokhtar, B. Huet, G. Richard, T. Declerck and S. Essid, Feature Extraction for Multimedia Analysis, Chapitre 4., Multimedia Semantics: Metadata, Analysis and Interaction, Ed. Wiley, 2011.
S. Essid, M. Campedel, G. Richard, T. Piatrik, R. Benmokhtar and B. Huet Machine Learning Techniques for Multimedia Analysis. Chapitre 5., Multimedia Semantics: Metadata, Analysis and Interaction, Ed. Wiley, 2011.
J-L Durrieu, G. Richard, B. David, C.
Févotte, Source/Filter Model for Unsupervised Main Melody
Extraction From Polyphonic Audio Signals, IEEE Transactions on
Audio, Speech and Language Processing, Vol. 18, No 3, March
2010, pp564-575.
C. Clavel, G. Richard, " Reconnaissance
acoustique des émotions", Chapter 5 in Systèmes d’Interaction
Emotionnelle, Editor: C. Pelachaud, Hermès, 2010 (in French).
E. Ravelli, G. Richard, L. Daudet, Audio signal
representations for indexing in the transform domain, IEEE
Transactions on Audio, Speech and Language Processing, Vol. 18,
No 3, March 2010, pp 434- 446
M. Lagrange, M. Raspaud, R. Badeau, G.
Richard, "Explicit modeling of temporal dynamics within musical
signals for acoustical unit similarity," Pattern Recognition
Letters, Sept. 2009.
C. Joder, S. Essid, G. Richard, Temporal
integration for audio classification with application to musical
instrument classification, IEEE Transactions on Audio, Speech
and Language Processing, Vol. 17, N° 1, pp 174-186, Jan. 20092008
E. Ravelli, G. Richard, L. Daudet, Union of
MDCT bases for audio coding, IEEE Transactions on Audio, Speech
and Language Processing, Vol. 16, Issue 8, pp 1361-1372, Nov.
2008.
O. Derrien and G. Richard, A new
model-based algorithm for optimizing the MPEG-AAC in MS-stereo,
IEEE Transactions on Audio, Speech and Language Processing, Vol.
16, Issue 8, 1373-1382, Nov. 2008.
C. Clavel,I. Vasilescu, L. Devillers,G .
Richard, T. Ehrette Fear-type emotion recognition for future
audio-based surveillance systems, Speech Communication, Vol 50
(2008), pp. 487–503.
G . Richard, Audio Indexing, Encyclopedia
of Data Warehousing and Mining, Second Edition. Information
Science Reference - IGI Global, 2008.
R. Badeau, B. David and G. Richard, Fast
and stable YAST algorithm for principal and minor subspace
tracking, IEEE Transactions on Signal Processing, vol. 56, no.
8, pp. 3437-3446, août 2008.
R. Badeau, B. David and G. Richard,
Cramer-Rao bounds for multiple poles and coefficients of
quasipolynomials in colored noise, IEEE Transactions on Signal
Processing, vol. 56, no. 8, pp. 3458-3467, août 2008
P. Leveau, E. Vincent, G. Richard, L.
Daudet, Instrument-Specific Harmonic Atoms for Mid-Level Music
Representation, IEEE Transactions on Audio, Speech and Language
Processing, Volume 16, N°1 Jan. 2008 Page(s):116 - 128.
O. Gillet and G. Richard, Transcription and
Separation of Drum Signals From Polyphonic Music . IEEE
Transactions on Audio, Speech and Language Processing, Volume
16, N° 3, March 2008 Page(s):529 - 540.
R. Badeau, B. David and G. Richard,
“Performance of ESPRIT for Estimating Mixtures of Complex
Exponentials Modulated by Polynomials, IEEE Transactions on
Signal Processing, Vol. 56, N°. 2,February 2008, Page(s):492 -
504.
M. Betser, P. Collen, G. Richard and B.
David « Estimation of frequency for AM/FM models using the phase
vocoder framework», IEEE Transactions on Signal Processing,
Vol. 56, N°. 2, February 2008., Page(s):505 - 517.
O. Gillet, S. Essid and G. Richard, On the
Correlation of Audio and Visual Segmentations of Music Videos.
IEEE Transactions on Circuits and Systems for Video Technology,
17 (2), March 2007, pp 347-355. M. Alonso, G. Richard and B. David, “Tempo estimation for audio recordings”, Journal of New Music Research, Vol 36, N° 1, March 2007.
M. Alonso, G. Richard and B. David,
“Accurate tempo estimation based on harmonic+noise
decomposition”, EURASIP Journal on Advances in Signal
Processing, vol. 2007, Article ID 82795, 14 pages, 2007
S. Essid, G. Richard and B. David, “Musical
Instrument Recognition by pairwise classification strategies”,
IEEE Transactions on Speech, Audio and Language Processing,
Volume 14, Issue 4, July 2006 Page(s):1401 - 1412.C. Clavel, I. Vasilescu, G. Richard and L. Devillers, « Du corpus émotionnel au système de détection : le point de vue applicatif de la surveillance dans les lieux publics », (RIA06), Revue en Intelligence Artificielle RIA, edition spéciale « Interaction Emotionnelle, 2006.
O. Derrien, P. Duhamel, M. Charbit et G.
Richard, “A new quantization optimization algorithm for the
MPEGAdvanced Audio Coder using a statistical sub-band model of
the quantization noise” IEEE Transactions on Speech, Audio and
Language Processing, Volume 14, Issue 4, July 2006
Page(s):1328 - 1339
S. Essid, G. Richard and B. David,
Instrument Recognition in Polyphonic Music Based on Automatic
Taxonomies, IEEE Transactions on Speech, Audio and Language
Processing, Volume 14, Issue 1, Jan. 2006 Page(s):68 - 80
R. Badeau, B. David and G. Richard, “A new
perturbation analysis for signal enumeration in rotational
invariance techniques”, IEEE Transactions on Signal Processing,
Volume 54, Issue 2, Feb. 2006 Page(s):450 - 458
R. Badeau, B. David and G. Richard, “High
resolution spectral analysis of mixtures of complex exponentials
modulated by polynomialss”, IEEE Transactions on Signal
Processing, Volume 54, Issue 4, April 2006 Page(s):1341 - 1350
R. Badeau, B. David and G. Richard,
“Fast Approximated Power Iteration Subspace Tracking”, IEEE
Transactions on Signal Processing, Volume 53, Issue 8, Part
1, Aug. 2005 Page(s):2931 - 2941
O. Gillet et G. Richard , “Drum loops retrieval
from spoken queries”, Journal of Intelligent Information Systems
- Special issue on Intelligent Multimedia Applications, vol. 24,
n° 2/3, pp. 159-177, March 2005
R. Badeau, G. Richard et B. David, Sliding
window adaptive SVD algorithms, IEEE Transactions on Signal
Processing, Volume 52, Issue 1, Jan 2004 Page(s):1 - 10G. Richard et O. Cappé, “Synthèse de la parole à partir du texte”, Collection Techniques de l’ingénieur, Paris, 2003.
Van den Heuvel H., Boves L., Moreno A., Omologo
M., Richard G., Sanders E., “Annotation in the speechdat
projects’, International Journal of Speech Technology, 4, pp.
127-143., 2001.
G. Richard, C. d’Alessandro,
“Analysis/synthesis and modification of the speech aperiodic
component”, Speech Communication, Volume 19, Issue 3, September
1996, Pages 221–244Richard G., d'Alessandro C., (1997). ``Modification of the aperiodic component of speech signals for synthesis,'', chapter in Progress in Text-To-Speech synthesis, J.P.H. Van Santen, R.W. Sproat, J.P. Olive and J. Hirschberg, eds., Springer-Verlag, New-York
Theses (2)
Habilitation à diriger des recherches
Richard G. (2001). «Codage et Interfaces homme-machine », Habilitation à diriger des recherches de l’université Paris-XI, Orsay, Sept. 2001 – Part II :Mémoire (in French).
Ph.D. Thesis
Richard G., (1994). "Modélisation de la composante stochastique de la Parole" , thèse de Doctorat de l'Université Paris-XI, Orsay, 8 avril (in French)
Patents (11)
E. Gentet, S. Denjean, V. Roussarie, B. David, G. Richard, "Conversion de la parole par apprentissage statistique avec modélisation complexe des modifications temporelles", Brevet n° FR3106691 – 2021(in French)
S. Parekh, A. Ozerov, Q. Duong, G. Richard, S. Essid, P. Perez, "Method for Processing an input audio signal and corresponding electronic device, , Application 173305456.0 - 1914 (2017)
N. Lopez, Y. Grenier, G. Richard , « Procédé de suppression de la réverbération tardive d’un signal sonore », WO2015011078 A1, 2015
S. Fenet, Y. Grenier, G. Richard, “ Génération d'une Signature d'un Signal Audio Musical », WO2014131984 A2 ; 2013
L. Girin, Antoine Liutkus, G. Richard et R. Badeau, (2010), Procédé et dispositif de formation d'un signal mixé numérique audio, procédé et dispositif de séparation de signaux, et signal correspondant, Rapport de recherche, n° B10/3035FR, pp. 36.
R. Badeau, G. Richard and B. David, “Procédé de poursuite d'un sous-espace de dimension inférieure à celle des vecteurs de données, notamment audio”, Brevet d’invention n° 05 50678. 2005
Murgia C., Richard G., Lockwood P., “Procédés de codage, de décodage et de transcodage”, Brevet d’invention n° FR9903314, publié le 22/09/2000 sous le numéro FR2791166.
Murgia C, Richard G., Lockwood P., “Procédés de codage, de decodage et de transcodage audio”, Brevet d’invention n° FR9903323, publié le 22/09/2000 sous le numéro FR2791167.
Richard G., Murgia C., Le Doré A., Lockwood P., “Codeur Audio”, Brevet d’invention n° 9708784, Bulletin officiel de la propriété industrielle n° 99/37, 17.09.99 (n° de publication : 2 766 032).
Richard G., Lockwood P, Capman F., Boudy J., “Procédé et système de restitution sonore à effet spatial, et terminal de téléphone incorporant un tel système”, Brevet d’invention n° FR9909243, publié le 02/02/2001 sous le n°FR2797132.
Conference
Papers (215)
M. Barsbey, M. Sefidgaran, M. Erdogdu, G. Richard, Umut
Şimşekli. "Heavy Tails in SGD and Compressibility of
Overparametrized Neural Networks" 35th Conference on Neural
Information Processing Systems (NeurIPS), Dec 2021, Online,
United States.
A. Vaglio, R. Hennequin, M. Moussallam, G. Richard. "The words
remain the same: cover detection with lyrics transcription" 22nd
International Society for Music Information Retrieval Conference
(ISMIR), Nov 2021, Online, India.
L. Prétet, G. Richard, G. Peeters. "Is There a "Language of
Music-Video Clips" ? A Qualitative and Quantitative Study" 22nd
International Society for Music Information Retrieval Conference
(ISMIR), Nov 2021, Online, India. Best presentation award
Javier Nistal Stefan Lattner Gaël Richard. "DarkGAN: Exploiting
Knowledge Distillation for Comprehensible Audio Synthesis With
GANs" " 22nd International Society for Music Information
Retrieval Conference (ISMIR), Nov 2021, Online, India.
A. Liutkus and O. Cifka and S. Wu and U. Simsekli and Y. Yang
and G. Richard "Relative positional encoding for transformers
with linear complexity" in Proc. of International Conference on
Machine Learning (ICML) - Long paper presentation - 2021.
G. Cantisani, A. Ozerov, S. Essid, G.Richard. "User-guided
one-shot deep model adaptation for music source separation. in
IEEE Workshop on Applications of Signal Processing to Audio and
Acoustics (WASPAA), 2021.
J. Nistal, C. Aouameur, S. Lattner, G. Richard, “VQCPC-GAN:
Variable-Length Adversarial Audio Synthesis Using
Vector-Quantized Contrastive Predictive Coding” in IEEE Workshop
on Applications of Signal Processing to Audio and Acoustics
(WASPAA), 2021
G. Cantisani, S. Essid and G. Richard, “Neuro-Steered Music
Source Separation With EEG-Based Auditory Attention Decoding And
Contrastive-NMF” in Proc. of International Conference on
Acoustics, Speech and Signal Processing (ICASSP), 2021.
L. Prétet, G. Richard, G. Peeters. “Cross-Modal Music-Video
Recommendation: A Study of Design Choices.” Special Session of
the International Joint Conference on Neural Networks (IJCNN
2021),
O. Cífka, A. Ozerov, U. Simsekli, G. Richard, “Self-Supervised
VQ-VAE for One-Shot Music Style Transfer” in Proc. of
International Conference on Acoustics, Speech and Signal
Processing (ICASSP), 2021.
J. Nistal, S. Lattner, and G. Richard, “Comparing
representations for audio synthesis using generative adversarial
networks,” in Proc. of the 28th European SignalProcessing
Conference, EUSIPCO2020, Amsterdam,NL, Jan. 2021.
J. Nistal, S. Lattner and G. Richard, “DrumGAN: Synthesis of
Drum Sounds With Timbral Feature Condition-ing Using Generative
Adversarial Networks” in Proc. of the International Society for
Music Information Retrieval (ISMIR), Oct. 2020 (preprint).
A. Vaglio, R. Hennequin, M. Moussallam, G. Richard, F.
d'Alché-Buc "Multilingual Lyrics-to-Audio alignment" in Proc. of
the International Society for Music Information Retrieval
(ISMIR), Oct. 2020
Karim Ibrahim, Elena Epure, Geoffroy Peeters, Gael Richard
"Should we consider the users in contex-tual music auto-tagging
models?" in Proc. of the International Society for Music
Information Retrieval (ISMIR), Oct. 2020
Karim Ibrahim, Elena Epure, Geoffroy Peeters, Gael Richard
"Confidence-based Weighted Loss for Multi-label Classification
with Missing Labels." The 2020 International Conference on
Multimedia Retrieval (ICMR '20), Jun 2020, Dublin, Ireland.
E. Gentet, B. David, S. Denjean, G. Richard, V. Roussarie
"Speech intelligibility enhancement by equalization for in-car
applications" in Proc. of International Conference on Acoustics,
Speech and Signal Processing (ICASSP), May 2020, Barcelona,
Spain
E. Gentet, B. David, S. Denjean, G. Richard, V. Roussarie
"Neutral-to-Lombard speech conversion with deep learning" in
Proc. of International Conference on Acoustics, Speech and
Signal Processing (ICASSP), May 2020, Barcelona, Spain
L. Prétet, G. Richard, G. Peeters "Learning to rank music tracks
using triplet loss" in Proc. of International Conference on
Acoustics, Speech and Signal Processing (ICASSP), May 2020,
Barcelona, Spain
A. Vaglio, R. Hennequin, M. Moussallam, G. Richard, F.
d'Alché-Buc "Audio-Based detection of explicit content in music"
in Proc. of International Conference on Acoustics, Speech and
Signal Processing (ICASSP), May 2020, Barcelona, Spain
K.
Schulze-Forster, C. Doire, G. Richard, . Badeau, "Joint phoneme
alignment and text-informed speech separation on highly
corrupted speech" in Proc. of International Conference on
Acoustics, Speech and Signal Processing (ICASSP), May 2020,
Barcelona, Spain
K.M. Ibrahim, J.
Royo-Letelier, E. Epure, G. Peeters, G. Richard, "Audio-Based
Auto-Tagging with contextual tags for music" in Proc. of
International Conference on Acoustics, Speech and Signal
Processing (ICASSP), May 2020, Barcelona, Spain
T.H. Nguyen, U.
Şimşekli, M. Gürbüzbalaban, G. Richard, "First Exit Time
Analysis of Stochastic Gradient Descent Under Heavy-Tailed
Gradient Noise" 33rd Conference on Neural Information Processing
Systems (NeurIPS 2019), Dec 2019, Vancouver, Canada
O. Cifka, U. Şimşekli, G. Richard, "Supervised Symbolic Music
Style Translation Using Synthetic Data" (preprint), in Proc. of
ISMIR 2019, Delft, Netherlands
S. Parekh, A. Ozerov, S. Essid, N. Q. K. Duong, P. Pérez, G.
Richard, "Identify, Locate and Separate: Audio-visual Object
Extraction in Large Video Collections Using Weak Supervision",
in Proc. of WASPAA, New Paltz, NY, USA, 2019
G. Cantisani, S.
Essid, G. Richard, "EEG-based decoding of auditory attention to
a target instrument in polyphonic music", in Proc. of WASPAA,
New Paltz, NY, USA, 2019
K.
Schulze-Forster, C. Doire, G. Richard, R. Badeau, "Weakly
Informed Audio Source Separation", in Proc. of WASPAA, New
Paltz, NY, USA, 2019
S. Henriet, U.
Simsekli, S. Dos Santos, B. Fuentes, G. Richard, "Factorisation
Matricielle Semi Non-Négative: Application à la Décomposition de
Consommations Electriques" (in French), in Proc. of GRETSI,
Lille, France, 2019
G. Cantisani, G.
Trgoat, S. Essid, G. Richard, "MAD-EEG: an EEG dataset for
decoding auditory attention to a target instrument in polyphonic
music", in Proc. of Speech, Music and Mind (SMM), Satellite
Workshop of Interspeech 2019, Vienna, Austria, 2019
T. H. Nguyen, U. Şimşekli, G. Richard, "Non-Asymptotic Analysis
of Fractional Langevin Monte Carlo for Non-Convex Optimization"
(preprint), International Conference on Machine Learning (ICML),
Long Beach, CA, USA, 2019
U. Şimşekli, Ç.
Yildiz, T. H. Nguyen, G. Richard, A. T. Cemgil, “Asynchronous
Stochastic Quasi-Newton MCMC for Non-Convex Optimization”,
International Conference on Machine Learning (ICML), Stockholm,
Sweden, 2018
Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong,
Patrick Pérez, Gaël Richard, (2018), Weakly Supervised
Representation Learning for Unsynchronized Audio-Visual Events,
arXiv:1804.07345
Umut Simsekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus,
Roland Badeau and Gaël Richard, (2018), Alpha stable low rank plus
residual decomposition for speech enhancement, "ICASSP", Calgary,
Alberta, Canada.)
Sanjeel Parekh,
Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël
Richard, (2018), Weakly Supervised Representation Learning for
Unsynchronized Audio-Visual Events, CVPR Workshop, Salt Lake City,
US.
Enguerrand Gentet, Bertrand David, Sebastien Denjean, Gaël
Richard, Vincent Roussarie, « Optimisation d'un critère
d'Intelligibilité de la Parole dans un Contexte Bruité Automobile
», Congrès Français d’Acoustique (CFA), Avril 2018. (in French)
Simon Henriet, Umut Simsekli, Gaël Richard, Benoit Fuentes, «
Energy Disaggregation for Commercial Buildings: A Statistical
Analysis », International Workshop on Non-Intrusive Load
Monitoring (NILM2018), Austin, Tx, USA, 2018. Best Poster Award
Victor Bisot,
Romain Serizel, Slim Essid, Gaël Richard, Leveraging deep neural
networks with nonnegative representations for improved
environmental sound classification IEEE International Workshop on
Machine Learning for Signal Processing MLSP, Sep 2017, Tokyo,
Japan. 2017
Victor Bisot, Romain
Serizel, Slim Essid, Gaël Richard, Nonnegative Feature Learning
Methods for Acoustic Scene Classification DCASE 2017 – Workshop on
Detection and Classification of Acoustic Scenes and Events, Nov
2017, Munich, Germany
Simon
Leglaive, Roland Badeau, Gaël Richard (2017). Separating
Time-Frequency Sources from Time-Domain Convolutive Mixtures Using
Non-negative Matrix Factorization. IEEE Workshop on Applications
of Signal Processing to Audio and Acoustics (WASPAA), New Paltz,
New York, United States.
Sanjeel
Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick
Perez, Gaël Richard (2017), “Guiding Audio Source Separation by
Video Object Information", IEEE Workshop on Applications of
Signal Processing to Audio and Acoustics (WASPAA), New Paltz,
New York, United States.
Simon
Leglaive, Roland Badeau, Gaël Richard (2017), "Semi-blind
Student's t source separation for multichannel audio convolutive
mixtures", Proc. of the European Signal Processing Conference
(EUSIPCO), Kos island, Greece,
Simon
Leglaive, Roland Badeau, Gaël Richard (2017), "Séparation de
sources audio en milieu réverbérant : Factorisation en
matrices non-négatives et représentation temporelle du
mélange convolutif (in French)", Proc. of the XXVIe Colloque
GRETSI, Juan-Les-Pins, France,
R.
Serizel, V. Bisot, S. Essid and G. Richard (2017), “Supervised
group nonnegative matrix factorisation with similarity
constraints and applications to speaker identification,
"International Conference on Acoustics, Speech and Signal
Processing (ICASSP)", New Orleans, USA
V.
Bisot, R. Serizel, S. Essid and G. Richard (2017), “Overlapping
sound event detection with supervised nonnegative matrix
factorization, "International Conference on Acoustics, Speech and
Signal Processing (ICASSP)", New Orleans, USA
Clément
Laroche, Hélène Papadopoulos, Matthieu Kowalski and Gaël Richard,
(2017), Drum extraction in single channel audio signals using
Multi Layer Non negative Matrix Factor Deconvolution
"International Conference on Acoustics, Speech and Signal
Processing (ICASSP)", New Orleans, USA
S.
Leglaive, U. Simsekli, A. Liutkus, R. Badeau, G. Richard, (2017),
Alpha Stable Multichannel Audio Source Separation "International
Conference on Acoustics, Speech and Signal Processing (ICASSP)",
New Orleans, USA
U.
Simsekli, A. Durmus, R. Badeau, G. Richard, E. Moulines, (2017),
Parallelized Stochastic Gradient Markov Chain Monte Carlo
Algorithms for Non Negative Matrix Factorization, "International
Conference on Acoustics, Speech and Signal Processing (ICASSP)",
New Orleans, USA
S.
Leglaive, R. Badeau, G. Richard, (2017), Multichannel audio source
separation: variational inference of time frequency sources from
time domain observations, "International Conference on Acoustics,
Speech and Signal Processing (ICASSP)", New Orleans, USA
S.
Parekh. S. Essid, A. Ozerov, N. Duong, P. Perez, G. Richard
(2017), "Motion Informed Audio source Separation", "International
Conference on Acoustics, Speech and Signal Processing (ICASSP)",
New Orleans, USA.
Alain
Durmus, Umut Simsekli, Eric Moulines, Roland Badeau and Gaël
Richard, (2016), Stochastic Gradient Richardson Romberg Markov
Chain Monte Carlo, "Thirtieth Annual Conference on Neural
Information Processing Systems (NIPS)", Barcelona, Spain.
Thomas
Janssoone, C. Clavel, Kévin Bailly and Gaël Richard, (2016), Using
Temporal Association Rules For the synthesis of Embodied
Conversational Agent With a specific stance., "International
Conference on Intelligent Virtual Agents", Los Angeles, USA, n°
16th.
Simon
Leglaive, Roland Badeau and Gaël Richard, (2016), Autoregressive
Moving Average Modeling of Late Reverberation in the Frequency
Domain, "European Signal Processing Conference (EUSIPCO)",
Budapest, Hungary.
Umut
Simsekli, Roland Badeau, Gaël Richard and Ali Taylan Cemgil,
(2016), Stochastic Quasi Newton Langevin Monte Carlo, "ICML", New
York, NY, USA.
Romain
Serizel, Victor Bisot, Slim Essid and Gaël Richard, (2016),
Machine listening techniques as a complement to video image
analysis in forensics, "ICIP".
Romain
Serizel, Slim Essid and Gaël Richard, (2016), Mini batch
stochastic approaches for accelerated multiplicative updates in
nonnegative matrix factorisation with beta divergence, "MLSP".
Karan Nathwani, Morgane Daniel, Gaël
Richard, Bertrand David and Vincent Roussarie, (2016), Formant
shifting for speech Intelligibility improvement in car noise
environment, "ICASSP", Shanghai, Chine.
Simon Durand, Juan P. Bello, Bertrand
David and Gaël Richard, (2016), Feature Adapted Convolutional
Neural Networks for Downbeat Tracking, "ICASSP ", Shanghai, Chine.
Umut Simsekli, Roland Badeau, Gaël
Richard and Ali Taylan Cemgil, (2016), Stochastic thermodynamic
integration: efficient Bayesian model selection via stochastic
gradient MCMC, "ICASSP", Shanghai, China.
Victor Bisot, Romain Serizel, Slim
Essid and Gaël Richard, (2016), Acoustic scene classification with
matrix factorization for unsupervised feature learning, "ICASSP",
Shangai, CHine.
Romain Serizel, Slim Essid and Gaël
Richard, (2016), Group nonnegative matrix factorisation with
speaker and session variability compensation for speaker
identification, "ICASSP", Shangai, Chine.
Simon Leglaive, R. Badeau
and Gaël Richard, (2015), Multichannel audio source separation
with probabilistic reverberation modeling, "WASPAA", New Paltz,
New York, USA.
Simon Durand, Juan P. Bello, Bertrand
DAVID and Gaël Richard, (2015), Downbeat tracking with multiple
features and deep neural networks, "ICASSP 2015", Brisbane,
Australie.
Hequn Bai, Laurent Daudet and Gaël
Richard, (2015), Geometric‑Based Reverberator Using Acoustic
Rendering Networks, "IEEE Workshop on Applications of Signal
Processing to Audio and Acoustics (WASPAA)", New Paltz, New York,
U.S.A.
Clément
Laroche, Matthieu Kowalski, Hélène Papadopoulos et Gaël Richard,
(2015), Méthode Structurée de décomposition en matrices
nonnégatives appliquée à la séparation de sources audio, "GRETSI",
Lyon.
Victor Bisot, Slim
Essid and Gaël Richard, (2015), HOG and Subband power distribution
image features for acoustic scene classification, "EUSIPCO", Nice,
France, pp. 719 723.
Clément Laroche,
Matthieu Kowalski, Hélène Papadopoulos and Gaël Richard, (2015), A
structured nonnegative matrix factorization for source separation,
"EUSIPCO", Nice.
Simon
Leglaive, R. Badeau et Gaël Richard, (2015), A priori probabiliste
anéchoïque pour la séparation sous‑déterminée de sources sonores
en milieu réverbérant, "Colloque GRETSI", Lyon, France.
Camila de Andrade Scatolini, Gaël
Richard et Benoît Fuentes, (2015), Multipitch estimation using a
PLCA‑based model: impact of partial user annotation, "ICASSP",
Brisbane,
Emmanouil Benetos, R. Badeau, Tillman
Weyde et Gaël Richard, (2014), Template adaptation for improving
automatic music transcription, "ISMIR 2014", Taipei, Taiwan.
Benoît Fuentes, R. Badeau et Gaël
Richard, (2014), Controlling the Convergence Rate to Help
Parameter Estimation in a PLCA‑based Model, "EUSIPCO", Lisbon,
Portugal.
Xabier Jaureguiberry, E. Vincent et Gaël
Richard, (2014), Multiple‑order non‑negative matrix factorization
for speech enhancement, in Proceedings of Interspeech, 2014.
Nicolás López, Yves Grenier, Gaël
Richard et Ivan Bourmeyster, (2014), Single Channel Reverberation
Suppression Based on Sparse Linear Prediction, "IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP)",
Florence, Italy.
Xabier Jaureguiberry, E. Vincent et Gaël
Richard, (2014), Variational Bayesian model averaging for audio
source separation, "SSP (Workshop on Statistical Signal
Processing)".
Aymeric Masurelle, S. Essid et Gaël
Richard, (2014), Gesture recognition using a NMF‑based
representation of motion‑traces extracted from depth silhouettes,
"IEEE International Conference on Acoustics, Speech, and Signal
Processing (ICASSP)", Florence, Italy.
Simon Durand, Bertrand DAVID et Gaël
Richard, (2014), Enhancing downbeat detection when facing
different music styles, "IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP)", Florence,
Italie, pp. 3152‑3156.G. Richard, (2014), Informed Audio source Separation, "AES International Conference on Semantic Audio", London, GB.
Manuel Moussallam, Alexandre Gramfort, Laurent Daudet et Gaël Richard, (2013), Débruitage Aveugle par Décompositions Parcimonieuses et Aléatoires, "GRETSI", Brest, France.
Sébastien
Fenet, Yves Grenier, Gaël Richard: An Extended Audio Fingerprint
Method with Capabilities for Similar Music Detection. ISMIR 2013:
569-57
Nicolás
López, Mounira Maazaoui, Yves Grenier, Gaël Richard et Ivan
Bourmeyster, (2013), Does dereverberation help multichannel blind
source separation? A study case, "European Signal Processing
Conference (EUSIPCO)", Marrakech, Maroc.
Antoine Liutkus, J.‑L.
Durrieu, Laurent Daudet et G. Richard, (2013), An overview of
informed audio source separation, "WIAMIS".
Antoine Liutkus, R.
Badeau et Gaël Richard, (2013), Low bitrate informed source
separation of realistic mixtures, "ICASSP", Vancouver, Canada, pp.
66‑70.
Aymeric Masurelle,
Slim Essid et Gaël Richard, (2013), MULTIMODAL CLASSIFICATION OF
DANCE MOVEMENTS USING BODY JOINT TRAJECTORIES AND STEP SOUNDS,
"International Workshop on Image and Audio Analysis for Multimedia
Interactive Services WIAMIS", Paris, France.
Konstantinos
Apostolakis*, Dimitrios Alexiadis, Petros Daras, David Monaghan,
Noel O’Connor, Benjamin Prestele, Peter Eisert, Gaël Richard,
Qianni Zhang, Ebroul Izquierdo, Maher Ben Moussa and Nadia
Magnenat, Blending real with virtual in 3Dlife, WIAMIS 2013, Paris
France
Xabier Jaureguiberry,
Gaël Richard, P. Leveau, Romain Hennequin et E. Vincent, (2013),
Introducing A Simple Fusion Framework For Audio Source Separation,
"Machine Learning for Signal Processing (MLSP)", Southampton, UK
Rémi Foucard, Slim
Essid, Gaël Richard and Mathieu Lagrange, Exploring new features
for music classification, WIAMIS 2013, Paris France
Sylvain
Marchand, R. Badeau, Cléo Barras, Laurent Daudet, Dominique
Fourer, L. Girin, Stanislaw Gorlow, Antoine Liutkus, Jonathan
Pinel, Gaël Richard, Nicolas Sturmel et Shuhua Zhang, (2012),
DReaM: a novel system for joint source separation and multi‑track
coding, "133rd AES Convention", San Francisco, USA.
Nicolás
López, Yves Grenier, Gaël Richard et Ivan Bourmeyster, (2012), Low
variance blind estimation of the reverberation time, "13th
International Workshop on Acoustic Signal Enhancement (IWAENC
2012)", Aachen, Germany.
Benoît
Fuentes, R. Badeau et G. Richard, (2012), Blind Harmonic Adaptive
Decomposition Applied to Supervised Source Separation, "20th
European Signal Processing Conference (EUSIPCO)", Bucharest,
Romania, pp. 2654‑2658.
Antoine
Liutkus, A. Ozerov, R. Badeau et G. Richard, (2012), Spatial
Coding‑based Informed Source Separation, "20th European Signal
Processing Conference (EUSIPCO)", Bucharest, Romania, pp.
2407‑2411.
Antoine
Liutkus, Stanislaw Gorlow, Nicolas Sturmel, Shuhua Zhang, L.
Girin, R. Badeau, Laurent Daudet, Sylvain Marchand et G. Richard,
(2012), Informed Audio Source Separation: A Comparative Study,
"20th European Signal Processing Conference (EUSIPCO)", Bucharest,
Romania, pp. 2397‑2401.
Sébastien
Fenet, Manuel Moussallam, Yves Grenier, Gaël Richard et Laurent
Daudet, (2012), A Framework for Fingerprint‑Based Detection of
Repeating Objects in Multimedia Streams, "EUSIPCO", Bucharest,
Romania, pp. 1464‑1468.
Manuel
Moussallam, Gaël Richard et Laurent Daudet, (2012), AUDIO SOURCE
SEPARATION INFORMED BY REDUNDANCY WITH GREEDY MULTISCALE
DECOMPOSITIONS, "European Signal Processing Conference", Bucarest,
Roumanie, pp. 2644‑2648.
Nicolas
Sturmel, Antoine Liutkus, Jonathan Pinel, L. Girin, Sylvain
Marchand, G. Richard, R. Badeau et Laurent Daudet, (2012), Linear
mixing models for active listening of music productions in
realistic studio conditions, "132nd AES Convention", Budapest,
Hongrie.
Rémi
Foucard, Slim Essid, Mathieu Lagrange et Gaël Richard, (2012), A
REGRESSIVE BOOSTING APPROACH TO AUTOMATIC AUDIO TAGGING BASED ON
SOFT ANNOTATOR FUSION, "IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP)", Kyoto, Japan.
Antoine
Liutkus, Zafar Rafii, R. Badeau, Bryan Pardo et G. Richard,
(2012), Adaptive filtering for music/voice separation exploiting
the repeating musical structure, "37th International Conference on
Acoustics, Speech, and Signal Processing ICASSP’12", Kyoto, Japan,
pp. 53‑56.
Benoît
Fuentes, Antoine Liutkus, R. Badeau et G. Richard, (2012),
Probabilistic model for main melody extraction using constant‑Q
transform, "37th International Conference on Acoustics, Speech,
and Signal Processing ICASSP’12", Kyoto, Japan, pp. 5357‑5360.
Manuel
Moussallam, Laurent Daudet et Gaël Richard, (2012), Random
time‑frequency Subdictionary design for sparse representation with
greedy algorithms, "ICASSP", Kyoto, Japon, pp. 3577‑3580.
Maksim
Khadkevich, T. Fillon, G. Richard et Maurizio Omologo, (2012), A
probabilistic approach to simultaneous extraction of beats and
downbeats , "IEEE International Conference on Acoustics, Speech
and Signal Processing (ICASSP)", Kyoto Japan, pp. 445‑448 .
Slim
Essid, Yves Grenier, Mounira Maazaoui, G. Richard et Robin
Tournemenne, (2011), An audio‑driven virtual dance‑teaching
assistant, "ACM Multimedia", Scottsdale, Arizona, USA.
Slim
Essid, Xinyu Lin, Marc Gowing, Georgios Kordelas, Anil Aksay,
Philip Kelly, Thomas Fillon, Qianni Zhang, Alfred Dielmann, Vlado
Kitanovski, Robin Tournemenne, N. E. O'Connor, Petros Daras et G.
Richard, (2011), A multimodal dance corpus for research into
real‑time interaction between humans in online virtual
environments , "ICMI WORKSHOP ON MULTIMODAL CORPORA FOR MACHINE
LEARNING", Alicante, Spain.
Alexey
Ozerov, Antoine Liutkus, R. Badeau et G. Richard, (2011), Informed
source separation: source coding meets source separation,
"Workshop on Applications of Signal Processing to Audio and
Acoustics (WASPAA)", New Paltz, New York, USA, pp. 257‑260.
Cyril
Joder, Slim Essid et G. Richard, (2011), Optimizing the Mapping
from a Symbolic to an Audio Representation for Music‑to‑Score
Alignment, "Workshop on Applications of Signal Processing to Audio
and Acoustics (WASPAA)", New Paltz, New York, USA.
Sébastien
Gulluni, Slim Essid, Olivier Buisson et G. Richard, (2011), An
Interactive System for Electro‑Acoustic Music Analysis, "ISMIR",
Miami, USA.
Sébastien
Fenet, Gaël Richard et Yves Grenier, (2011), A Scalable Audio
Fingerprint Method with Robustness to Pitch‑Shifting, "ISMIR",
Miami, USA, pp. 121‑126.
Rémi
Foucard, Slim Essid, Mathieu Lagrange et Gaël Richard, (2011),
Multi‑scale temporal fusion by boosting for music classification,
"ISMIR", Miami, USA, pp. 663‑‑668.
Benoît
Fuentes, R. Badeau et G. Richard, (2011), Analyse des structures
harmoniques dans les signaux audio : modéliser les variations de
hauteur et d’enveloppe spectrale, " XXIIIème Colloque GRETSI",
Bordeaux, France.
Sébastien
Fenet, Yves Grenier et Gaël Richard, (2011), Une empreinte audio à
base de CQT appliquée à la surveillance de flux radiophoniques,
"GRETSI", Bordeaux, France, pp. NA.
Sébastien
Gulluni, Slim Essid, Olivier Buisson et G. Richard, (2011),
Interactive Classification of Sound Objects for Polyphonic
Electro‑Acoustic Music Annotation, "AES Conference", Ilmenau,
Allemagne.
Antoine
Liutkus, R. Badeau et G. Richard, (2011), Multi‑dimensional signal
separation with Gaussian processes, "IEEE Workshop on Statistical
Signal Processing (SSP2011)", Nice, France.
Benoît
Fuentes, R. Badeau et G. Richard, (2011), Adaptive harmonic
time‑frequency decomposition of audio using shift‑invariant PLCA,
"36th International Conference on Acoustics, Speech, and Signal
Processing ICASSP’11", Prague, Czech Republic.
O.
Derrien, R. Badeau et G. Richard, (2011), Entropy‑constrained
quantization of exponentially damped sinusoids parameters, "36th
International Conference on Acoustics, Speech, and Signal
Processing ICASSP’11", Prague, Czech Republic.
C.
Joder, Slim Essid et G. Richard, (2011), Hidden Discrete Tempo
Model: a Tempo‑aware Timing Model for Audio‑to‑Score Alignment,
"ICASSP", Prague, Rep. Tchèque.
Felix
Weninger, J.‑L. Durrieu, Florian Eyben, G. Richard et Bjorn
Schüller, (2011), COMBINING MONAURAL SOURCE SEPARATION WITH LONG
SHORT‑TERM MEMORY FOR INCREASED ROBUSTNESS IN VOCALIST GENDER
RECOGNITION, "ICASSP 2011", Prague.
Manuel
Moussallam, Laurent Daudet et G. Richard, (2011), Audio Signal
Representations for Factorization in the sparse domain, "ICASSP",
Prague, Czech, pp. 513‑516.2010
F. Vallet, Slim Essid, J. Carrive and G. Richard, "ROBUST VISUAL
FEATURES FOR THE MULTIMODAL IDENTIFICATION OF UNREGISTERED
SPEAKERS IN TV TALK-SHOWS", Proc. of ICIP, Oct 2010.
C. Joder, Slim Essid and G. Richard, "A Conditional Random Field
Viewpoint of Symbolic Audio-to-Score Matching", Proc. of ACM
Multimedia, oct 2010, Firenze, Italy
M. Moussalam, T. Fillon, G. Richard et L. Daudet, "How
Sparsely Can a Signal be Approximated while Keeping its Class
Identity?", Proc. of MML10, satellite workshop of ACM Multimedia,
oct 2010, Firenze, Italy
Antoine
Liutkus, R. Badeau and G. Richard, "Informed source separation
using latent components", Proc. of ICA/LVA 2010, St Malo, France
C. Joder, Slim Essid and G. Richard, "An Improved
Hierarchical Approach for Music-to-Symbolic Score Alignment",
Proc. of ISMIR 2010, Utrecht, Netherlands.
B. Mathieu, Slim Essid, T. Fillon, J. Prado and G.
Richard, "YAAFE, AN EASY TO USE AND EFFICIENT AUDIO FEATURE
EXTRACTION SOFTWARE", Proc. of ISMIR 2010, Utrecht, Netherlands.
S. Bozonnet, F. Vallet, N. Evans, Slim Essid, J.
Carrive and G. Richard, "A multimodal approach to initialisation
for top-down speaker diarization of television shows", Proc. of
Eusipco 2010, Aalborg, Denmark
Rémi Foucard, J.-L. Durrieu, Mathieu Lagrange and G.
Richard, "Multimodal similarity between musical streams for cover
version detection", Proc. of ICASSP 2010, Dallas, USA.
Mathieu Lagrange, R. Badeau and G. Richard, "Robust
similarity metrics between audio signals based on asymmetrical
spectral envelope matching", Proc. of ICASSP 2010, Dallas, USA.
E. Dupraz and G. Richard, "Robust frequency-based
audio fingerprinting", Proc. of ICASSP 2010, Dallas, USA.
C. Joder, Slim Essid and G. Richard, "A Comparative
Study of tonal acoustic Features for a Symbolic Level
Music-to-Score Alignment", Proc. of ICASSP 2010, Dallas, USA.
Rémi Foucard, J.-L. Durrieu, Mathieu Lagrange and G.
Richard, "Multimodal similarity between musical streams for cover
version detection", Proc. of ICASSP 2010, Dallas, USA.
J. Weil, J.-L. Durrieu, G. Richard et Thomas Sikora,
"Automatic Generation of Lead Sheets from Polyphonic Music
Signals", Proc. of ISMIR 2009, Kobe, Japan, 2009.
M. Ramona, G. Richard, "Comparison of different
strategies for a SVM-based audio segmentation", In Proc. of
European Conference onSignal Processing, EUSIPCO'09, Sept. 2009,
Glasgow, UK.
J.L Durrieu, A. Ozerov, C. Févotte, G. Richard and B.
David, "Main Instrument Separation from Stereophonic Audio Signals
using a Source/Filter Model", In Proc. of European Conference on
Signal Processing, EUSIPCO'09, Sept. 2009, Glasgow, UK.
C. Joder, S. Essid, G. Richard, "Etude des
descripteurs acoustiques pour l’alignement temporel
audio-sur-partition musicale", in Proc. of Colloque GRETSI, Sept.
2009 (in French).
J.L Durrieu, A. Ozerov, C. Févotte, G. Richard and B. David, "An
Iterative Approach to Manaural Musical Mixture De-soloing" IEEE
International Conference on Acoustics, Speech, and Signal
Processing, ICASSP'09, Taipei, Taiwan, April 2009.
M. Lardeur, S. Essid, G. Richard, M. Haller and T. Sikora,
"Incorporating Prior Knowledge on the Digital Media Creation
Process into Audio Classifiers" IEEE International Conference on
Acoustics, Speech, and Signal Processing, ICASSP'09, Taipei,
Taiwan, April 2009.
E. Ravelli, G. Richard and L. Daudet, "Fast MIR in a sparse
transform domain, In Proc of International Conference on Music
Information Retrieval, ISMIR 2008, Sept. 2008, Philadelphia, USA.
E. Ravelli, G. Richard and L. Daudet, "Matching Pursuit in
Adaptive Dictionaries for Scalable Audio Coding, In Proc of
European Conference onSignal Processing, EUSIPCO'08, Sept. 2008,
Lausanne, Switzerland.
S.
Wegener, M. Haller, J-JBurred, T. Sikora, S. Essid and G. Richard,
"On the Robustness of Audio Features for Musical Instrument
Classification, In Proc of European Conference onSignal
Processing, EUSIPCO'08, Sept. 2008, Lausanne, Switzerland.
C. Joder, S. Essid and G. Richard, "Alignment Kernels for Audio
Classification with Application to Music Instrument Recognition,
In Proc of European Conference onSignal Processing, EUSIPCO'08,
Sept. 2008, Lausanne, Switzerland.
M. Ramona, G. Richard, "Segmentation parole/musique par Machines à
Vecteurs de Support", Journées d'Etude de la Parole JEP 2008,
Avignon, France.
J-L. Durrieu, G. Richard and B. David, "Singer melody
extraction in polyphonic signals using source separation methods,
IEEE International Conference on Acoustics, Speech, and Signal
Processing, ICASSP'08, LAs Vegas, USA, April 2008.
M. Ramona, G. Richard, B. David, "Vocal detection in music
with support vector machines", IEEE International Conference on
Acoustics, Speech, and Signal Processing, ICASSP'08, LAs Vegas,
USA, April 2008.
O. Gillet and G. Richard, "Supervised and Unsupervised Sequence
Modelling for Drum Transcription, In Proc of 8th International
Conference on Music Information Retrieval, ISMIR 2007, Sept. 2007,
Vienna, Austria.
E. Ravelli, G. Richard and L. Daudet, "Extending fine-grain
scalable audio coding to very low bitrates using overcomplete
dictionaries," Proc. of IEEE Workshop on Applications of Signal
Processing to Audio and Acoustics (WASPAA'07), october 2007.
G. Richard, P. Leveau, L. Daudet, S. Essid
and B. David, "Towards polyphonic musical instrument recognition",
19th International Congress on Acoustics (ICA), Madrid, 2-7
september 2007.
N. Bertin, R. Badeau and G. Richard, "Blind
signal decompositions for automatic transcription of polyphonic
music: NMF and K-SVD on the benchmark", IEEE International
Conference on Acoustics, Speech, and Signal Processing, ICASSP'07,
Honolulu, Hawaii, USA, 15-20 avril 2007.
R. Badeau, B. David and G. Richard,
"Conjugate gradient algorithms for minor subspace analysis," IEEE
International Conference on Acoustics, Speech, and Signal
Processing, ICASSP'07, Honolulu, Hawaii, USA, 15-20 avril 2007.
G. Richard, M. Ramona and S. Essid,
"Combined supervised and unsupervised approaches for automatic
segmentation of radiophonic audio streams",IEEE International
Conference on Acoustics, Speech, and Signal Processing, ICASSP'07,
Honolulu, Hawaii, USA, 15-20 avril 2007.
C. Clavel, L. Devillers, G.
Richard, I. Vasilescu and T. Ehrette, "Abnormal Situation
Detection And Analysis Through Fear-Type Acoustic Manifestations
,IEEE International Conference on Acoustics, Speech, and Signal
Processing, ICASSP'07, Honolulu, Hawaii, USA, 15-20 avril 2007.
K. McGuinness, O. Gillet, N. E. O'Connor,
and G. Richard,Visual Analysis for Drum Sequence Transcription,
EUSIPCO 2007 - Proceedings of the 15th European Signal Processing
Conference, Poznan, Poland, 3-7 September 2007,
P. Leveau, E. Vincent, L. Daudet, G.
Richard,. Mid-level sparse representations for timbre
identification: design of an instrument-specific harmonic
dictionary,” 1st Workshop on Learning the Semantics of Audio
Signals (LSAS 2006), Athens, Greece, December 2006.
C. Clavel, I. Vasilescu, L.
Devillers, T. Ehrette and G. Richard. “Fear-type emotions of the
Safe corpus:annotation issues.” In Proc. of LREC 2006, Genoa,
Italy, May 2006.
M. Betser, P. Collen, G. Richard.
“Frequency Estimation Based on Adjacent DFT bins”, in Proc of the
European Signal Processing Conference (EUSIPCO-2006), Sept. 2006,
Florence, Italy.
M. Betser, P. Collen, G. Richard,
B. David, “Review and Discussion on Classical STFT-Based Frequency
Estimators”, International Convention of the Audio Engineering
Society (AES), Paris, France, May 2006
O. Gillet & G. Richard,
“ENST-Drums: an extensive audio-visual database for drum signals
processing”. In Proc of 7th International Conference on Music
Information Retrieval, ISMIR 2006, Oct. 2006, Victoria, Canada.
S. Essid, G. Richard and B. David,
“Hierarchical Classification of Musical Instruments on Solo
Recordings” IEEE International Conference on Acoustics, Speech,
and Signal Processing, ICASSP'06, Toulouse, France, 15-19 mai
2006,
R. Badeau, B. David and G. Richard “YAST
Algorithm for Minor Subspace Tracking”, IEEE International
Conference on Acoustics, Speech, and Signal Processing, ICASSP'06,
Toulouse, France, 15-19 mai 2006, vol. III, pp. 552-555
O. Gillet and G. Richard,
“Comparing Audio and Video Segmentations for Music Videos
Indexing”. IEEE International Conference on Acoustics, Speech, and
Signal Processing, ICASSP'06, Toulouse, France, may 2006.
B. David, R. Badeau and G. Richard,
“HRHATRAC Algorithm for Spectral Line Tracking of Musical Signals,
IEEE International Conference on Acoustics, Speech, and Signal
Processing, ICASSP'06, Toulouse, France, 15-19 mai 2006, vol. III,
pp. 45-48
C. Clavel, I. Vasilescu, L. Devillers, G.
Richard, T. Ehrette, and C. Sedogbo. “The SAFE Corpus:
illustrating extreme emotions in dynamic situations.” In Proc. of
LREC Workshop on Corpora for Research on Emotion and Affect,
Genoa, Italy, May 2006.
C. Clavel, I. Vasilescu, G. Richard
and L. Devillers. “Voiced and Unvoiced content of fear-type
emotions in the SAFE Corpus.” In Proc. of Speech Prosody 2006,
Dresden, Germany, May 2006.
R.
Badeau, G. Richard and B. David, “Fast adaptive ESPRIT algorithm”,
International Conference on Statistical Signal Processing
(SSP’05), Bordeaux, France, July 2005.
O.
Gillet & G. Richard, “Extraction and Remixing of Drum Tracks
from Polyphonic Music Signals”. IEEE Workshop on Applications of
Signal Processing to Audio and Acoustics, WASPAA'05, New Paltz,
USA.
S.
Essid, G. Richard, B. David,”Inferring Efficient Hierarchical
Taxonomies for MIR Tasks: Application to Musical Instruments”,
International Conference on Music Information Retrieval (ISMIR),
London, Great-Britain, Sept. 2005
O.
Gillet, Gael Richard,”Drum Track Transcription of Polyphonic Music
Using Noise Subspace Projection”, International Conference on
Music Information Retrieval (ISMIR), London, Great-Britain, Sept.
2005.
C.
Clavel, T. Ehrette and G. Richard, “Events Detection for An
Audio-Based Surveillance System”, International Conference on
Multimedia and Expo (IEEE-ICME’05), Amsterdam, The Netherlands,
July 2005.Miguel Alonso, Gaël Richard and Bertrand David, “Extracting Note Onsets from Musical Recordings”, International Conference on Multimedia and Expo (IEEE-ICME’05), Amsterdam, The Netherlands, July 2005
O.
Gillet and G. Richard, “Indexing and querying drum loops
databases”, International workshop on Content Based on Multimedia
and Indexing (CBMI’05), Riga, Latvia, June 2005. Received the CBMI
BEST PAPER Award
S.
Essid, P. Leveau, G. Richard, L. Daudet and B. David, “On the
usefulness of differentiated transient/steady-state processing in
machine recognition of musical instruments”, International
Convention of the Audio Engineering Society (AES), Barcelona,
Spain, May 2005
R.
Badeau, B. David and G. Richard, “Yet Another Subspace Tracker”,
International Conference on Acoustics, Speech, and Signal
Processing ICASSP’05, Philadelphia, USA, March 2005.
S.
Essid, G. Richard and B. David, “Instrument recognition in
polyphonic music”, International Conference on Acoustics, Speech,
and Signal Processing ICASSP’05, Philadelphia, USA, March 2005.
M.
Guillaume, Y. Grenier and G. Richard, “Iterative Algorithms for
Multichannel Equalization in Sound Reproduction Systems”,
International Conference on Acoustics, Speech, and Signal
Processing ICASSP’05, Philadelphia, USA, March 2005.
O.
Gillet and G. Richard, “Automatic Transcription of Drum Sequences
using Audiovisual Features”, International Conference on
Acoustics, Speech, and Signal Processing ICASSP’05, Philadelphia,
USA, March 2005.
P.
Leveau, L. Daudet et G. Richard ,”Methodology and Tools for the
evaluation of automatic onset detection algorithms in music”,
International Symposium on Music Information Retrieval (ISMIR),
Barcelone, Espagne, oct 2004.
M.
Alonso, B. David et G. Richard, “Tempo And Beat Estimation Of
Musical Signals” International Symposium on Music Information
Retrieval (ISMIR), Barcelone, Espagne, oct 2004.
S.
Essid, G. Richard et B. David, “Musical instrument recognition
based on class pairwise feature selection”, International
Symposium on Music Information Retrieval (ISMIR), Barcelone,
Espagne, oct 2004. M. Guillaume, Y. Grenier et G. Richard ,”Iterative Algorithms for Multichannel Equalization “, 23rd VDT International Audio Convention, Leigzig, Allemagne, Nov. 2004.
S.
Essid, G. Richard et B. David « Musical instrument recognition on
solo performances », European Signal Processing Conference
EUSIPCO’04, Vienna, Austria, Sept. 7-10, 2004.
S.
Essid, G. Richard et B. David « Efficient musical instrument
recognition on solo performance music using basic features», 25th
International AES Conference, London, UK, June 17-19, 2004.
O.
Gillet et G. Richard, “Automatic transcription of drum loops”,
International Conference on Acoustics, Speech, and Signal
Processing ICASSP’04, Montréal, Québec, 17-21 mai 2004
R.
Badeau, B. David et G. Richard “Selecting the modeling order for
the ESPRIT high resolution method: an alternative approach”
International Conference on Acoustics, Speech, and Signal
Processing ICASSP’04, Montréal, Québec, 17-21 mai 2004
O.
Gillet et G. Richard , “Automatic Labelling of Tabla Signals”,
Proc of ISMIR 2003, Baltimore, USA Oct. 2003
R.
Badeau, K. Abed-Meraim, G. Richard et B. David, Sliding Window
Orthonormal PAST Algorithm, Proceedings of the 2003 International
Conference on Acoustics, Speech, and Signal Processing ICASSP’03,
Hong Kong, Chine, 6-10 avril 2003, vol. V, pp. 261-264
R.
Badeau, G. Richard et B. David, Adaptive ESPRIT algorithm based on
the PAST subspace tracker, Proceedings of the 2003 International
Conference on Acoustics, Speech, and Signal Processing ICASSP’03,
Hong Kong, Chine, 6-10 avril 2003, vol. VI, pp. 229-232
R.
Badeau, G. Richard, B. David et K. Abed-Meraim, Approximated power
iterations for fast subspace tracking, Proceedings of the 7th
International Symposium on Signal Processing and its Applications
ISSPA 2003, Paris, France, 1-4 juillet 2003, vol. II, pp. 583-586
B.
David, G. Richard, R. Badeau, An EDS modelling tool for tracking
and modifying musical signals, Stockholm Music Acoustics
Conference 2003, Stockholm, Suède, 6-9 août 2003
R.
Badeau, G. Richard et B. David, Suivi d'espace dominant par la
méthode des puissances itérées, 19ème colloque GRETSI sur le
traitement du signal et des images, Paris, France, 8-11 septembre
2003, (in French)
M.
Alonso, R. Badeau, B. David et G. Richard, Musical tempo
estimation using noise subspace projections, IEEE Workshop on
Applications of Signal Processing to Audio and Acoustics (WASPAA
'03), New Paltz, New York, 19-22 octobre 2003,
M.
Alonso, B. David et G. Richard, Tempo Tracking Algorithms from
Polyphonic Music Signals”. A Study of Tempo Tracking Algorithms
from Polyphonic Music Signals”. 4th COST 276 Workshop, France, Mar
2003.
G. Richard. “Towards large databases for Music Information
Retrieval systems development and evaluation”, White Paper,
Evaluation Panel, ISMIR 2002
B.
David, R. Badeau, G. Richard, Sintrack analysis for tracking
components of musical signals, Proc. of the Forum Acusticum
Sevilla 2002, Séville, Espagne, 16-20 septembre 2002
A.
Moreno, B. Lindberg, C. Draxler, G. Richard, K. Choukri, S.
Euler, J. Allen. “SPEECH DAT CAR. A Large Speech Database For
Automotive Environments”, Proc. of LREC 2000, Athens, June 2000.Richard G., “The SpeechDat-Car Project: Overview of a very large multilingual speech database recorded in cars”, Proc. of XLDB 2000 (satellite workshop to LREC2000), Athens, Greece, May 29th, 2000.
Van den Heuvel
H., Boudy J., Comeyne R., Euler S., Moreno A., Richard G., "The
SpeechDat-Car Multilingual Speech databases for in-car
applications: some first validation results", Proc. of
Eurospeech'99, Budapest, Sept. 1999.
Sala M., Sánchez
F., Wengelnik H., Van den Heuvel H., Moreno A., Deregibus E.,
Richard G., Le Chevalier E., "SpeechDat-Car : Speech Databases for
Voice Driven teleservices and control of in-car applications",
EAEC Congress, Spain, July 1999.
Van den Heuvel
H., Bonafonte A., Boudy J., Dufour S., Lockwood P., Moreno A.,
Richard G., "SpeechDat-Car : Towards a collection of Speech
Databases for Automotive Environment", Cost 249 Workshop on
Speech Recognition Robustness, Finland, June 1999.
Richard G.,
Menguy Y. , Guis I., Lockwood P., "Secured Access to terminals and
teleservices using biometrics verification", Proc. of COST 254,
May 1999, Lausanne, Switzerland.
Richard G, Menguy
Y., Guis I., Suaudeau N., Boudy J. & al. "Multi Modal
Verification for Teleservices and Security Applications (M2VTS)",
Proc. of IEEE-ICMCS'99, Firenze, June 1999.
Tefas A., Menguy
Y., Kotropoulos C., Richard G., Pitas I., Lockwood P., «
Compensating for variable recording conditions in frontal face
authentication algorithms », Proc. of IEEE - ICASSP99, May 1999,
Phoenix, USA.
Richard G. &
al. "The M2VTS project: towards Multi Modal Verification for
Teleservices and Security Applications", Proc. of ECMAST'98, May
1998.
Chennoukh S.,
Sinder D., Richard G., Flanagan J.L., ``Improved Techniques for
Voice Mimic Systems Using Articulatory Codebooks,'' in Proceedings
of EUROSPEECH '97, (Rhodes, Greece), pp. 429--432, Sept. 1997.
Richard G., Le
Doré A., Sibade C., Boudy J., Lockwood P., Horbach H., Rosenthal
M., "Audio Coding and 3D Sound Simulation", Proc. of ADVICE'97,
Bristol, Great-Britain.Levinson S, Krane M, Kubli R., Coker C., Flanagan J.L., Richard G., Sinder D., Davis D., Slimon S., "Studying the effects of fluid dynamics on speech production", in Proceedings of the International Symposium on Simulation, Visualization and Auralization for Acoustic Research and Education (ASVA97), (Tokyo, Japan), Apr. 1997.
Richard G.,
Goirand M., Sinder D., Flanagan J.L., ``Simulation and
visualization of articulatory trajectories estimated from speech
signals,'' in Proceedings of the International Symposium on
Simulation, Visualization and Auralization for Acoustic Research
and Education (ASVA97), (Tokyo, Japan), Apr. 1997.
Sinder D.,
Richard G., Duncan H., Flanagan J., Slimon S., Davis D., Krane M.,
Levinson S., ``Flow visualization in stylized vocal tracts,'' in
Proceedings of the International Symposium on Simulation,
Visualization and Auralization for Acoustic Research and Education
(ASVA97), (Tokyo, Japan), Apr. 1997.Levinson S., Krane M., Slimon S., Richard G., Sinder D., Duncan H., Lin Q., Flanagan J., Davis D., (1996), " Fluid flow measurements and simulations in stylized geometries of dental fricatives", Int. Conf. of Spoken Lang. Proc. (ICSLP96), Philadelphia, PA, Oct 3-6 1996.
Richard G.,
Sinder D., Duncan H., Lin Q., Flanagan J., Levinson, S., Krane M.,
Davis D., Slimon S., (1996). "Low Mach Number, Low Reynolds
Number simulation of the fluid flow in the vocal tract", 2nd AIAA
Aeroacoustics Conference, Pen State University, May 4-7 1996.
Sinder D., Richard G., Duncan H., Lin Q., Flanagan J.,
Levinson S., Davis D., Slimon S., (1996). ``A fluid flow approach
to speech generation'', in the proceedings of the first ESCA
Tutorial and Research Workshop on Speech Production Modeling: From
control strategies to Acoustic, Autrans, France, May 21-24, 1996.Richard G., Liu M., Sinder D., Duncan H., Lin Q., Flanagan J., Levinson S., Davis D., Slimon S., (1995). ``Numerical simulations of fluid flow in the vocal tract,'' Proc. of Eurospeech, Madrid, Spain, septembre 18-21, pp. 1297-1300.
Richard G., d'Alessandro C., (1994). "Time-domain Analysis and synthesis of speech noises", ESCA/IEEE Workshop on Speech Synthesis, New Paltz, NY, USA, Sept. 22-25.
Grau S., d'Alessandro C. & Richard G., (1993). "A speech formant synthesizer based on harmonic + random formant-waveforms representations", Proc. of EUROSPEECH., Sept. 1993, Berlin, Allemagne.
d'Alessandro C., Richard G., (1992). "Random wavelet representation of unvoiced speech", IEEE symposium on Time-Frequency and Time-Scale Analysis, Victoria, British Columbia, Canada, Oct. 4-6, 1992.
Richard G., d'Alessandro C., Grau S. (1992). "Unvoiced speech analysis and synthesis using Poissonian random formant wave functions", Proc of 6th Eur. Signal Processing Conf., Aou. 25-281992, Bruxelles, in Signal Processing VI, Theories and applications, Elsevier Science Publishers, Arnsterdam
Richard G., d'Alessandro C., Grau S. (1992). "Synthèse de bruits par Formes d'Ondes Formantiques aléatoires", (en français), Proc. of l9th "Journées d'études sur la Parole, 19-22 mai, 1992, Bruxelles, Belgique (in French)
Richard G., d'Alessandro C. & Grau S., (1993). "Musical noise synthesis using random waveforms", Proc. of Stockholm Musical Acoustic Conference (SMAC93). Juil. 1993, Stockhohm, Suède.
Castellengo M., Richard G., d'Alessandro C. (1989). "Study of vocal pitch vibrato perception using synthesis", Proc. of 13th Int. Cong. on Acoust., Aou. 24-31, 1989, Yougoslavie (Belgrade).
Other Publications: Specific publications (summary only)
Olivier Derrien, Gaël Richard et Roland Badeau,, Damped sinusoids and subspace based approach for lossy audio coding, Acoustics’08, Paris, France, 29 juin - 4 juillet 2008
Durrieu J.-L., Richard G. and David B.,Single sensor singer/music separation using a source/filter model of the singer voice , Acoustics’08, Paris, France, 29 juin - 4 juillet 2008
B. David, R. Badeau, N. Bertin, V. Emiya et G. Richard, Multipitch detection for piano music: Benchmarking a few approaches,, The Journal of the Acoustical Society of America, novembre 2007, vol. 122, no. 5, pp. 2962
S. Chennoukh, D. Sinder, G. Richard and J. Flanagan, "Articulatory based low bit-rate speech coding", J. Acous. Soc. Of Amer., vol. 102, no. 5, p. 3163, Nov. 1997
Zussa F., Lin Q., Richard G., Sinder D., Flanagan J., (1995). ``Open-loop acoustic-to-articulatory mapping,'' JASA, Vol. 98, No 5, Pt 2, novembre 1995, pp2931.
Richard G., Lin Q., Zussa F., Sinder D., Che C., Flanagan J., (1995). ``Vowel recognition using an articulatory representation,'' JASA, Vol. 98, No 5, Pt 2, novembre 1995, pp2965-2966.
Richard G., Liu M., Sinder D., Duncan H., Lin Q., Flanagan J., Levinson S., Davis D., Slimon S., (1995). ``Vocal tract simulations based on fluid dynamic analysis'', JASA, Vol. 97, No 5, Pt 2, Mai 1995, pp3245.
Lin Q., Richard G., Zou J., Sinder D., Flanagan J., (1995). ``Use of TRACTTALK for adaptive voice mimic,'', JASA, Vol. 97, No 5, Pt 2, Mai 1995, pp3247
Internal reports
Richard G., (1990). Rules for fundamental frequency transition in singing synthesis, Trita/tom-90/03, ISSN 0280-9850, Mars 1990, Institut Royal de Technologie, Stockholm, Suede.
d'Alessandro C., Richard G., (1992). Random representation of speech noises, Notes et Documents LIMSI, 92-7,0rsay.
Richard G., Sinder D., Duncan H., Flanagan J., Levinson S., Krane M., Davis D., Slimon S.," Computational models for Speech Generation", CAIP Update, Vol 10, N°3, 1996.
MPEG input documents
Matra Nortel Com., “Test results of the evaluation of G722 compared to AMR-EFR and AMR-7.4 kbit/s”, ETSI SMG11-SQ, Tdoc SMG11, June 1999.
G. Richard, C. Venot,, “Report of practical complexity evaluation of an optimised HILN decoder”, March 1998, document M3293, Tokyo, Japan.
G. Richard, A. Le Doré, P. Lockwood, ,"Test results on speech codecs (MPEG4 CELP, G723.1, Scalable Speech codec based on G723.1 (M2917))", July 1998, document m3758, Dublin, Ireland.
C. Sibade, S. Weiss, A. Ledore, G. Richard, "MPEG4 Audio demonstrator", July 1998, document m3783, Dublin, Ireland.
G. Richard, C. Murgia, J-L Bonifas, A. Le Dore, P. Lockwood "Revised technical description of Matra's scalable speech/audio codec", Oct. 1997, input Document M2917, Fribourg, Switzerland.
G. Richard, A. Le Dore, "Results of Core Experiment on an extension of the narrow-band CELP VM coder to a bandwidth scalable CELP (m2486)", Oct. 1997, input Document M2682, Fribourg, Switzerland.
G. Richard, "Results of Core Experiment on Lossless Coding in the CELP core of the MPEG-4 Audio VM (m2495)", Oct. 1997, input Document M2698, Fribourg, Switzerland.
G. Richard, C. Murgia, J-L Bonifas, A. Le Dore, P. Lockwood, "Results of Core Experiment on Matra's low to medium bit rates scalable audio/speech codec (m2346)" , Oct. 1997, input Document M2699, Fribourg, Switzerland.
G. Richard, A. Le Doré, C. Murgia, C. Lacas, P. Lockwood “A Scalable Audio and Speech coder based on a Core Coder”, July 97, input document MPEG97/ M2346, Stockholm, Sweden.
P. Bonnard, G. Richard, C. Sibade, F. Rigoulet, “An implementation of graceful-degradation concept in a 3D audio compositor”, April 1997, document m1998, Bristol, UK.
G. Richard, P. Bonnard, A. Le Doré, “Solution for a Scalable Audio and Speech Coder based on Core Coders”, April 97, input Document MPEG97/M1997, Bristol, 1997.
J. Klaine, G. Richard, “Discrepancies between the MPEG-4 audio TTS reference software and written document (N2503, subpart 6)”, input Document MPEG99/M4435, Seoul, March 1999.
© Copyright MyWebSite.
Design by Dream Web Templates
