If you want to have the list of publications which belongs to these following topics, write in the search field the category's name (cat-1, ...)
  • cat-1: Infrastructure, data collection (incl. scenarios) and data management (annotation and standardization)
  • cat-2: Audio and speech processing (tracking, ASR, etc)
  • cat-3: Visual and joint audio-visual processing
  • cat-4: Multimodal structure and content analysis
  • cat-5: Human factors, HCI, system evaluation, and applications
  • cat-6: Project overviews
  • If you want to display all the publications of a specific author, use the shortcut called -Authors- located in the main menu
 



All publications in the database, sorted on year



2010

AMIDA/Klewel Mini-Project, Petr Motlicek, Philip N. Garner, Mael Guillemot and Vincent Bozzo, number Idiap-RR-03-2010, 2010.
 
An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features, David Imseng and G. Friedland, in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010.
 
Application of Out-Of-Language Detection To Spoken-Term Detection, Petr Motlicek and Fabio Valente, in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010.
 
Cascaded Model Adaptation for Dialog Act Segmentation and Tagging, U. Guz, Gokhan Tur, D. Hakkani-Tür and S. Cuendet, in: Journal of Computer Speech and Language, 2010.
 
Catchup: A Useful Application of Time-Travel in Meetings, Simon Tucker, A. Ramamoorthy, O. Bergman and Steve Whittaker, in: Proceedings of CSCW, 2010.
 
Differences in head orientation behavior for speakers and listeners: An experiment in a virtual environment, Rutger Rienks, Ronald Poppe and Dirk Heylen, in: ACM Transactions on Applied Perception, volume 7, number 1, pages 1-13, 2010.
 
Face Alignment Using Boosting and Evolutionary Search, Hua Zhang, Duanduan Liu, Mannes Poel and Anton Nijholt, in: Proceedings of the Asian Conference on Computer Vision (ACCV) 2009, pages 110-119, Springer Lecture Notes, 2010.
 
Leveraging Speaker Diarization for Meeting Recognition from Distant Microphones, D. Imseng A. Stolcke, G. Friedland, in: IEEE ICASSP, Dallas, TX (USA), 2010.
 
Machine Learning for Human Motion Analysis: Theory and Practice, Ronald Poppe, pages 55-73, chapter Common Spa, IGI Global, ISBN 978-1-60566-900-7, 2010.
 
On-line Video Synchronization Based on Visual Vocabularies, Vítezslav Beran, Adam Herout and Pavel Zemčík, in: Proceedings of WSCG'10, pages 7, University of West Bohemia in Pilsen, Plzeň, CZ, 2010.
 
Tuning-Robust Initialization Methods for Speaker Diarization, G. Friedland D. Imseng, in: IEEE Transactions on Audio, Speech and Language Processing, 2010.
 
Using Audio and Visual Cues for Speaker Diarisation Initialisation, Giulia Garau and Hervé Bourlard, in: Proc. International Conference on Acoustics, Speech and Signal Processing, 2010.
 

2009

A Human Benchmark for Language Recognition, Rosemary Orr and David van Leeuwen, in: Proc.Interspeech, ISCA, 2009.
 
A Multimedia Retrieval System Using Speech Input, Andrei Popescu-Belis, Peter Poller, J. Kilgour, Erik Boertjes, Jean Carletta, Sandro Castronovo, Michal Fapso, Alexandre Nanchen, Theresa Wilson, Joost de Wit and Majid Yazdani, in: Proceedings of ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), Cambridge, MA, 2009.
 
A Parallel Training Algorithm for Hierarchical Pitman-Yor Process Language Models, Songfang Huang and Steve Renals, in: Proc. Interspeech'09, 2009.
 
Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, Andrei Popescu-Belis, Jean Carletta, J. Kilgour and Peter Poller, in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, pages 189-206, Springer-Verlag, 2009. [DOI]
 
Agreement Detection in Multiparty Conversation, Sebastian Germesin and Theresa Wilson, in: ICMI-MLMI '09, 2009.
 
Any Questions? Automatic Question Detection in Meetings, Kofi Boakye, Benoit Favre and D. Hakkani-Tür, in: Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Merano, Italy, 2009.
 
Audio spatialisation strategies for multitasking during teleconferences, S. N. Wrigley, Simon Tucker, G. J. Brown and Steve Whittaker, in: Interspeech 2009, pages 2935-2938, 2009.
 
Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, Petr Motlicek, in: 10thAnnual Conference of the International Speech Communication Association, pages 1215-1218, ISCA, Brighton, England, 2009.
 
Automatic vs. human question answering over multimedia meeting recordings, Quoc Anh Le and Andrei Popescu-Belis, in: Proc. of 10th Annual Conference of the International Speech Communication Association, 2009.
 
Boosting Multi-Modal Camera Selection with Semantic Features, Benedikt Hörnler, D. Arsic, B. Schuller and Gerhard Rigoll, in: Proc. Int. Conf. on Multimedia & Expo, ICME 2009, New York, NY, USA, pages 1298-1301, IEEE, 2009.
 
Brno University of Technology System for Interspeech 2009 Emotion Challenge, Marcel Kockmann, Lukas Burget and Jan Černock\'y, in: Proc. Interspeech 2009, pages 348-351, International Speech Communication Association, Brighton, GB, 2009.
 
BUT system for NIST 2008 speaker recognition evaluation, Lukas Burget, Michal Fapso, Valiantsina Hubeika, Ondřej Glembek, Martin Karafiat, Marcel Kockmann, Pavel Matějka, Petr Schwarz and Jan Černock\'y, in: Proc. Interspeech 2009, pages 2335-2338, International Speech Communication Association, Brighton, GB, 2009.
 
Cancer Stage Interpretation System, Anthony N. Nguyen, Michael J. Lawley and David P. Hansen, number CSIRO ICT Centre Technical Report 09/118, 2009.
 
Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis, Ondřej Glembek, Lukas Burget, Najim Dehak, Niko Brümmer and Patrick Kenny, in: Proc. ICASSP 2009, pages 4, IEEE Signal Processing Society, Taipei, TW, 2009.
 
Discovering group nonverbal conversational patterns with topics, Dinesh Jayagopi and Daniel Gatica-Perez, in: ICMI-MLMI '09: Proceedings of the 2009 international conference on Multimodal interfaces, pages 3-6, ACM, Cambridge, Massachusetts, USA, 2009. [DOI]
 
Disfluency Classification and Correction with a Hybrid Machine Learning and Rule-based Approach, Sebastian Germesin, Saarland University, 2009.
 
Extrinsic summarization evaluation: A decision audit task, Gabriel Murray, Thomas Kleinbauer, Peter Poller, Tilman Becker, Steve Renals and Jonathan Kilgour, in: ACM Trans. Speech Lang. Process., volume 6, number 2, pages 1-29, ISSN 1550-4875, 2009.
 
'Girlfriends and Strawberry Jam': Tagging Memories, Experiences, and Events for Future Retrieval, Anton Nijholt, in: Proceedings 11th International Symposium on Social Communication, Santiago de Cuba, pages 765-768, Centre for Applied Linguistics, Santiago de Cuba, 2009. [DOI]
 
GMs in On-Line Handwritten Whiteboard Note Recognition: The Influence of Implementation and Modeling, Joachim Schenk, Benedikt Hörnler, B. Schuller, A. Braun and Gerhard Rigoll, in: Proc. of the Int. Conf. on Document Analysis and Recognition, ICDAR 2009, Barcelona, Spain, pages 877-880, IEEE, 2009.
 
Graphical Models For Multi-Modal Automatic Video Editing in Meetings, Benedikt Hörnler, D. Arsic, B. Schuller and Gerhard Rigoll, in: Proc. Intern. Conf. on Digital Signal Processing (DSP 2009), Santorini, Greece, IEEE, 2009.
 
Graphical Models: Statistical Inference Vs. Determination, Joachim Schenk, Benedikt Hörnler, A. Braun and Gerhard Rigoll, in: Proc. Int. Conf. on Acoustics, Speech, and Signal Processing, ICASSP 2009, Taipei, Taiwan, pages 1717-1720, IEEE, 2009.
 
Have a say over what you see: evaluating interactive compression techniques, Simon Tucker and Steve Whittaker, in: Proceedings of IUI, 2009.
 
Integrating Prosodic Features in Extractive Meeting Summarization, S. Xie, D. Hakkani-Tür and Gokhan Tur, in: Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Merano, Italy, 2009.
 
Investigating the Use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, Giulia Garau, Sileye O. Ba, Hervé Bourlard and J-M. Odobez, in: Proc. ACM Multimedia, 2009.
 
Investigation into bottle-neck features for meeting speech recognition, Frantisek Grezl, Martin Karafiat and Lukas Burget, in: Proc. Interspeech 2009, pages 2947-2950, International Speech Communication Association, Brighton, GB, 2009.
 
Investigation into variants of Joint Factor Analysis for speaker recognition, Lukas Burget, Pavel Matějka, Valiantsina Hubeika and Jan Černock\'y, in: Proc. Interspeech 2009, pages 1263-1266, International Speech Communication Association, Brighton, GB, 2009.
 
Learning Large Margin Likelihood for Realtime Head Pose Tracking, E. Ricci and J-M. Odobez, .
 
Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, Andrei Popescu-Belis, in: Multimodal Signal Processing for Human-Computer Interaction, pages 183-203, Elsevier / Academic Press, 2009.
 
Multi-Modal Activity and Dominance Detection in Smart Meeting Rooms, Benedikt Hörnler and Gerhard Rigoll, in: Proc. Int. Conf. on Acoustics, Speech, and Signal Processing, ICASSP 2009, Taipei, Taiwan, pages 1777-1780, IEEE, 2009.
 
Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech, G. Tur U. Guz, S. Cuendet and D. Hakkani-Tür, in: IEEE Transactions on Audio, Speech and Language Processing, 2009.
 
On-Line Object Behaviour Analysis for Surveillance Systems, Vítezslav Beran, Roman Juranek, Jozef Mlích, Pavel ák, Adam Herout and Pavel Zemčík, in: 10th Annual ICT Conference, pages 5, Nairobi,, 2009.
 
Overal performance metrics for multi-condition Speaker Recognition Evaluations, David van Leeuwen, in: Proc.Interspeech, pages 908-911, ISCA, 2009.
 
Posterior-based Out of Vocabulary Word Detection in Telephone Speech, Stefan Kombrink, Lukas Burget, Pavel Matějka, Martin Karafiat and Hynek Heřmansk\'y, in: Proc. Interspeech 2009, pages 80-83, International Speech Communication Association, Brighton, GB, 2009.
 
Predicting remote versus collocated group interactions using nonverbal cues, Dairazalia Sanchez-Cortes, Dinesh Jayagopi and Daniel Gatica-Perez, in: ICMI-MLMI '09: Proceedings of the ICMI-MLMI '09 Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing, pages 1-4, ACM, Cambridge, Massachusetts, 2009. [DOI]
 
Prosodic and other Long-Term Features for Speaker Diarization, Gerald Friedland, O. Vinyals, Yan Huang and C. Müller, in: IEEE Transactions on Audio, Speech, and Language Processing, 2009.
 
Prosodic and other Long-Term Features for Speaker Diarization, Y. Huang G. Friedland, O. Vinyals, in: IEEE Transactions on Audio, Speech, and Language Processing, volume 17, number 5, pages 985-993, 2009.
 
Prosodic Similarities of Dialog Act Boundaries Across Speaking Styles, J. Fung E. Shriberg, B. Favre and S. Cuendet, in: Linguistic Patterns in Spontaneous Speech, pages 213-239, 2009.
 
Real-Time ASR from Meetings, Philip N. Garner, John Dines, Thomas Hain, Asmaa El Hannani, Martin Karafiat, Danil Korchagin, Mike Lincoln, Vincent Wan and Le Zhang, 2009.
 
Real-Time ASR from Meetings, Philip N. Garner, John Dines, Thomas Hain, Asmaa Hannani El, Martin Karafiat, Danil Korchagin, Mike Lincoln, Vincent Wan and Le Zhang, in: Proc. Interspeech 2009, pages 2119-2122, International Speech Communication Association, Brighton, GB, 2009.
 
Recognizing Contextual Polarity: An exploration of features for phrase-level sentiment analysis, Theresa Wilson, Janyce Wiebe and Paul Hoffmann, in: Computational Linguistics, volume 35, number 3, pages 399-433, 2009.
 
Recognizing Human Visual Focus of Attention from Head Pose in Meetings, Sileye O. Ba and J-M. Odobez, in: IEEE Trans. on System, Man and Cybernetics: part B, Man, volume 39, number 1, pages 16-34, 2009.
 
Robust Speaker Diarization for Short Speech Recordings, David Imseng and G. Friedland, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, pages 432-437, Merano, Italy, 2009.
 
Robuste Analyse des Diskussionsstandes von Gruppenbesprechungen mit Hilfe eines wissensbasierten DiskursgedÃchtnisses, Sandro Castronovo, Saarland University, 2009.
 
Selecting Features in On-Line Handwritten Whiteboard Note Recognition: SFS or SFFS?, Joachim Schenk, M. Kaiser and Gerhard Rigoll, in: Proc. of the Int. Conf. on Document Analysis and Recognition, ICDAR 2009, Barcelona, Spain, pages 1251-1254, IEEE, 2009.
 
Speaker Identification and Diarization, Gerald Friedland and David van Leeuwen, IEEE Press/Wileys and Sons, 2009.
 
Speech overlap detection in a two-pass speaker diarization system, Marijn Huijbregts, David van Leeuwen and Franciska de Jong, in: Proc.Interspeech, ISCA, 2009.
 
Study of linear transformations applied to training of cross-domain adapted large vocabulary continuous speech recognition systems, Martin Karafiat, 2009.
 
The majority wins: a method for combining speaker diarization systems, Marijn Huijbregts, David van Leeuwen and Franciska de Jong, in: Proc.Interspeech, ISCA, 2009.
 
Trajectory classification using HMMs, Jozef Mlích, Pavel Zemčík and Leo Jiřík, in: WSCG 2009 Communication Papers, pages 67-72, University of West Bohemia in Pilsen, Plzeň, CZ, 2009.
 
Using the iCat as Avatar in Remote Meetings, D. K. J. Heylen, Mannes Poel and Anton Nijholt, in: Multimodal Signals: Cognitive and Algorithmic Issues, Vietri sul Mare, Italy, pages 60-66, Springer Verlag, Vietri sul Mare, Italy, 2009.
 
Visual Activity Context For Focus of Attention Estimation in Dynamic Meetings, Sileye O. Ba, Hayley Hung and J-M. Odobez, in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2009.
 
Visual Speaker Localization Aided by Acoustic Models, C. Yeo. H. Hung G. Friedland, in: ACM Multimedia, pages 195-202, Beijing, China, 2009.
 
Wiimote Gesture Recognition, Jozef Mlích, in: Proceedings of the 15th Conference and Competition STUDENT EEICT 2009 Volume 4, pages 344-349, Faculty of Electrical Engineering and Communication BUT, Brno, CZ, 2009.
 

2008

A generic layout-tool for summaries of meetings in a constraint-based approach, Sandro Castronovo, Jochen Frey and Peter Poller, in: 5th Joint Workshop on Machine Learning and Multimodal Interaction (MLMI 2008), pages 248-259, Springer, Heidelberg, TNO, Utrecht, 2008.
 
A Keyphrase Based Approach to Interactive Meeting Summarization, Korbinian Riedhammer, Benoit Favre and Dilek Hakkani Tür, in: Proc. 2nd IEEE/ACL Workshop on Spoken Language Technologies (SLT2008), Goa, India, pages 153-156, Goa, India, 2008. [DOI]
 
Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition, Oldřich Plchot, Valiantsina Hubeika, Lukas Burget, Petr Schwarz and Pavel Matějka, in: Lecture Notes in Computer Science, volume 2008, number 5246, pages 477-483, ISSN 0302-9743, 2008.
 
Adaptive Beamforming with a Maximum Negentropy Criterion, Kenichi Kumatani, John McDonough, Dietrich Klakow, Philip N. Garner and Weifeng Li, in: Proceedings of The Joint Workshop on Hands-free Speech Communication and Microphone Arrays, pages 180-183, 2008.
 
Advances in Phonotactic Language Recognition, Ondřej Glembek, Pavel Matějka, Lukas Burget and Tomá Mikolov, in: Proc. Interspeech 2008, pages 4, International Speech Communication Association, Brisbane, AU, 2008.
 
Annotating Subjective Content In Meetings, Theresa Wilson, in: Proceedings of the Language Resources and Evaluation Conference, Springer, LREC-2008, Marrakech, Maroc, 2008.
 
Annotations and Subjective Machines of annotators, embodied agents, users, and other humans, Dennis Reidsma, University of Twente, 2008.
 
Associating Audio-Visual Activity Cues in a Dominance Estimation Framework, Hayley Hung, Yan Huang, Chuohao Yeo and Daniel Gatica-Perez, in: Computer Vision and Pattern Recognition Workshop on Human Communicative Behaviour, Ankorage, Alaska, 2008.
 
Automatic Speech Recognition for Scientific Purposes - webASR, Thomas Hain, Asmaa El Hannani, Stuart Wrigley and Vincent Wan, in: In proc. Interspeech, 2008.
 
Automatic Video Editing for Multimodal Meetings, Adam Herout, Radek Kubíček, Pavel Zemčík and Pavel ák, in: Proceedings of International Conference on Computer Vision and Graphics 2008, pages 1-12, Springer Verlag, Heidelberg, DE, 2008.
 
Bob: A Lexicon and Pronunciaiton Dictionary Generator, Vincent Wan, John Dines, Asmaa El Hannani and Thomas Hain, in: Proc. IEEE Workshop on Spoken Language Technology, 2008, 2008.
 
Body-Part Templates for Recovery of 2D Human Poses under Occlusion, Ronald Poppe and Mannes Poel, in: International Workshop on Articulated Motion and Deformable Objects (AMDO'08), pages 289-298, Springer-Verlag, 2008.
 
Brno University of Technology at TRECVid 2008, Petr Chmelař, Vítezslav Beran, Adam Herout, Michal Hradis, Roman Juranek, Ale Láník, Jozef Mlích, Jan Navrátil, Ivo Řezníček, Pavel ák and Pavel Zemčík, in: Proceedings of TRECVID 2008, pages 1-16, National Institute of Standards and Technology, Gaithersburg, US, 2008.
 
BUT language recognition system for NIST 2007 evaluations, Pavel Matějka, Lukas Burget, Ondřej Glembek, Petr Schwarz, Valiantsina Hubeika, Michal Fapso, Tomá Mikolov, Oldřich Plchot and Jan Černock\'y, in: Proc. Interspeech 2008, pages 4, International Speech Communication Association, Brisbane, Australia, AU, 2008.
 
BUT system description: NIST SRE 2008, Lukas Burget, Michal Fapso, Valiantsina Hubeika, Ondřej Glembek, Martin Karafiat, Marcel Kockmann, Pavel Matějka, Petr Schwarz and Jan Černock\'y, in: Proc. 2008 NIST Speaker Recognition Evaluation Workshop, pages 1-4, National Institute of Standards and Technology, Montreal, CA, 2008.
 
Combination of strongly and weakly constrained recognizers for reliable detection of OOVs, Lukas Burget, Petr Schwarz, Pavel Matějka, Mirko Hannemann, Ariya Rastrow, Christopher White, Sanjeev Khudanpur, Hynek Heřmansk\'y and Jan Černock\'y, in: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pages 4, IEEE Signal Processing Society, Las Vegas, US, 2008.
 
Combining Spectral Representations for Large Vocabulary Continuous Speech Recognition, Giulia Garau and Steve Renals, in: IEEE Transactions on Audio Speech and Language Processing, volume 16-3, pages 508-518, 2008.
 
Comparing word, character, and phoneme n-grams for subjective utterance recognition, Theresa Wilson and Stephan Raaijmakers, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Contour modeling of prosodic and acoustic features for speaker recognition, Marcel Kockmann and Lukas Burget, in: Proc. 2008 IEEE Workshop on Spoken Language Technology, pages 4, IEEE Signal Processing Society, Goa, IN, 2008.
 
Dealing with Uncertainty in Microphone Placement in a Microphone Array Speech Recognition System, I. Himawan, S. Sridharan and I. McCowan, in: Proceedings of 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE Signal Processing Society, Nevada, US, 2008.
 
Decision-Level Fusion for Audio-Visual Laughter Detection, B. Reuderink, Mannes Poel, K. P. Truong, Ronald Poppe and Maja Pantic, in: 5th Joint Workshop on Machine Learning and Multimodal Interaction, MLMI 2008, pages 137-148, Springer Verlag, 2008.
 
Design and Evaluation of Systems to Support Interaction Capture and Retrieval, Steve Whittaker, Simon Tucker, K. Swampillai and Rachel Laban, in: Personal and Ubiquitous Computing, volume 12, number 3, pages 197-221, 2008.
 
Designing Awareness Support for Distributed Cooperative Design Teams, Dhaval Vyas, Dirk Heylen and Anton Nijholt, in: 15th European Conference on Cognitive Ergonomics, pages 23-26, ACM, 2008.
 
Detecting Uncertainty in Spoken Dialogues: An explorative research to the automatic detection of speakers uncertainty by using prosodic markers, Jeroen Dral, Dirk Heylen and Rieks op den Akker, in: Sentiment analysis: emotion, metaphor, ontology and terminology, pages 72-78, Marrakech, Marocco, 2008. [DOI]
 
Determining Latency for on-line Dialog Act Classification, Sebastian Germesin, in: MLMI'08, 2008.
 
Discriminative human action recognition using pairwise CSP classifiers, Ronald Poppe and Mannes Poel, in: 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008.
 
Discriminative Training and Channel Compensation for Acoustic Language Recognition, Valiantsina Hubeika, Lukas Burget, Pavel Matějka and Petr Schwarz, in: Proc. Interspeech 2008, pages 4, International Speech Communication Association, Brisbane, AU, 2008.
 
Discrimininative training of narrow band - wide band adaptated systems for meeting recognition, Martin Karafiat, Lukas Burget, Thomas Hain and Jan Černock\'y, in: Proc. Interspeech 2008, pages 4, International Speech Communication Association, Brisbane, AU, 2008.
 
Domain-specific Classification Methods for Disfluency Detection, Sebastian Germesin, Tilman Becker and Peter Poller, in: Interspeech 2008, 2008.
 
Effect of sound spatialisation on multitasking in remote meetings, S. N. Wrigley, Simon Tucker, G. J. Brown and Steve Whittaker, in: Proceedings of Acoustics'08, 2008.
 
Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies, Hayley Hung, Yan Huang, Gerald Friedland and Daniel Gatica-Perez, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
 
Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies, Yan Huang, Gerald Friedland, Daniel Gatica-Perez and Huang Hung, in: Proceedings of IEEE ICASSP, pages 2197-2200, Las Vegas, NV, USA, 2008.
 
Exploiting `Subjective' Annotations, Dennis Reidsma and H. J. A. op den Akker, in: Coling 2008: Proceedings of the workshop on Human Judgements in Computational Linguistics, pages 8-16, Coling 2008 Organizing Committee, 2008.
 
Exploring Features and Classifiers for Dialogue Act Segmentation, C. Schulz and H. op den Akker, in: Machine Learning for Multimodal Interaction, MLMI 2008, Utrecht, the Netherlands, pages 196-207, Springer Verlag, Utrecht, the Netherlands, 2008.
 
Exploring Mediated Interactions: A Design Exercise, Dhaval Vyas, Yang Liu and Anton Nijholt, in: 15th European Conference on Cognitive Ergonomics, pages 1-2, ACM, 2008.
 
Extrinsic Summarization Evaluation: A Decision Audit Task, Gabriel Murray, Thomas Kleinbauer, Peter Poller, Steve Renals, Jonathan Kilgour and Tilman Becker, in: Machine Learning for Multimodal Interaction - 5th International Workshop, MLMI 2008, pages 349-361, 2008.
 
Filter Bank Design Based on Minimization of Individual Aliasing Terms for Minimum Mutual Information Subband Adaptive Beamforming, Kenichi Kumatani, John McDonough, Stefan Schacht, Dietrich Klakow, Philip N. Garner and Weifeng Li, in: Proceedings International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), pages 1609-1612, IEEE, 2008.
 
How do I address you? Modelling addressing behaviour based on an analysis of multi-modal corpora of conversational discourse, Rieks op den Akker and Mariet Theune, in: AISB 2008 Symposium on Multimodal Output Generation (MOG 2008), pages 10-17, Aberdeen, UK, 2008.
 
Hybrid Multi-Step Disfluency Detection, Sebastian Germesin, Tilman Becker and Peter Poller, in: MLMI'08, 2008.
 
Hybrid word-subword decoding for spoken term detection, Igor Szoke, Michal Fapso, Lukas Burget and Jan Černock\'y, in: Proc. SSCS 2008: Speech search workshop at SIGIR, pages 4, Association for Computing Machinery, Singapore, SG, 2008.
 
Hybrid word-subword decoding for spoken term detection, Igor Szoke, Michal Fapso, Lukas Burget and Jan Černock\'y, in: Proc. SSCS 2008: Speech search workshop at SIGIR, pages 4, Association for Computing Machinery, Singapore, SG, 2008.
 
Identifying Dominant People in Meetings from Audio-Visual Sensors, Hayley Hung and Daniel Gatica-Perez, in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition (FG), Special Session on Multi-Sensor HCI for Smart Environments, Amsterdam, 2008.
 
Interpretation of Multiparty Meetings: The AMI and AMIDA Projects, Steve Renals, Thomas Hain and Hervé Bourlard, in: HSCMA 2008 (6-8 May), pages 115-118, Trento-Italy, 2008.
 
Investigating Automatic Dominance Estimation in Groups from Visual Attention and Speaking Activity, Hayley Hung, Dinesh Jayagopi, Sileye O. Ba, J-M. Odobez and Daniel Gatica-Perez, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), Chania, 2008.
 
Juicer and Tracter software release, Philip N. Garner, Darren Moore, Octavian Cheng, John Dines and Danil Korchagin, 2008.
 
Live Speaker Identification in Conversation, G. Friedland and O. Vinyals, in: Proceedings of the ACM International Conference on Multimedia, pages 1017-1018, Vancouver, BC, Canada, 2008.
 
Machine Learning for Multimodal Interaction V, Andrei Popescu-Belis and Rainer Stiefelhagen, LNCS, volume 5237, Springer-Verlag, ISBN 978-3-540-85852-2, 2008. [DOI]
 
Making remote meeting hopping work: assistance to initiate, join and leave meetings, A. H. M. Cremers, Maaike Duistermaat, P Groenewegen and Jacomien de Jong, in: 5th Joint Workshop on Machine Learning and Multimodal Interaction (MLMI 2008), pages 315-324, 2008.
 
Maximum kurtosis beamforming with the generalized sidelobe canceller, Kenichi Kumatani, John McDonough, Barbara Rauch, Philip N. Garner, Weifeng Li and John Dines, in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2008.
 
Meeting Behavior Detection in Smart Environments: Nonverbal Cues that Help to Obtain Natural Interaction, Mannes Poel, Ronald Poppe and Anton Nijholt, in: Proceedings 8th IEEE International Conference on Automatic face and Gesture Recognition (FG 2008), pages 1-6, IEEE Computer Society Press, 2008.
 
Microphone Array Calibration in Diffuse Noise Fields, I. McCowan, Mike Lincoln and I. Himawan, in: IEEE Transactions on Audio, Speech and Language Processing, 2008.
 
Microphone Array Shape Calibration in Diffuse Noise Fields, I. McCowan, Mike Lincoln and I. Himawan, in: IEEE Transactions on Audio, Speech and Language Processing, volume 16-3, pages 666-670, ISSN 1558-7916, 2008.
 
Modeling dominance in group conversations from non-verbal activity cues, Dinesh Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, in: IEEE Transactions on Audio, Speech and Language Processing, accepted for publication, 2008.
 
Modulation Spectrogram Features for Speaker Diarization, O. Vinyals and Gerald Friedland, in: Proceedings of the 9th International Conference of the ISCA, pages 630-633, Interspeech 2008, Brisbane, Australia, 2008.
 
Multimedia Information Extraction Roadmap, Greg Myers, Gokhan Tur, Lynn Voss, Bob Bolles, Sachin Kajarekar, Elizabeth Shriberg and Dilek Hakkani Tür, in: Proceedings of the AAAI Fall Symposium on Multimedia Information Extraction, Association for the Advancement of Artificial Intelligence, Arlington, Virginia, 2008.
 
Multi-modal Speaker Diarization of Real-World Meetings Using Compressed-Domain Video Features, Gerald Friedland, Chuohao Yeo and Huang Hung, number 08-2007, 2008.
 
Multimodal Subjectivity Analysis of Multiparty Conversation, Stephan Raaijmakers, K. P. Truong and Theresa Wilson, in: Proceedings of EMNLP, 2008.
 
Multi-party focus of attention recognition in meetings from head pose and multimodal contextual cues, Sileye O. Ba and J-M. Odobez, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
 
Mutually Coordinated Anticipatory Multimodal Interaction, Anton Nijholt, Dennis Reidsma, H. van Welbergen, H. J. A. op den Akker and Z. Ruttkay, in: Nonverbal Features of Human-Human and Human-Machine Interaction, 29-31 October 2007, pages 73-93, Springer Verlag, Berlin, Patras, Greece, 2008.
 
On the Contextual Analysis of Agreement Scores, Dennis Reidsma, Dirk Heylen and H. J. A. op den Akker, in: Proceedings of the LREC Workshop on Multimodal Corpora, pages 52-55, ELRA, ELRA, Marrakech, Morrocco, 2008.
 
Optimizing Bottle-Neck Features for LVCSR, Frantisek Grezl and Petr Fousek, pages 4729-4732, IEEE Signal Processing Society, 2008.
 
Overlapped Speech Detection for Improved Speaker Diarization in Multiparty Meetings, Kofi Boakye, B. Trueba-Hornero, O. Vinyals and Gerald Friedland, in: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
 
Overlapped Speech Detection for Improved Speaker Diarization in Multiparty Meetings, Kofi Boakye, Beatriz Trueba-Hornero, Oriol Vinyals and Gerald Friedland, in: Proceedings of IEEE ICASSP, pages 4353-4356, Las Vegas, NV, USA, 2008.
 
Packing the Meeting Summarization Knapsack, K. Riedhammer, D. Gillick, B. Favre and D. Hakkani-Tür, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Physicality and Cooperative Design, Vyas Dhaval and Dirk Heylen, in: 5th Joint Workshop on Machine Learning, Springer LNCS proceedings, MLMI08, 5th Joint Workshop on Machine Learning, Utrecht, The Netherlands, 2008.
 
Predicting the Dominant Clique in Meetings through Fusion of Nonverbal Cues, Dinesh Jayagopi, Hayley Hung, Chuohao Yeo and Daniel Gatica-Perez, in: Proc. ACM Int. Conf. on Multimedia (MM), pages 809-812, ACM New York, NY, USA, Vancouver, 2008. [DOI]
 
Predicting Two Facets of Social Verticality in Meetings from Five-minute Time Slices and Nonverbal Cues, Dinesh Jayagopi, Sileye O. Ba, J-M. Odobez and Daniel Gatica-Perez, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), Special Session on Social Signal Processing, Chania, 2008.
 
Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects, Hervé Bourlard and Steve Renals, number 27, 2008.
 
Recognition of Dialogue Acts in Multiparty Meetings using a Switching DBN, Alfred Dielmann and Steve Renals, in: In IEEE Transactions on Audio, Speech and Language Processing, volume 16-5, pages 1303-1314, ISSN 1558-7916, 2008.
 
Reliability measurement without limits, Dennis Reidsma and Jean Carletta, in: Computational Linguistics, volume 34, number 3, pages 319-326, ISSN 0891-2017, 2008. [DOI]
 
Role recognition for meeting participants: an approach based on lexical information and social network analysis, Neha P. Garg, Sarah Favre, Hugues Salamin, Dilek Hakkani Tür and Alessandro Vinciarelli, in: MM '08: Proceeding of the 16th ACM international conference on Multimedia, pages 693-696, ACM, Vancouver, British Columbia, Canada, 2008. [DOI]
 
Silence Models in Weighted Finite-State Transducers, Philip N. Garner, in: Interspeech, Brisbane, Australia, 2008.
 
Sub-word modeling of out of vocabulary words in spoken term detection, Igor Szoke, Lukas Burget, Jan Černock\'y and Michal Fapso, in: Proc. 2008 IEEE Workshop on Spoken Language Technology, pages 4, IEEE Signal Processing Society, Goa, IN, 2008.
 
Syllable based Feature-Contours for Speaker Recognition, Marcel Kockmann and Lukas Burget, in: Proc. 14th International Workshop on Advances in Speech Technology, pages 4, Maribor, SI, 2008.
 
Temporal Compression Of Speech: An Evaluation, Simon Tucker and Steve Whittaker, in: IEEE Transactions on Audio, Speech and Language Processing, 2008.
 
The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings, Andrei Popescu-Belis, Erik Boertjes, Jonathan Kilgour, Peter Poller, Sandro Castronovo, Theresa Wilson, Alejandro Jaimes, Jean Carletta and Rainer Stiefelhagen, in: Machine Learning for Multimodal Interaction, pages 272-283, Springer, TNO, Utrecht, 2008.
 
The influence of audio presentation style on multitasking during teleconferences, Stuart Wrigley, Simon Tucker, G. J. Brown and Steve Whittaker, in: Interspeech 2008, pages 801-804, 2008.
 
Time-Compressing Speech: ASR Transcripts are an Effective Way to Support Gist Extraction, Simon Tucker, N. Kyprianou and Steve Whittaker, in: 5th Joint Workshop on Machine Learning and Multimodal Interaction (MLMI 2008), Utrecht, The Netherlands, 2008.
 
Towards an Objective Test for Meeting Browsers: the BET4TQB Pilot Experiment, Andrei Popescu-Belis, Philippe Baudrion, Mike Flynn and Pierre Wellner, Lecture Notes in Computer Science, volume LNCS 4892/2008, pages 108-119, Springer Verlag, ISBN 978-3-540-78154-7, 2008. [DOI]
 
Towards Audio-Visual On-line Diarization Of Participants In Group Meetings, Huang Hung and Gerald Friedland, in: Workshop Proceedings of the European Conference on Computer Vision (ECCV), Marseille, France, 2008.
 
Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings, O. Vinyals and Gerald Friedland, in: Proceedings of IEEE International Conference on Semantic Computing, pages 426-431, Santa Clara, CA, 2008.
 
Tracking the Visual Focus of Attention for a Varying Number of Wandering People, Kevin Smith, Sileye O. Ba, Daniel Gatica-Perez and J-M. Odobez, in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 2008.
 
TRECVID 2007 by the Brno Group, Adam Herout, Vítezslav Beran, Michal Hradis, Igor Potúček, Pavel Zemčík and Petr Chmelař, in: Proceedings of TRECVID 2007, pages 1-6, National Institute of Standards and Technology, Gaithersburg, US, 2008.
 
Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech, Kofi Boakye, O. Vinyals and Gerald Friedland, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Visual Focus of Attention Estimation from Head Pose Posterior Probability Distributions, Sileye O. Ba and J-M. Odobez, in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2008.
 

2007

A Approach for Robust, faster than Real-Time Speaker Diarization, Yan Huang, O. Vinyals, Gerald Friedland, C. Müller, Nikki Mirghafori and Chuck Wooters, in: Proceedings of IEEE ASRU, pages 693-698, 2007.
 
A Cognitive and Unsupervised MAP Adaptation approach to the Recognition of the Focus of Attention from Head Pose, J-M. Odobez and Sileye O. Ba, in: Proc. of the IEEE International Conference on Multimedia and Expo (ICME'07), 2007.
 
A Comprehensive Disfluency Model for Multi-Party Interaction, Jana Besser and Jan Alexandersson, in: Proceedings of the 8th SIGDial Workshop on Discourse and Dialogue, 2007.
 
A Microphone Array Beamforming Approach to Blind Speech Separation, I. McCowan, I. Himawan and Mike Lincoln, in: Machine Learning for Multimodal Interaction IV, pages 294-304, Springer, 2007.
 
Accuracy of head orientation perception in triadic situations: Experiment in a virtual environment, Ronald Poppe, Rutger Rienks and Dirk Heylen, in: Perception, number 36(7):971-979, pages 971-979, ISSN 0301-0066, 2007.
 
Adaboost Engine, Pavel Zemcik and Martin Zadnik, in: International Conference on Field Programmable Logic and Applications, FPL 2007, pages 656-660, IEEE Computer Society, Amsterdam, NL, 2007. [DOI]
 
Adaptive Beamforming with a Minimum Mutual Information Criterion, Kenichi Kumatani, Tobias Gehrig, Uwe Mayer, Emilian Stoimenov, John McDonough and Matthias Wolfel, in: IEEE Trans. Audio, Speech and Language Processing, volume 15, pages 2527-2541, 2007.
 
AMI/DA STT and SASTT 2007, Thomas Hain, Lukas Burget, Martin Karafiat, John Dines, David van Leeuwen, Giulia Garau, Mike Lincoln and Vincent Wan, in: Proceedings of the RT07 Workshop 2007, 2007.
 
An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings, S. Cuendet, Elizabeth Shriberg, B. Favre, J. Fung and D. Hakkani-Tür, in: SIGIR Workshop on Searching Conversational Spontaneous Speech, 2007.
 
Analysis of feature extraction and channel compensation in GMM speaker recognition system, Lukas Burget, Pavel Matejka, Petr Schwarz, Ondřej Glembek and Jan Cernocky, in: IEEE Transactions on Audio, Speech, and Language Processing, volume 15, number 7, pages 1979-1986, ISSN 1558-7916, 2007.
 
Application of CMLLR in narrow band wide band adapted systems, Martin Karafiat, Lukas Burget, Thomas Hain and Jan Cernocky, in: 8th Annual Conference of the International Speech Communication Association, pages 4, International Speech Communication Association, Antwerp, Belgium, 2007.
 
Artefact Ecologies: Supporting Embodied Meeting Practices with Distance Access, Dhaval Vyas and Anne Bajart, in: Proceedings of UbiComp (Ubiquitous Computing) 2007 Workshops, pages 117-122, Ubicomp, University of Innsbruck, Austria, 2007.
 
Audio-based unsupervised segmentation of multiparty dialogue, Pei-Yun Hsueh, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2008 (ICASSP 2008), pages 5049-5052, Las Vegas, Nevada, USA, 2007. [DOI]
 
Audio-Visual Probabilistic Tracking of Multiple Speakers in Meetings, Daniel Gatica-Perez, Guillaume Lathoud, J-M. Odobez and I. McCowan, in: IEEE Trans. on Audio, Speech, and Language Processing, volume 15, number 5, pages 1696-1710, 2007.
 
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers, Marc Al-Hames, Thomas Hain, Jan Cernocky, Sascha Schreiber, Mannes Poel, Ronald Müller, Sebastien Marcel, David van Leeuwen, J-M. Odobez, Sileye O. Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlicek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith, Andrew Thean and Pavel Zemcik, in: MLMI 2006, 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, pages 24-35, Springer, 2007.
 
Automatic Decision Detection in Meeting Speech, Pei-Yun Hsueh and Johanna D. Moore, in: Machine Learning for Multimodal Interaction IV, Springer, 2007.
 
Automatic Dialogue Act Recognition using a Dynamic Bayesian Network, Alfred Dielmann and Steve Renals, in: Proc. Multimodal Interaction and Related Machine Learning Algorithms Workshop (MLMI--06), pages 178-189, Springer, 2007.
 
Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech, S. Cuendet, D. Hakkani-Tür and Elizabeth Shriberg, in: Proceedings of MLMI, 2007.
 
Automatic Laughter Detection Using Neural Networks, M. Knox and Nikki Mirghafori, in: Interspeech, 2007.
 
Automatic Meeting Segmentation using Dynamic Bayesian Networks, Alfred Dielmann and Steve Renals, in: IEEE Transactions on Multimedia, volume 9, number 1, pages 25-36, 2007.
 
Automatic meeting segmentation using dynamic Bayesian networks, Alfred Dielmann and Steve Renals, in: IEEE Transactions on Multimedia, volume 9, number 1, pages 25-36, 2007. [DOI]
 
Automatic Multi-Modal Meeting Camera Selection for Video-Conferences and Meeting Browsing, Marc Al-Hames, Benedikt Hörnler, Ronald Müller, Joachim Schenk and Gerhard Rigoll, in: Proceedings of the 8th International Conference on Multimedia and Expo (ICME), 2007.
 
Automatic Segmentation and Summarization of Meeting Speech, Gabriel Murray, Pei-Yun Hsueh, Simon Tucker, J. Kilgour, Jean Carletta, Johanna D. Moore and Steve Renals, in: Proc. of NAACL/HLT, 2007.
 
Binaural speech separation using recurrent timing neural networks for joint F0-localisation estimation, S. N. Wrigley and G. J. Brown, Lecture Notes in Computer Science, volume LNCS 4892, pages 271-282, Springer Berlin, ISBN 978-3-540-78154-7, 2007. [DOI]
 
Can unquantised articlatory feature continuums be modelled?, Odette Scharenborg and Vincent Wan, in: Proc Interspeech 2007, 2007.
 
Challenges for Virtual Humans in Human Computing, Dennis Reidsma, Z. Ruttkay and Anton Nijholt, volume LNAI State-of-th, pages 316-338, Springer Verlag, 2007.
 
Combining Multiple Information Layers for the Automatic Generation of Indicative Meeting Abstracts, Thomas Kleinbauer, Stephanie Becker and Tilman Becker, in: 11th European Workshop on Natural Language Generation (ENLG07), 2007.
 
Combining Multiple Knowledge Sources for Dialogue Segmentation in Multimedia Archives., Pei-Yun Hsueh and Johanna D. Moore, in: Proceedings of the 45th Annual Meeting of the ACL, Association for Computational Linguistics, 2007.
 
Constraint-basierte Generierung parametrisierbarer, multimodaler Comic-Layouts für verlaufsorientierte Meeting-Zusammenfassungen, Jochen Frey, University of Saarbrücken, 2007.
 
Cross-Genre Feature Comparisons for Spoken Sentence Segmentation, S. Cuendet, D. Hakkani-Tür, Elizabeth Shriberg, J. Fung and B. Favre, in: International Conference on Semantic Computing (ICSC), 2007.
 
DBN based joint dialogue act recognition of multiparty meetings, Alfred Dielmann and Steve Renals, in: Proc IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP '07), 2007.
 
DEMO: Automatic Decision Detection in Meeting Speech, Pei-Yun Hsueh, J. Kilgour, Jean Carletta, Johanna D. Moore and Steve Renals, in: Proc. MLMI, 2007.
 
Evaluating Example-based Pose Estimation: Experiments on the HumanEva Sets, Ronald Poppe, in: Online Proceedings of the Workshop on Evaluation of Articulated Human Motion and Pose Estimation (EHuM) at the International Conference on Computer Vision and Pattern Recognition (CVPR), number TR-CTIT-07-72, pages 1-8, 2007.
 
Evaluating Meeting Support Tools, Wilfried Post, M. A. A. HuisIntVeld and S.A.A. van den Boogaard, in: Personal and Ubiquitous Computing, 2007.
 
Evaluating the Future of HCI: Challenges for the Evaluation of Emerging Applications, Ronald Poppe, Rutger Rienks and Betsy van Dijk, Lecture Notes in Artificial Intelligence, pages 234-250, Springer Verlag, ISBN ISBN=3-540-72346-2, 2007.
 
Evaluating the Future of HCI: Challenges for the Evaluation of Upcoming Applications, Ronald Poppe and Rutger Rienks, in: Proceedings of the International Workshop on Artificial Intelligence for Human Computing at the International Joint Conference on Artificial Intelligence IJCAI'07, pages 89-96, IJCAI, 2007.
 
Evaluation and comparison of tracking methods using meeting omnidirectional images, Igor Potucek, Vítezslav Beran, Stanislav Sumec and Pavel Zemcik, in: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 12, Brno, CZ, 2007.
 
Evaluation of Automatic Video Editing, Stanislav Sumec and Igor Potucek, in: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 12, Brno, CZ, 2007.
 
Experiencing-in-the-World : Using Pragmatist Philosophy to Design for Aesthetic Experience, Dhaval Vyas, Dirk Heylen, Anton Eliens and Anton Nijholt, in: Proceedings of the 3rd International Conference on Designing for Users Experience, pages 1-16, ACM Press, 2007.
 
Experiencing-in-the-World: Using Pragmatist Philosophy to Design for Aesthetic Experience, Dhaval Vyas, Dirk Heylen, Anton Nijholt and Anton Eliens, pages 16, 2007.
 
Experimental Comparison of Multimodal Meeting Browsers, Wilfried Post, E. Elling, A. H. M. Cremers and Wessel Kraaij, in: HCII 2007, Beijing, China, 2007.
 
Exploring Contextual Information in a Layered Framework for Group Action Recognition, Dong Zhang and Samy Bengio, in: 2007 IEEE International Conference on Multimedia and Expo, pages 2022-2025, Beijing, 2007. [DOI]
 
Feedback loops in communication and human computing, H. J. A. op den Akker and Dirk Heylen, in: Artificial Intelligence for Human Computing, pages 215-447, Springer Verlag, 2007.
 
Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections, Marijn Huijbregts, Chuck Wooters and R.J.F Ordelman, in: Proceedings of Interspeech 2007, pages 4, International Speech Communication Association, 2007.
 
Finding Maximum Margin Segments in Speech, Yago Pereiro Estevan, Vincent Wan and Odette Scharenborg, in: Proceedings of the 32nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), 2007.
 
Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006, Niko Brümmer, Lukas Burget, Jan Cernocky, Ondřej Glembek, Frantisek Grezl, Martin Karafiat, David van Leeuwen, Pavel Matejka, Petr Schwarz and Albeert Strasheim, in: IEEE Transactions on Audio, Speech, and Language Processing, volume 15, number 7, pages 2072-2084, ISSN 1558-7916, 2007. [DOI]
 
Hardware Acceleration of AdaBoost Classifier, Jiri Granat, Adam Herout, Michal Hradis and Pavel Zemcik, in: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 1-12, Brno, CZ, 2007.
 
Hidden Conditional Random Fields for Meeting Segmentation, Stephan Reiter, B. Schuller and Gerhard Rigoll, in: Proceedings of the 8th IEEE International Conference on Multimedia and Expo (ICME 2007), 2007.
 
Hierarchical Pitman-Yor Language Models for ASR in Meetings, Songfang Huang and Steve Renals, in: Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on, Kyoto, pages 124-129, 2007.
 
Hierarchical Pitman-Yor language models for ASR in meetings, Songfang Huang and Steve Renals, in: Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU '07), 2007.
 
Human-Centered Computing: Toward a Human Revolution, Alejandro Jaimes, Daniel Gatica-Perez, Nicu Sebe and Thomas Huang, number 57, 2007.
 
Indicative Abstractive Summaries of Meetings, Thomas Kleinbauer, Stephanie Becker and Tilman Becker, in: Proceedings of MLMI 2007., 2007.
 
Machine Understanding of Human Behavior, Maja Pantic, Alex Pentland, Anton Nijholt and Thomas Huang, in: Proceedings AI for Human Computing (AI4HC'07), workshop at IJCAI 2007,20th International Joint Conference on Artificial Intelligence, pages 13-24, IJCAI, Hyderabad, India, 2007.
 
Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System, Valiantsina Hubeika, Igor Szoke, Lukas Burget and Jan Cernocky, in: Lecture Notes in Computer Science, volume 4629/2007, number 9, pages 496-501, ISSN 0302-9743, 2007. [DOI]
 
Meetings in smart environments. Implications of progressing technology, Rutger Rienks, University of Twente, 2007.
 
Meetings in Smart Environments; Implications of Progressing Technology, Rutger Rienks, University of Twente Repository, 2007.
 
Microphone Array Beamforming Approach to Blind Speech Separation, I. Himawan, I. McCowan and Mike Lincoln, in: Proceedings of the 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 07), pages 295-305, Springer-Verlag, 2007. [DOI]
 
Microphone Array Beamforming Approach to Blind Speech Separation, I. Himawan, I. McCowan and Mike Lincoln, Springer-Verlag, 2007.
 
Minimum Mutual Information Beamforming for Simultaneous Active Speakers, Kenichi Kumatani, John McDonough, Uwe Mayer, Tobias Gehrig, Emilian Stoimenov and Matthias Wolfel, in: IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pages 71-76, IEEE, 2007. [DOI]
 
Modeling Prosodic Features in Language Models for Meetings, Songfang Huang and Steve Renals, in: Machine Learning for Multimodal Interaction IV, pages 191-202, Springer, 2007.
 
Multimodal Human Computer Interaction: A Survey, Alejandro Jaimes and Nicu Sebe, in: Computer Vision and Image Understanding, volume 108, number 1-2, pages 116-134, 2007.
 
Mutaphrase: Paraphrasing with FrameNet, Michael Ellsworth and Adam Janin, in: Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, pages 143-150, Association for Computational Linguistics, 2007.
 
Optimizing Bottle-Neck Features for LVCSR, Frantisek Grezl and Petr Fousek, pages 4729-4732, IEEE Signal Processing Society, 2007.
 
Parameterisierbares Layout inhaltsorientierter, multimodaler Zusammenfassungen von Meetings anhand der Zeitungsmetapher in einem Constraint-basierten Ansatz, Benjamin Lang, University of Saarbrücken, 2007.
 
Platform for Evaluation of Image Classifiers, Jana Silhava, Vítezslav Beran, Petr Chmelar, Adam Herout, Michal Hradis, Roman Juranek and Pavel Zemcik, in: Spring Conference on Computer Graphics, pages 103-109, Comenius University in Bratislava, Budmerice, SK, 2007.
 
Probabilistic and bottle-neck features for LVCSR of meetings, Frantisek Grezl, Martin Karafiat, Stanislav Kontar and Jan Cernocky, in: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), pages 757-760, IEEE Signal Processing Society, Hononulu, US, 2007. [DOI]
 
Probabilistic Head Pose Tracking Evaluation in Single and Multiple Camera Setups, Sileye O. Ba and J-M. Odobez, in: Proc. second Workshop on Classification of Events, Activities and Relationships (CLEAR'07), 2007.
 
QA with Attitude: Exploiting Opinion Type Analysis for Improving Question Answering in On-line Discussions and the News, Swapna Somasundaran, Theresa Wilson, Janyce Wiebe and Veselin Stoyanov, in: Proceedings of the International Conference on Weblogs and Social Media, 2007.
 
Recognition and interpretation of meetings: The AMI and AMIDA projects, Steve Renals, Thomas Hain and Hervé Bourlard, in: Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU '07), 2007.
 
Recognition and Understanding of Meetings: The AMI and AMIDA Projects, Steve Renals, Thomas Hain and Hervé Bourlard, in: IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pages 238-247, Kyoto, 2007. [DOI]
 
Recording, Indexing, Summarizing, and Accessing Meeting Videos: An Overview of the AMI Project, Alejandro Jaimes, Hervé Bourlard, Steve Renals and Jean Carletta, in: 4th Intl. Conference on Image Analysis and Processing (ICIAP 2007) and Workshop on Visual and Multimedia Digital Libraries (VMDL 2007), pages 59-64, 2007.
 
Recurrent timing neural networks for joint F0-localisation based speech separation, S. N. Wrigley and G. J. Brown, in: Proceedings of the 32nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), pages 157-160, Honolulu, Hawai'i, 2007.
 
Robust and Rapid Speaker Diarization, Yan Huang, University of California, Berkeley, 2007.
 
Robust Multi-Modal Group Action Recognition in Meetigs from Disturbed Videos with the Asynchronous Hidden Markov Model, Marc Al-Hames, Claus Lenz, Stephan Reiter, J. Wallhoff, Joachim Schenk and Gerhard Rigoll, in: IEEE International Conference on Image Processing, 2007 (ICIP 2007), pages II:213-216, San Antonio, TX, 2007. [DOI]
 
Robust Multi-Modal Group Action Recognition in Meetings from Disturbed Videos with the Asynchronous Hidden Markov Model, Marc Al-Hames, Claus Lenz, Stephan Reiter, Joachim Schenk, F. Wallhoff and Gerhard Rigoll, in: Proceedings of the International Conference on Image Processing (ICIP), 2007.
 
Search in speech for public security and defense, Jan Cernocky, Igor Szoke, Michal Fapso, Martin Karafiat, Lukas Burget, Jiri Kopecky, Frantisek Grezl, Petr Schwarz, Ondřej Glembek, Ilya Oparin, Pavel Smrz and Pavel Matejka, in: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07), pages 1-7, IEEE Signal Processing Society, Washington DC, USA, 2007.
 
Segmentation of speech: Child's play?, Odette Scharenborg, Mirjam Ernestus and Vincent Wan, in: Proc Interspeech 2007, 2007.
 
Sentiment Classification with Interpolated Information Diffusion Kernels, Stephan Raaijmakers, in: Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising (ADKDD'07), pages 34-39, ISSN 978-1-59593-833-6, 2007. [DOI]
 
Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings, J. Kolar, Yang Liu and Elizabeth Shriberg, in: Interspeech, 2007.
 
Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array, Hari Krishna Maganti, Daniel Gatica-Perez and I. McCowan, in: IEEE Trans. on Audio, Speech, and Language Processing, volume 15, number 11, pages 2257-2269, 2007.
 
Speeding Up Speaker Diarization by Using Prosodic Features, Yan Huang, Gerald Friedland, C. Müller and Nikki Mirghafori, number TR-07-004, 2007.
 
Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search, Szoke Igor, Fapso Michal, Karafiát Martin, Burget Lukas, Frantisek Grezl, Schwarz Petr, Glembek Ondrej, Matejka Pavel, Kopecky Jiria and Jan Cernocky, in: Machine Learning and Multimodal Interaction, 28.-30.6.2007, pages 1, Brno, CZ, 2007.
 
STBU system for the NIST 2006 speaker recognition evaluation, Pavel Matejka, Lukas Burget, Petr Schwarz, Ondřej Glembek, Martin Karafiat, Frantisek Grezl, Jan Cernocky, David van Leeuwen, Niko Brümmer and Albeert Strasheim, in: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), pages 221-224, IEEE Signal Processing Society, Honolulu, US, 2007.
 
Term-Weighting for Summarization of Multi-Party Spoken Dialogues, Gabriel Murray and Steve Renals, in: Proc. of MLMI 2007, Brno, Czech Republic, 2007.
 
The AMI speaker diarization system for NIST RT06s Meeting Data, David van Leeuwen and Marijn Huijbregts, in: NIST Rich Transcription 2006 Spring Meeting Recognition Evaluation, RT06s, pages 371-385, Springer Verlag, 2007.
 
The AMI System for the Transcription of Speech in Meetings, Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Martin Karafiat, Mike Lincoln, Jithendra Vepa and Vincent Wan, in: Proc. ICASSP, 2007.
 
The AMI system for the Transcription of Speech in Meetings, Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Martin Karafiat, Mike Lincoln, Jithendra Vepa and Vincent Wan, in: Proceedings of the 32nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), 2007.
 
The Blame Game: Performance Analysis of Speaker Diarization System Components, Marijn Huijbregts and Chuck Wooters, in: Proceedings of Interspeech 2007, pages 4, International Speech Communication Association, 2007.
 
The ICSI RT07s Speaker Diarization System., Chuck Wooters and Marijn Huijbregts, in: CLEAR, pages 509-519, Springer, Baltimore, 2007. [DOI]
 
The ICSI-SRI Spring 2006 Meeting Recognition System, Adam Janin, Andreas Stolcke, Xavier Anguera, Kofi Boakye, Özgür Çetin, Joe Frankel and Jing Zheng, in: Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), pages 444-456, Springer, 2007.
 
The listening room: a speech-based interactive art installation, Alexa Wright, Alun Evans, Alf Linney and Mike Lincoln, in: MULTIMEDIA '07: Proceedings of the 15th International Conference on Multimedia, pages 681-690, ACM, Augsburg, Germany, 2007. [DOI]
 
The Project Browser: Supporting Information Access for a Project team, A. H. M. Cremers, P Groenewegen, I. Kuiper and Wilfried Post, in: HCII 2007, Beijing, China, 2007.
 
The Role of Artefacts in Presence Mediation, Dhaval Vyas, in: 1st Peach Summer School Program, pages 32, PEACH Research Consortium, 2007.
 
To separate speech! a system for recognizing simultaneous speech, John McDonough, Kenichi Kumatani, Tobias Gehrig, Emilian Stoimenov, UweMayer, Stefan Schacht, Matthias Wolfel and Dietrich Klakow, in: Proc. 4th Joint Workshop on Machine Learning and Multimodal Interaction, 2007.
 
To Whom It May Concern - Addressee Identification in Face-to-Face Meetings, N. Jovanovic, University of Twente, 2007.
 
Toward Automatic Decision Detection: Empirical, Statistical and Machine Learning Approach, Pei-Yun Hsueh, in: MMKM Workshop (Multimedia Knowledge management): Industry meets academia, 2007.
 
Towards Automated Observational Analysis of Leadership in Clinical Networks, I. McCowan and H. Harden, in: Third International Conference Information Technology in Health Care (ITHC2007): Socio-technical approaches, pages 133-141, IOS Press, 2007.
 
Towards capturing fine phonetic variation in speech using articulatory features, Odette Scharenborg, Vincent Wan and Roger Moore, in: Speech Communication, volume 49, pages 811-826, 2007.
 
Towards online speech summarization, Gabriel Murray and Steve Renals, in: Proc. Interspeech '07, 2007.
 
Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus, Jean Carletta, in: Language Resources and Evaluation, volume 41, number 2, pages 181-190, 2007.
 
Unsupervised Speech/Non-speech Detection for Automatic Speech Recognition in Meeting Rooms, Hari Krishna Maganti, Petr Motlicek and Daniel Gatica-Perez, in: Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Honolulu, 2007.
 
Using audio and video features to classify the most dominant person in a group meeting, Hayley Hung, Dinesh Jayagopi, Chuohao Yeo, Gerald Friedland, Sileye O. Ba, J-M. Odobez, Kannan Ramchandran, Nikki Mirghafori and Daniel Gatica-Perez, in: ACM Multimedia, pages 835-838, 2007.
 
Using audio and video features to classify the most dominant person in meetings, Hayley Hung, Dinesh Jayagopi, Chuohao Yeo, Gerald Friedland, Sileye O. Ba, J-M. Odobez, Kannan Ramchandran, Nikki Mirghafori and Daniel Gatica-Perez, in: Proceedings of ACM Multimedia, pages 835-838, 2007.
 
Verbal behavior of the more and the less influential meeting participant, Rutger Rienks, Anton Nijholt and Dirk Heylen, ACM, ISBN 978-1-59593-870-115, 2007.
 
Video Summarization at Brno University of Technology, Vítezslav Beran, Adam Herout, Michal Hradis, Petr Chmelar, Igor Potucek, Stanislav Sumec and Pavel Zemcik, in: ACM Multimedia, pages 16-19, Association for Computing Machinery, Augsburg, Bavaria, DE, 2007.
 
Virtual Meeting Rooms: From Observation to Simulation, Dennis Reidsma, H. J. A. op den Akker, Rutger Rienks, Ronald Poppe, Anton Nijholt, Dirk Heylen and Job Zwiers, in: AI and Society, The Journal of Human-Centred Systems, volume 22, pages 133-144, 2007.
 
Vision-Based Human Motion Analysis: An Overview, Ronald Poppe, in: Computer Vision and Image Understanding, volume 108(1-2), pages 4-18, ISSN 1077-3142, 2007.
 
What Decisions Have You Made: Automatic Decision Detection in Conversational Speech, Pei-Yun Hsueh and Johanna D. Moore, in: Proceedings of NACCL/HLT 2007, 2007.
 

2006

A corpus for studying addressing behavior in multi-party dialogues, N. Jovanovic, Rieks op den Akker and Anton Nijholt, in: Language Resources and Evaluation, volume 40, number 1, pages 5-23, 2006.
 
A Corpus-based Approach to the Classification and Correction of Disfluencies in Spontaneous Speech, Jana Besser, .
 
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room, Sileye O. Ba and J-M. Odobez, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, 2006.
 
Active Transaction Approach for Collaborative Virtual Environments, Jan Peciva, in: ACM International Conference on Virtual Reality Continuum and its Applications (VRCIA), pages 171-178, Association for Computing Machinery, Chinese University of Hong Kong, HK, 2006.
 
Addressee Identification in Face-to-Face Meetings, N. Jovanovic, Rieks op den Akker and Anton Nijholt, in: Proceedings EACL 2006, 11th Conference of the European Chapter of the Association for Computational Linguistics, pages 169-176, ACL, 2006.
 
An Acoustic Model Based on Kullback-Leibler Divergence for Posterior Features, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: ICASSP, 2006.
 
Analyzing Human Interaction in Conversations: a Review, Daniel Gatica-Perez, in: Proc. IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), Special Session on Multisensor Fusion for Human-Activity Analysis, 2006.
 
Annotating Emotions in Meetings, Dennis Reidsma, Dirk Heylen and R.J.F Ordelman, in: Proc. of the fifth international conference on Language Resources and Evaluation, LREC 2006, pages 1117-1122, ELRA, 2006.
 
Annotating State of Mind in Meeting Data, Dirk Heylen, Dennis Reidsma and R.J.F Ordelman, in: Proc. of the LREC2006 Workshop on Corpora for Research on Emotion and Affect, pages 84-87, 2006.
 
Automatic Language Identification System, Jan Cernocky, Pavel Matejka, Lukas Burget and Petr Schwarz, in: Proceedings of scientific seminar: "Nove technologie v radiokomunikacich", pages 1-6, Brno University of Defence, 2006.
 
Automatic Segmentation of Multiparty Dialogue, Pei-Yun Hsueh, Johanna D. Moore and Steve Renals, in: the Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, 2006.
 
Automatic Topic Segmentation and Labeling in Multiparty Dialogue, Pei-Yun Hsueh and Johanna D. Moore, in: Proceedings of the first IEEE/ACM workshop on Spoken Language Technology (SLT), 2006.
 
Automatic Topic Segmentation and Lablelling in Multiparty Dialogue, Pei-Yun Hsueh and Johanna D. Moore, in: the first IEEE/ACM workshop on Spoken Language Technology (SLT) 2006, 2006.
 
Brno University of Technology System for NIST 2005 Language Recognition Evaluation, Pavel Matejka, Lukas Burget, Petr Schwarz and Jan Cernocky, in: Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop, pages 57-64, 2006.
 
Challenges for Virtual Humans in Human Computing, Dennis Reidsma, Z. Ruttkay and Anton Nijholt, Lecture Notes in Artificial Intelligence 4451, pages 316-338, Springer Verlag, ISBN 978-3-540-72346-2, 2006.
 
Comparison of Silhouette Shape Descriptors for Example-based Human Pose Recovery, Ronald Poppe and Mannes Poel, in: Proceedings of the IEEE Conference on Automatic Face and Gesture Recognition (FG 2006), pages 541-546, IEEE Computer Society, 2006.
 
Detection and Application of Influence Rankings in Small Group Meetings, Rutger Rienks, Daniel Gatica-Perez and Wilfried Post, in: Proceedings of the International Conference on Multimodal Interfaces (ICMI), pages 257-264, ACM, 2006.
 
Determining what people feel and think when interacting with humans and machines, Dirk Heylen, Anton Nijholt and Dennis Reidsma, in: Recent Advances in Engineering Mechanics, pages 1-6, California State University, Fullerton, Orange County, LA, USA, 2006.
 
Dialogue Act Compression Via Pitch Contour Preservation, Gabriel Murray and Steve Renals, in: Proceedings of Interspeech 2006, 2006.
 
Dialogue-act tagging using smart feature selection: results on multiple corpora, A. T. Verbree, Rutger Rienks and Dirk Heylen, in: The first International IEEE Workshop on Spoken Language Technology (SLT), 2006.
 
Dimensionality Reduction Aids Term Co-Occurrence Based Multi-Document Summarization, B. Hachey, Gabriel Murray and D. Reitter, in: Proceedings of COLING-ACL Summarization Workshop, 2006, 2006.
 
Discriminative Training Techniques for Acoustic Language Identification, Lukas Burget, Pavel Matejka and Jan Cernocky, in: Proceedings of ICASSP 2006, pages 209-212, 2006.
 
Embedding Motion in Model-Based Stochastic Tracking, J-M. Odobez, Daniel Gatica-Perez and Sileye O. Ba, in: IEEE Transaction on Image Processing, volume 15, number 11, 2006.
 
Facial Displays, Emotional Expressions and Conversational Acts, Dirk Heylen, in: Cybernetics and Systems 2006, pages 625-630, Medical University of Vienna and Austrian Society for Cybernetic Studies, 2006.
 
First Steps Towards the Automatic Construction of Argument-Diagrams from Real Discussions, A. T. Verbree, Rutger Rienks and Dirk Heylen, in: 1st International Conference on Computational Models of Argument, pages 183-194, IOS press ISBN=1-58603-652-1, 2006.
 
Hierarchical structures of neural networks for phoneme recognition, Petr Schwarz, Pavel Matejka and Jan Cernocky, in: Proceedings of ICASSP 2006, pages 325-328, 2006.
 
Human Computing and Machine Understanding of Human Behavior: A Survey, Maja Pantic, Alex Pentland, Anton Nijholt and Thomas Huang, in: CM SIGCHI Proceedings Eighth International Conference on Multimodal Interfaces (ACM ICMI 2006), pages 239-248, ACM, New York, ACM, 2006.
 
Human Computing, Virtual Humans and Artificial Imperfection, Z. Ruttkay, Dennis Reidsma and Anton Nijholt, in: ACM SIGCHI Proceedings Eighth International Conference on Multimodal Interfaces (ACM ICMI 2006), pages 179-184, ACM, 2006.
 
Incorporating Speaker and Discourse Features into Speech Summarization, Gabriel Murray, Steve Renals, Jean Carletta and Johanna D. Moore, in: Proceedings of the Human Language Technology conference / North American chapter of the Association for Computational Linguistics annual meeting, New York City, USA, 2006.
 
Indexing and search methods for spoken documents, Lukas Burget, Jan Cernocky, Michal Fapso, Martin Karafiat, Pavel Matejka, Petr Schwarz, Pavel Smrz and Igor Szoke, in: Proceedings of the Ninth International Conference on Text, Speech and Dialogue, TSD 2006, pages 351-358, Springer Verlag, 2006.
 
Investigating Mind Markers in Design Meetings, Dirk Heylen, in: Group Decision and Negotiation 2006, pages 132-134, Universitatsverlag Karlsruhe, 2006.
 
Juicer: A Weighted Finite State Transducer speech decoder, Darren Moore, John Dines, M. Magimai Doss, Jithendra Vepa, O. Cheng and Thomas Hain, in: Proceedings of MLMI, 2006.
 
Keyword Spotting in Meeting Data, Igor Szoke, in: Proceedings of the 12th Conference Student EEICT 2006 Volume 4, pages 440-444, Faculty of Electrical Engineering and Communication BUT, 2006.
 
Measuring the quality of multi-document cluster headlines, F. van Kesteren and Wessel Kraaij, in: Proceedings of the IIIA 2006 workshop, 2006.
 
Meetings and Meeting Modeling in Smart Environments, Anton Nijholt, Dirk Heylen and Rieks op den Akker, in: AI & Society. The Journal of Human-Centred Systems, pages 202-220, Springer-Verlag, 2006.
 
Meetings and Meeting Support in Ambient Intelligence, Rutger Rienks, Anton Nijholt and Dennis Reidsma, Mobile communication series, pages 359-378, chapter 17, Artech House, ISBN 1-58053-963-7,, 2006.
 
Multi-stream ASR: An Oracle Perspective, Hemant Misra, Jithendra Vepa and Hervé Bourlard, in: Proceedings of Interspeech (ICSLP), 2006.
 
Multistream Recognition of Dialogue Acts in Meetings, Alfred Dielmann and Steve Renals, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, volume LNCS 4299, pages 178-189, 2006.
 
Online and Off-line Visualization of Meeting Information and Meeting Support,, Anton Nijholt, Rutger Rienks, Job Zwiers and Dennis Reidsma, in: The Visual Computer, volume 22, number 12, pages 965-976, ISSN 0178-2789, 2006. [DOI]
 
Parallel training of neural networks for speech recognition, Stanislav Kontar, in: Proc. 12th International Conference on Soft Computing MENDEL'06, pages 6, 2006.
 
Pro-active Meeting Assistants: Attention Please, Rutger Rienks, Anton Nijholt and Paulo Barthelmess, in: Proceedings of the 5th workshop on Social Intelligence Design, pages 213-228, Osaka University, 2006.
 
Prosodic Correlates of Rhetorical Relations, Gabriel Murray, M. Taboada and Steve Renals, in: Proceedings of ACTS Workshop, HLT/NAACL 2006, 2006.
 
Real-time Visual Processing Using Views, Pavel Zemcik, Adam Herout, Vítezslav Beran, Stanislav Sumec and Igor Potucek, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, 2006.
 
Recurrent timing neural networks for joint F0-localisation estimation, S. N. Wrigley and G. J. Brown, in: Proceedings of the Advances in Models for Acoustic Processing NIPS 2006 workshop, 2006.
 
Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition, Martin Karafiat, Frantisek Grezl, Petr Schwarz, Lukas Burget and Jan Cernocky, in: Proc. Fifth Slovenian and First International Language Technologies Conference, pages 1-4, 2006.
 
Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition, Martin Karafiat, Frantisek Grezl, Petr Schwarz, Lukas Burget and Jan Cernocky, in: Proc. 3nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), pages 10, 2006.
 
Search Engine for Information Retrieval from Speech Records, Michal Fapso, Petr Schwarz, Igor Szoke, Pavel Smrz, Milan Schwarz, Jan Cernocky, Martin Karafiat and Lukas Burget, in: Proceedings of the Third International Seminar on Computer Treatment of Slavic and East European Languages, pages 100-101, 2006.
 
Speaker localization for microphone array-based ASR: the effects of accuracy on overlapping speech, Hari Krishna Maganti and Daniel Gatica-Perez, in: ICMI '06: Proceedings of the 8th International Conference on Multimodal Interfaces, pages 35-38, ACM, Banff, Alberta, Canada, 2006. [DOI]
 
Speaker Localization for Microphone-Array-Based ASR: the Effects of Accuracy on Overlapping Speech, Hari Krishna Maganti and Daniel Gatica-Perez, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2006.
 
Syntactic chunking across different corpora, Weiqun Xu, Jean Carletta and Johanna D. Moore, in: Proceedings of MLMI'06, 2006.
 
Task based evaluation of exploratory search systems, Wessel Kraaij and Wilfried Post, in: SIGIR 2006 workshop, Evaluating Exploratory Search Systems, 2006.
 
The AMI meeting transcription system: Progress and performance, Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Martin Karafiat, Mike Lincoln, Jithendra Vepa and Vincent Wan, in: Proceedings of NIST RT06 Spring workshop, 2006.
 
The NITE XML Toolkit: data model and query language, Jean Carletta, S. Evert, U. Heid and J. Kilgour, in: Language Resources and Evaluation Journal, 2006.
 
The segmentation of multi-channel meeting recordings for automatic speech recognition, John Dines, Jithendra Vepa and Thomas Hain, in: Proceedings of Interspeech (ICSLP), 2006.
 
Towards On- and Off-line Search, Browse and Replay of Home Activities, Anton Nijholt, in: Proceedings 3rd IFIP Conference on Artificial Intelligence Applications & Innovations (AIAI), pages 385-392, 2006.
 
Towards the Automatic Generation of Virtual Presenter Agents, Anton Nijholt, in: Informing Science: International Journal of an Emerging Discipline, volume 9, number ISSN 1547-9684 (print), pages 97-110, 2006.
 
Towards the Automatic Generation of Virtual Presenter Agents, Anton Nijholt, in: Proceedings InSITE 2006, Informing Science Conference, 2006.
 
Tracking the Multi-Person Wandering Visual Focus of Attention, Kevin Smith, Sileye O. Ba, Daniel Gatica-Perez and J-M. Odobez, in: International Conference on Multi-Modal Interfaces ICMI 06, 2006.
 
Use of anti-models to furher improve state-of-the-art PRLM language recognition system, Pavel Matejka, Petr Schwarz, Lukas Burget and Jan Cernocky, in: Proceedings of ICASSP 2006, pages 197-200, 2006.
 
Using Audio, Visual, and Lexical Features in a Multi-Modal Virtual Meeting Director, Marc Al-Hames, Benedikt Hörnler, Christoph Scheuermann and Gerhard Rigoll, in: MLMI 2006, 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, 2006.
 
Using Posterior-Based Features in Template Matching for Speech Recognition, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: Proceedings of ICSLP, 2006.
 
VISA - Corpus Annotation with OWL, Stephanie Becker, Thomas Kleinbauer and Stephan Lesch, in: Proceedings of 10th Workshop on the Semantics and Pragmatics of Dialogue, online, 2006.
 

2005

A Multi-modal Graphical Model for Robust Recognition of Group Actions in Meetings from Disturbed Videos, Marc Al-Hames and Gerhard Rigoll, in: Proc. IEEE International Conference on Image Processing (ICIP), 2005.
 
A Multi-Modal Mixed-State Dynamic Bayesian Network for Robust Meeting Event Recognition from Disturbed Data, Marc Al-Hames and Gerhard Rigoll, in: Proc. 6th International Conference on Multimedia and Expo, IEEE ICME, 2005.
 
A New Metric for the Evaluation of Dialog Act Classification, Stephan Lesch; Thomas Kleinbauer; Jan Alexandersson, in: In Proceedings of the 9th Workshop on the Semantics and Pragmatics of Dialogue (DIALOR), 2005.
 
A Rao-Blackwellized Mixed State particle Filter For Head Pose Tracking In Meetings, Sileye O. Ba and J-M. Odobez, in: Proceedings of 7th ACM-ICMI Workshop on Multimodal Multiparty Meeting Processing, pages 9-16, 2005.
 
AMI: Augmented Multiparty Interaction, Steve Renals, in: Proc. NIST Meeting Transcription Workshop, 2005.
 
AmiGram - a General-Purpose Tool for Multimodal Corpus Annotation, Christoph Lauer, Jochen Frey, Benjamin Lang, Jan Alexandersson, Tilman Becker, Thomas Kleinbauer and Harald Lochert, in: Proceedings of the 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Springer, 2005.
 
AmiGram - a General-Purpose Tool for Multimodal Corpus Annotation, Christoph Lauer, Jochen Frey, Benjamin Lang, Jan Alexandersson, Tilman Becker, Thomas Kleinbauer and Harald Lochert, in: Proceedings of the 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Springer, 2005.
 
Analysing Meeting Records: An Ethnographic Study and Technological Implications, Steve Whittaker, Rachel Laban and Simon Tucker, in: 2nd Joint Workshop on Multimodal Interaction and Machine Learning Algorithms, 2005.
 
Applying Vocal Tract Length Normalization to Meeting Recordings, Giulia Garau, Steve Renals and Thomas Hain, in: Proc. Interspeech, 2005.
 
Argument Diagramming of Meeting Conversations, Rutger Rienks and Dirk Heylen, in: Multimodal Multiparty Meeting Processing, Workshop at the 7th International Conference on Multimodal Interfaces (to appear), 2005.
 
Automatic Dialog Act Segmentation and Classification in Multiparty Meetings, Jeremy Ang, Yang Liu and Elizabeth Shriberg, in: to appear in ICASSP'05, 2005.
 
Automatic Dominance Detection in Meetings using easily detectable features, Rutger Rienks and Dirk Heylen, in: 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Springer Verlag, 2005.
 
Designing Focused and Efficient Annotation Tools, Dennis Reidsma, D. H. W. Hofs and N. Jovanovic, in: Symp. on Annotating and measuring meeting behavior, MB2005, pages 149-152, 2005.
 
Detecting Group Interest-level in Meetings, Daniel Gatica-Perez, I. McCowan, Dong Zhang and Samy Bengio, in: in Proceedings IeeeeInt, Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005.
 
Developing and Enhancing Posterior Based Speech Recognition Systems, Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Hervé Bourlard, in: Proceedings of Interspeech, 2005.
 
Evaluating Automatic Summaries of Meeting Recordings, Gabriel Murray, Steve Renals, Jean Carletta and Johanna D. Moore, in: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbor, MI, USA, 2005.
 
Evaluating Multi-Object Tracking, Kevin Smith, Daniel Gatica-Perez, J-M. Odobez and Sileye O. Ba, in: Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Workshop on Empirical Evaluation Methods in Computer Vision (CVPR-EEMCV), 2005.
 
Extracting Information from Multimedia Meeting Collections, Daniel Gatica-Perez, Dong Zhang and Samy Bengio, in: Proc. ACM Int. Conf. on Multimedia, Workshop on Multimedia Information Retrieval (ACM MM MIR), invited paper, 2005.
 
Extractive Summarization of Meeting Recordings, Gabriel Murray, Steve Renals and Jean Carletta, in: Proceedings of the 9th European Conference on Speech Communication and Technology, Lisbon, Portugal, 2005.
 
Hierarchical Multi-Stream Posterior Based Speech Recognition System, Hamed Ketabdar, Samy Bengio and Hervé Bourlard, in: Proceedings of MLMI, 2005.
 
Hierarchical topic detection in large digital news archives: Exploring a sample based approach, Dolf Trieschnigg and Wessel Kraaij, in: Journal of Digital Information Management, volume 3, number 1, 2005.
 
Improving Speech Recognition Using a Data-Driven Approach, Guillermo Aradilla, Jithendra Vepa and Hervé Bourlard, in: Proceedings of Interspeech, 2005.
 
Introducing an Embodied Virtual Presenter Agent in a Virtual Meeting Room, Anton Nijholt, H. van Welbergen and Job Zwiers, in: Proceedings IASTED International Conference on Artificial Intelligence and Applications (AIA 2005), 2005.
 
Learning influence among interacting Markov chains, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and Deb Roy, in: NIPS, 2005.
 
Meeting Modelling in the Context of Multimodal Research, Dennis Reidsma, Rutger Rienks and N. Jovanovic, in: Machine Learning for Multimodal Interaction, First International Workshop 2004, Revised Selected Papers, pages 22-35, Springer Verlag, 2005.
 
Meetings in the Virtuality Continuum: Send Your Avatar., Anton Nijholt, in: International Conference on CYBERWORLDS, pages 75-82, IEEE Computer Society Press, 2005.
 
Modeling Individual and Group Actions in Meetings with Layered HMMs, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and I. McCowan, in: IEEE Transactions on Multimedia, 2005.
 
Multimodal Integration for Meeting Group Action Segmentation and Recognition, Marc Al-Hames, Alfred Dielmann, Daniel Gatica-Perez, Stephan Reiter, Steve Renals, Gerhard Rigoll and Dong Zhang, in: 2nd international Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, volume LNCS 3869, pages 52-63, 2005.
 
Multimodal Meeting Analysis by Segmentation and Classification of Meeting Events based on a Higher Level Semantic Approach, Stephan Reiter and Gerhard Rigoll, in: in Proceedings of the 30th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2005.
 
Multimodal Multispeaker Probabilistic Tracking in Meetings, Daniel Gatica-Perez, Guillaume Lathoud, J-M. Odobez and I. McCowan, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), 2005.
 
Multi-party Interaction in a Virtual Meeting Room, Rutger Rienks, Ronald Poppe, Anton Nijholt, Dirk Heylen and N. Jovanovic, in: Proceedings of Measuring Behavior 2005, the 5th International Conference on Methods and Techniques in Behavioral Research (To appear), 2005.
 
Novalist: Content Reduction for Cross-media Browsing, Franciska de Jong and Wessel Kraaij, in: Proceedings of the RANLP workshop Crossing barriers in Text Summarization Research, 2005.
 
Novel techniques for time-compressing speech: An exploratory study, Simon Tucker and Steve Whittaker, in: IEEE International Conference n Acoustics, Speech and SignalProcessing (ICASSP), 2005.
 
Omnipresent Collaborative Virtual Environments for Open Inventor Applications, Peciva Jan, in: Lecture Notes in Computer Science, pages 272-276, Springer, 2005.
 
Phoneme based acoustics keyword spotting in informal continuous speech, Igor Szoke, Petr Schwarz, Lukas Burget, Martin Karafiat and Jan Cernocky, in: In: Proceedings of the international conference Radioelektronika 2005, pages 195-198, FEEC BUT, FEEC BUT, 2005.
 
Polynomial Dynamic Time Warping Kernel Support Vector Machines for Speech Recognition with Sparse Training Data, Vincent Wan and J. Carmichael, in: In Proc. Interspeech, pages 3321-3324, 2005.
 
Semi-supervised Adapted HMMs for Unusual Event Detection, Dong Zhang, Daniel Gatica-Perez, Samy Bengio and I. McCowan, in: Pro. IEEE CVPR, 2005.
 
Semi-supervised Meeting Event Recognition with Adapted HMMs, Dong Zhang, Daniel Gatica-Perez and Samy Bengio, in: IEEE ICME, 2005.
 
Speaker Prediction based on Head Orientations, Rutger Rienks, Ronald Poppe and Mannes Poel, in: Proceedings of the Fourteenth Annual Machine Learning Conference of Belgium and the Netherlands (Benelearn 2005), pages 73-79, 2005.
 
Speech Acquisition in Meetings with an Audio-visual Sensor Array, I. McCowan, Hari Krishna Maganti, Daniel Gatica-Perez, Darren Moore and Sileye O. Ba, in: Proceedings of the IEEE ICME,2005, 2005.
 
The AMI Meeting Corpus: A Pre-Announcement, Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, I. McCowan, Wilfried Post, Dennis Reidsma and Pierre Wellner, in: MLMI'05: Proceedings of the Workshop on Machine Learning for Multimodal Interaction, pages 28-39, Springer-Verlag, 2005.
 
The AMI Meetings Corpus, Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, I. McCowan, Wilfried Post, Dennis Reidsma and Pierre Wellner, in: Proceedings of the Measuring Behavior 2005 symposium on "Annotating and measuring Meeting Behavior", 2005.
 
The Distributed Virtual Meeting Room Exercise., Anton Nijholt, Job Zwiers and Jan Peciva, in: ICMI 2005 Workshop on Multimodal multiparty meeting processing, pages 93-99, 2005.
 
The Embra System at DUC 2005: Query-oriented Multi-document Summarization with a Very Large Latent Semantic Space, B. Hachey, Gabriel Murray and D. Reitter, in: Proceedings of the Document Understanding Conference (DUC) 2005, Vancouver, BC, Canada, 2005.
 
The Multi-channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments, Mike Lincoln, in: Proc. ASRU05, 2005.
 
Towards a Decent Recognition Rate for the Automatic Classification of a Multidimensional Dialogue Act Tagset, Stephan Lesch, Thomas Kleinbauer and Jan Alexandersson, in: In Workshop notes of the Fourth IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems. Edinburgh, pages 46-53, 2005.
 
Towards real-time pose estimation for presenters in meeting environments, Ronald Poppe, Dirk Heylen, Anton Nijholt and Mannes Poel, in: in Proceedings of the 13-th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision'2005, (WSCG2005),, pages 41-44, 2005.
 
Towards Simulating Humans in Augmented Multi-party Interaction., Anton Nijholt, in: Computer Simulation in Information and Communication Engineering (CSICE'05), pages 47-51, 2005.
 
Tracking People in Meetings with Particles, Daniel Gatica-Perez, J-M. Odobez, Sileye O. Ba, Kevin Smith and Guillaume Lathoud, in: The International Workshop on Image Analysis for Multimedia Interactive Services WIAMIS, 2005.
 
Using Particles to Track Varying Numbers of Objects, Kevin Smith, Daniel Gatica-Perez and J-M. Odobez, in: Computer Vision and Pattern Recognition CVPR, 2005.
 
Virtual Meeting Rooms: From Observation to Simulation, Dennis Reidsma, H. J. A. op den Akker, Rutger Rienks, Ronald Poppe, Anton Nijholt, Dirk Heylen and Job Zwiers, in: Proceedings Social Intelligence Design 2005, 2005.
 

2004

A probabilistic framework for joint head tracking and pose estimation, Sileye O. Ba and J-M. Odobez, in: 17th Int. Conf. Pattern Recognition (ICPR), 2004.
 
A research environment for meeting behavior, Wilfried Post, A. H. M. Cremers and Olivier Blanson Henkemans, in: Workshop on Social Intelligence Design (SID), 2004.
 
Accessing Multimodal Meeting Data: Systems, Problems and Possibilities, Simon Tucker and Steve Whittaker, in: Proc. MLMI'04, Springer-Verlag LNCS, 2004.
 
Animated Particle Rendering in DSP and FPGA, Herout Adam and Zemcík Pavel, in: SCCG 2004 Proceedings, pages 237-242, 2004.
 
Augmented Multi-User Communication System, Vítezslav Beran, in: In: Proceedings of the working conference on Advanced visual interfaces, pages 257-260, 2004.
 
AV16.3: an audio-visual corpus for speaker localization and tracking, Guillaume Lathoud, J-M. Odobez and Daniel Gatica-Perez, in: Proc.MLMI Workshop, 2004.
 
Boosting Pixel-based Classifiers for Face Verification, Sebastien Marcel and Yann Rodriguez, in: Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004, 2004.
 
Clustering And Segmenting Speakers And Their Locations In Meetings, Jitendra Ajmera, Guillaume Lathoud and I. McCowan, in: Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04), 2004.
 
Combinations of TRAP-based systems, Frantisek Grezl, in: In: Proc. Seventh International conference on Text, Speech and Dialogue, pages 323-330, Fakulta informatiky MUNI, 2004.
 
Disclosure of non-scripted video content: InDiCo and M4/AMI, Franciska de Jong, in: In: Proceedings of CIVR2004. Lecture Notes in Computer Science,, pages 647-655, 2004.
 
Effect of Recognition Errors on Information Retrieval Performance, Alessandro Vinciarelli, in: Proceedings of International Workshop on Frontiers in Handwriting Recognition (IWFHR), 2004.
 
Embedding motion in model-based stochastic tracking, J-M. Odobez and Daniel Gatica-Perez, in: 17th Int. Conf. Pattern Recognition (ICPR), 2004.
 
Estimating the Quality of Face Localization for Face Verification, Yann Rodriguez, Fabien Cardinaux, Samy Bengio and Johnny Marithoz, in: IEEE International Conference on Image Processing, ICIP, 2004.
 
Face Verification Using Adapted Generative Models, Fabien Cardinaux, Conrad Sanderson and Samy Bengio, in: The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, 2004.
 
From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System, Nikki Mirghafori, Andreas Stolcke, Chuck Wooters, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. Ostendorf, in: Proc. ICSLP'04, 2004.
 
Meetings, Gatherings and Events in Smart Environments, Anton Nijholt, in: In: Proceedings VRCAI 2004: ACM SIGGRAPH International Conference on Virtual-Reality Continuum and its Applications in Industry, pages 229-232, ACM, 2004.
 
Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, I. McCowan and Guillaume Lathoud, in: the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR, 2004.
 
Multimodal Group Action Clustering in Meetings, Dong Zhang, Daniel Gatica-Perez, Samy Bengio, I. McCowan and Guillaume Lathoud, in: ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia, 2004.
 
Multimodal Speech Processing Using Asynchronous Hidden Markov Models, Samy Bengio, in: Information Fusion, volume 5, 2004.
 
Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication, Norman Poh and Samy Bengio, in: The Speaker and Recognition Workshop, 2004.
 
Noisy Text Categorization, Alessandro Vinciarelli, in: Proceedings of International Conference on Pattern Recognition (ICPR), 2004.
 
Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models, Alessandro Vinciarelli, Samy Bengio and H. Bunke, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 26, pages 709-720, 2004.
 
Order Matters: A Distributed Sampling Method for Multi-Object Tracking, Kevin Smith and Daniel Gatica-Perez, in: Proc. British Machine Vision Conference BMVC, 2004.
 
Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004, Andreas Stolcke, Chuck Wooters, Nikki Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. Ostendorf, in: Proc. NIST 2004 Meeting Recognition Workshop, 2004.
 
Segmentation and Classification of Meeting Events using Multiple Classifier Fusion and Dynamic Programming, Stephan Reiter and Gerhard Rigoll, in: in IEEE Proceedings of the International Conference on Pattern Recognition (ICPR), 2004.
 
Smart exposition rooms: the ambient intelligence view, Anton Nijholt, in: In: Proceedings Electronic Imaging & the Visual Arts (EVA 2004), pages 100-105, Pitagora Editrice Bologna, 2004.
 
The 2004 ICSI-SRI-UW Meeting Recognition System, Chuck Wooters, Nikki Mirghafori, Andreas Stolcke, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. Ostendorf, in: Proc. MLMI'04, 2004.
 
The ICSI Meeting Project: Resources and Research, Adam Janin, Jeremy Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macias-Guarasa, N. Morgan, B. Peskin, Elizabeth Shriberg, Andreas Stolcke, Chuck Wooters and B. Wrede, in: NIST 2004 Meeting Recognition Workshop, 2004.
 
The ICSI Meeting Recorder Dialog Act MRDA Corpus, Elizabeth Shriberg, R. Dhillon, S. Bhagat, Jeremy Ang and H. Carvey, in: Proc. HLT-NAACL SIGDIAL Workshop, Boston, April-May, 2004.
 
The NITE XML Toolkit meets the ICSI Meeting Corpus: import, annotation, and browsing, Jean Carletta and J. Kilgour, in: MLMI'04: Proceedings of the Workshop on Machine Learning for Multimodal Interaction, Springer-Verlag LNCS, 2004.
 
TRAP based features for LVCSR of meeting data, Karafiát Martin, Frantisek Grezl and Jan Cernocky, in: In: Proc. 8th International Conference on Spoken Language Processing, pages 4, 2004.
 
Unsupervised Location-Based Segmentation of Multi-Party Speech, Guillaume Lathoud, I. McCowan and J-M. Odobez, in: Proceedings of the 2004 ICASSP-NIST Meeting, 2004.
 
Powered by Aigaion