3rd MLMI 2006: Bethesda, MD, USA
- Steve Renals, Samy Bengio, Jonathan G. Fiscus:
Machine Learning for Multimodal Interaction, Third International Workshop, MLMI 2006, Bethesda, MD, USA, May 1-4, 2006, Revised Selected Papers. Lecture Notes in Computer Science 4299, Springer 2006, ISBN 3-540-69267-3
Invited Paper
- Parisa Eslambolchilar, Roderick Murray-Smith:
Model-Based, Multimodal Interaction in Document Browsing. 1-12
Multimodal Processing
- Martial Michel, Jerome Ajot, Jonathan G. Fiscus:
The NIST Meeting Room Corpus 2 Phase 1. 13-23
- Marc A. Al-Hames, Thomas Hain, Jan Cernocký, Sascha Schreiber, Mannes Poel, Ronald Müller, Sébastien Marcel, David A. van Leeuwen, Jean-Marc Odobez, Sileye O. Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlícek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith, Andrew H. C. Thean, Pavel Zemcík:
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. 24-35
- Lei Chen, Mary P. Harper, Amy Franklin, R. Travis Rose, Irene Kimbara, Zhongqiang Huang, Francis K. H. Quek:
A Multimodal Analysis of Floor Control in Meetings. 36-49
- Xiao Huang, Sharon L. Oviatt, Rebecca Lunsford:
Combining User Modeling and Machine Learning to Predict Users' Multimodal Integration Patterns. 50-62
- Marc A. Al-Hames, Benedikt Hörnler, Christoph Scheuermann, Gerhard Rigoll:
Using Audio, Visual, and Lexical Features in a Multi-modal Virtual Meeting Director. 63-74
Image and Video Processing
- Sileye O. Ba, Jean-Marc Odobez:
A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room. 75-87
- Kevin Smith, Sascha Schreiber, Igor Potucek, Vítezslav Beran, Gerhard Rigoll, Daniel Gatica-Perez:
Multi-person Tracking in Meetings: A Comparative Study. 88-101
- Andreas Humm, Jean Hennebert, Rolf Ingold:
Gaussian Mixture Models for CHASM Signature Verification. 102-113
- Aristodemos Pnevmatikakis, Lazaros Polymenakos:
Kalman Tracking with Target Feedback on Adaptive Background Learning. 114-122
- Dennis J. Lin, Jilin Tu, Shyamsundar Rajaram, ZhenQiu Zhang, Thomas S. Huang:
Da Vinci's Mona Lisa. 123-128
HCI and Applications
- Maria Danninger, Erica Robles, Leila Takayama, Qianying Wang, Tobias Kluge, Rainer Stiefelhagen, Clifford Nass:
The Connector Service-Predicting Availability in Mobile Contexts. 129-141
- Agnes Lisowska, Susan Armstrong:
Multimodal Input for Meeting Browsing and Retrieval Interfaces: Preliminary Findings. 142-153
Discourse and Dialogue
- Jacob Eisenstein, Randall Davis:
Gesture Features for Coreference Resolution. 154-165
- Weiqun Xu, Jean Carletta, Johanna D. Moore:
Syntactic Chunking Across Different Corpora. 166-177
- Alfred Dielmann, Steve Renals:
Multistream Recognition of Dialogue Acts in Meetings. 178-189
- Matthias Zimmermann, Dilek Hakkani-Tür, Elizabeth Shriberg, Andreas Stolcke:
Text Based Dialog Act Classification for Multiparty Meetings. 190-199
- Matthew Purver, Patrick Ehlen, John Niekrasz:
Detecting Action Items in Multi-party Meetings: Annotation and Initial Experiments. 200-211
- Özgür Çetin, Elizabeth Shriberg:
Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site. 212-224
Speech and Audio Processing
- Mikko Parviainen, Tuomo W. Pirinen, Pasi Pertilä:
A Speaker Localization System for Lecture Room Environment. 225-235
- Dusan Macho, Climent Nadeu, Andrey Temko:
Robust Speech Activity Detection in Interactive Smart-Room Environments. 236-247
- Xavier Anguera, Chuck Wooters, Javier Hernando:
Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization. 248-256
- José M. Pardo, Xavier Anguera, Chuck Wooters:
Speaker Diarization for Multi-microphone Meetings Using Only Between-Channel Differences. 257-264
- Matthias Wölfel:
Warped and Warped-Twice MVDR Spectral Estimation With and Without Filterbanks. 265-274
- Martin Karafiát, Frantisek Grézl, Petr Schwarz, Lukás Burget, Jan Cernocký:
Robust Heteroscedastic Linear Discriminant Analysis and LCRC Posterior Features in Meeting Data Recognition. 275-284
- Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng, Thomas Hain:
Juicer: A Weighted Finite-State Transducer Speech Decoder. 285-296
- Sebastian Stüker, Chengqing Zong, Jürgen Reichert, Wenjie Cao, Muntsin Kolss, Guodong Xie, Kay Peterson, Peng Ding, Victoria Arranz, Jian Yu, Alex Waibel:
Speech-to-Speech Translation Services for the Olympic Games 2008. 297-308
NIST Meeting Recognition Evaluation
- Jonathan G. Fiscus, Jerome Ajot, Martial Michel, John S. Garofolo:
The Rich Transcription 2006 Spring Meeting Recognition Evaluation. 309-322
- Etienne Marcheret, Gerasimos Potamianos, Karthik Visweswariah, Jing Huang:
The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars. 323-335
- Dominique Vaufreydaz, Rémi Emonet, Patrick Reignier:
A Lightweight Speech Detection System for Perceptive Environments. 336-345
- Xavier Anguera, Chuck Wooters, José M. Pardo:
Robust Speaker Diarization for Meetings: ICSI RT06S Meetings Evaluation System. 346-358
- Corinne Fredouille, Grégory Senay:
Technical Improvements of the E-HMM Based Speaker Diarization System for Meeting Records. 359-370
- David A. van Leeuwen, Marijn Huijbregts:
The AMI Speaker Diarization System for NIST RT06s Meeting Data. 371-384
- Elias Rentzeperis, Andreas Stergiou, Christos Boukis, Aristodemos Pnevmatikakis, Lazaros C. Polymenakos:
The 2006 Athens Information Technology Speech Activity Detection and Speaker Diarization Systems. 385-395
- Xuan Zhu, Claude Barras, Lori Lamel, Jean-Luc Gauvain:
Speaker Diarization: From Broadcast News to Lectures. 396-406
- Christian Fügen, Shajith Ikbal, Florian Kraft, Ken'ichi Kumatani, Kornel Laskowski, John W. McDonough, Mari Ostendorf, Sebastian Stüker, Matthias Wölfel:
The ISL RT-06S Speech-to-Text System. 407-418
- Thomas Hain, Lukás Burget, John Dines, Giulia Garau, Martin Karafiát, Mike Lincoln, Jithendra Vepa, Vincent Wan:
The AMI Meeting Transcription System: Progress and Performance. 419-431
- Jing Huang, Martin Westphal, Stanley F. Chen, Olivier Siohan, Daniel Povey, Vit Libal, Alvaro Soneiro, Henrik Schulz, Thomas Ross, Gerasimos Potamianos:
The IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings. 432-443
- Adam Janin, Andreas Stolcke, Xavier Anguera, Kofi Boakye, Özgür Çetin, Joe Frankel, Jing Zheng:
The ICSI-SRI Spring 2006 Meeting Recognition System. 444-456
- Lori Lamel, Éric Bilinski, Gilles Adda, Jean-Luc Gauvain, Holger Schwenk:
The LIMSI RT06s Lecture Transcription System. 457-468