Benjamin Elizalde
2020 – today
- 2024
- [c29] Benjamin Elizalde, Soham Deshmukh, Huaming Wang:
Natural Language Supervision For General-Purpose Audio Representations. ICASSP 2024: 336-340
- [c28] Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. ICASSP 2024: 371-375
- [c27] Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Prompting Audios Using Acoustic Properties for Emotion Representation. ICASSP 2024: 11936-11940
- [i26] Alon Vinnikov, Amir Ivry, Aviv Hurvitz, Igor Abramovski, Sharon Koubi, Ilya Gurvich, Shai Pe'er, Xiong Xiao, Benjamin Martinez Elizalde, Naoyuki Kanda, Xiaofei Wang, Shalev Shaer, Stav Yagev, Yossi Asher, Sunit Sivasankaran, Yifan Gong, Min Tang, Huaming Wang, Eyal Krupka:
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription. CoRR abs/2401.08887 (2024)
- [i25] Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang:
PAM: Prompting Audio-Language Models for Audio Quality Assessment. CoRR abs/2402.00282 (2024)
- [i24] Soham Deshmukh, Shuo Han, Hazim T. Bukhari, Benjamin Elizalde, Hannes Gamper, Rita Singh, Bhiksha Raj:
Audio Entailment: Assessing Deductive Reasoning for Audio Understanding. CoRR abs/2407.18062 (2024)
- 2023
- [c26] Benjamin Elizalde, Soham Deshmukh, Mahmoud Al Ismail, Huaming Wang:
CLAP Learning Audio Concepts from Natural Language Supervision. ICASSP 2023: 1-5
- [c25] Daniel Tompkins, Dimitra Emmanouilidou, Soham Deshmukh, Benjamin Elizalde:
Multi-View Learning for Speech Emotion Recognition with Categorical Emotion, Categorical Sentiment, and Dimensional Scores. ICASSP 2023: 1-5
- [c24] Soham Deshmukh, Benjamin Elizalde, Huaming Wang:
Audio Retrieval with WavText5K and CLAP Training. INTERSPEECH 2023: 2948-2952
- [c23] Soham Deshmukh, Benjamin Elizalde, Rita Singh, Huaming Wang:
Pengi: An Audio Language Model for Audio Tasks. NeurIPS 2023
- [i23] Laurie M. Heller, Benjamin Elizalde, Bhiksha Raj, Soham Deshmukh:
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session. CoRR abs/2302.09719 (2023)
- [i22] Soham Deshmukh, Benjamin Elizalde, Rita Singh, Huaming Wang:
Pengi: An Audio Language Model for Audio Tasks. CoRR abs/2305.11834 (2023)
- [i21] Benjamin Elizalde, Soham Deshmukh, Huaming Wang:
Natural Language Supervision for General-Purpose Audio Representations. CoRR abs/2309.05767 (2023)
- [i20] Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. CoRR abs/2309.07372 (2023)
- [i19] Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Prompting Audios Using Acoustic Properties For Emotion Representation. CoRR abs/2310.02298 (2023)
- 2022
- [i18] Benjamin Elizalde, Soham Deshmukh, Mahmoud Al Ismail, Huaming Wang:
CLAP: Learning Audio Concepts From Natural Language Supervision. CoRR abs/2206.04769 (2022)
- [i17] Soham Deshmukh, Benjamin Elizalde, Huaming Wang:
Audio Retrieval with WavText5K and CLAP Training. CoRR abs/2209.14275 (2022)
- [i16] Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Describing emotions with acoustic property prompts for speech emotion recognition. CoRR abs/2211.07737 (2022)
- 2021
- [c22] Benjamin Elizalde, Radu Revutchi, Samarjit Das, Bhiksha Raj, Ian R. Lane, Laurie M. Heller:
Identifying Actions for Sound Event Classification. WASPAA 2021: 26-30
- [e2] Frederic Font, Annamaria Mesaros, Daniel P. W. Ellis, Eduardo Fonseca, Magdalena Fuentes, Benjamin Elizalde:
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), Online, November 15-19, 2021. 2021, ISBN 978-84-09-36072-7
- [i15] Benjamin Elizalde, Radu Revutchi, Samarjit Das, Bhiksha Raj, Ian R. Lane, Laurie M. Heller:
Identifying Actions for Sound Event Classification. CoRR abs/2104.12693 (2021)
- [i14] Benjamin Elizalde, Daniel Tompkins:
COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge. CoRR abs/2105.10619 (2021)
- 2020
- [b1] Benjamin Elizalde:
Never-Ending Learning of Sounds. Carnegie Mellon University, USA, 2020
- [c21] Jianyu Fan, Eric Nichols, Daniel Tompkins, Ana Elisa Méndez Méndez, Benjamin Elizalde, Philippe Pasquier:
Multi-Label Sound Event Retrieval Using A Deep Learning-Based Siamese Structure With A Pairwise Presence Matrix. ICASSP 2020: 3482-3486
- [i13] Jianyu Fan, Eric Nichols, Daniel Tompkins, Ana Elisa Méndez Méndez, Benjamin Elizalde, Philippe Pasquier:
Multi-label Sound Event Retrieval Using a Deep Learning-based Siamese Structure with a Pairwise Presence Matrix. CoRR abs/2002.09026 (2020)
2010 – 2019
- 2019
- [j3] Annamaria Mesaros, Aleksandr Diment, Benjamin Elizalde, Toni Heittola, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
Sound Event Detection in the DCASE 2017 Challenge. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 992-1006 (2019)
- [c20] Benjamin Elizalde, Shuayb Zarar, Bhiksha Raj:
Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio. ICASSP 2019: 4095-4099
- 2018
- [j2] Sebastian Säger, Benjamin Elizalde, Damian Borth, Christian Schulze, Bhiksha Raj, Ian R. Lane:
AudioPairBank: towards a large-scale tag-pair-based audio content analysis. EURASIP J. Audio Speech Music. Process. 2018: 12 (2018)
- [c19] Abelino Jimenez, Benjamin Elizalde, Bhiksha Raj:
Acoustic Scene Classification Using Discrete Random Hashing for Laplacian Kernel Machines. ICASSP 2018: 146-150
- [c18] Rohan Badlani, Ankit Shah, Benjamin Elizalde, Anurag Kumar, Bhiksha Raj:
Framework for Evaluation of Sound Event Detection in Web Videos. ICASSP 2018: 3096-3100
- [c17] Pranay Manocha, Rohan Badlani, Anurag Kumar, Ankit Shah, Benjamin Elizalde, Bhiksha Raj:
Content-Based Representations of Audio Using Siamese Neural Networks. ICASSP 2018: 3136-3140
- [i12] Abelino Jimenez, Benjamin Elizalde, Bhiksha Raj:
DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features. CoRR abs/1801.02690 (2018)
- [i11] Benjamin Elizalde, Rohan Badlani, Ankit Shah, Anurag Kumar, Bhiksha Raj:
NELS - Never-Ending Learner of Sounds. CoRR abs/1801.05544 (2018)
- 2017
- [c16] Abelino Jimenez, Benjamin Elizalde, Bhiksha Raj:
DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features. DCASE 2017: 55-58
- [c15] Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Benjamin Elizalde, Ankit Shah, Emmanuel Vincent, Bhiksha Raj, Tuomas Virtanen:
DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System. DCASE 2017: 85-92
- [c14] Benjamin Elizalde, Ankit Shah, Siddharth Dalmia, Min Hun Lee, Rohan Badlani, Anurag Kumar, Bhiksha Raj, Ian R. Lane:
An approach for self-training audio event detectors using web data. EUSIPCO 2017: 1863-1867
- [c13] Anurag Kumar, Benjamin Elizalde, Bhiksha Raj:
Audio Content Based Geotagging in Multimedia. INTERSPEECH 2017: 1874-1878
- [e1] Tuomas Virtanen, Annamaria Mesaros, Toni Heittola, Aleksandr Diment, Emmanuel Vincent, Emmanouil Benetos, Benjamin Elizalde:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2017, Munich, Germany, November 16-17, 2017. 2017, ISBN 978-952-15-4042-4
- [i10] Mirco Ravanelli, Benjamin Elizalde, Karl Ni, Gerald Friedland:
Audio Concept Classification with Hierarchical Deep Neural Networks. CoRR abs/1710.04288 (2017)
- [i9] Pranay Manocha, Rohan Badlani, Anurag Kumar, Ankit Shah, Benjamin Elizalde, Bhiksha Raj:
Content-based Representations of audio using Siamese neural networks. CoRR abs/1710.10974 (2017)
- [i8] Rohan Badlani, Ankit Shah, Benjamin Elizalde, Anurag Kumar, Bhiksha Raj:
Framework for evaluation of sound event detection in web videos. CoRR abs/1711.00804 (2017)
- 2016
- [j1] Bart Thomee, David A. Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, Li-Jia Li:
YFCC100M: the new data in multimedia research. Commun. ACM 59(2): 64-73 (2016)
- [c12] Benjamin Elizalde, Guan-Lin Chao, Ming Zeng, Ian R. Lane:
City-Identification of Flickr Videos Using Semantic Acoustic Features. BigMM 2016: 303-306
- [c11] Benjamin Elizalde, Anurag Kumar, Ankit Shah, Rohan Badlani, Emmanuel Vincent, Bhiksha Raj, Ian R. Lane:
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording. DCASE 2016: 20-24
- [i7] Anurag Kumar, Benjamin Elizalde, Bhiksha Raj:
Audio Content based Geotagging in Multimedia. CoRR abs/1606.02816 (2016)
- [i6] Benjamin Elizalde, Guan-Lin Chao, Ming Zeng, Ian R. Lane:
City-Identification of Flickr Videos Using Semantic Acoustic Features. CoRR abs/1607.03257 (2016)
- [i5] Sebastian Säger, Damian Borth, Benjamin Elizalde, Christian Schulze, Bhiksha Raj, Ian R. Lane, Andreas Dengel:
AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis. CoRR abs/1607.03766 (2016)
- [i4] Benjamin Elizalde, Anurag Kumar, Ankit Shah, Rohan Badlani, Emmanuel Vincent, Bhiksha Raj, Ian R. Lane:
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording. CoRR abs/1607.06706 (2016)
- [i3] Ankit Shah, Rohan Badlani, Anurag Kumar, Benjamin Elizalde, Bhiksha Raj:
An Approach for Self-Training Audio Event Detectors Using Web Data. CoRR abs/1609.06026 (2016)
- 2015
- [c10] Khalid Ashraf, Benjamin Elizalde, Forrest N. Iandola, Matthew W. Moskewicz, Julia Bernd, Gerald Friedland, Kurt Keutzer:
Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling. ICMR 2015: 611-614
- [c9] Julia Bernd, Damian Borth, Carmen Carrano, Jaeyoung Choi, Benjamin Elizalde, Gerald Friedland, Luke R. Gottlieb, Karl Ni, Roger A. Pearce, Douglas Poland, Khalid Ashraf, David A. Shamma, Bart Thomee:
Kickstarting the Commons: The YFCC100M and the YLI Corpora. MMCommons@ACM Multimedia 2015: 1-6
- [c8] Mirco Ravanelli, Benjamin Elizalde, Julia Bernd, Gerald Friedland:
Insights into Audio-Based Multimedia Event Classification with Neural Networks. MMCommons@ACM Multimedia 2015: 19-23
- [i2] Bart Thomee, David A. Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, Li-Jia Li:
The New Data and New Challenges in Multimedia Research. CoRR abs/1503.01817 (2015)
- [i1] Julia Bernd, Damian Borth, Benjamin Elizalde, Gerald Friedland, Heather Gallagher, Luke R. Gottlieb, Adam Janin, Sara Karabashlieva, Jocelyn Takahashi, Jennifer Won:
The YLI-MED Corpus: Characteristics, Procedures, and Plans. CoRR abs/1503.04250 (2015)
- 2014
- [c7] Mirco Ravanelli, Benjamin Elizalde, Karl Ni, Gerald Friedland:
Audio concept classification with Hierarchical Deep Neural Networks. EUSIPCO 2014: 606-610
- [c6] Benjamin Elizalde, Mirco Ravanelli, Karl Ni, Damian Borth, Gerald Friedland:
Audio-concept features and hidden Markov models for multimedia event detection. SLAM@INTERSPEECH 2014: 3-8
- [c5] Jaeyoung Choi, Bart Thomee, Gerald Friedland, Liangliang Cao, Karl Ni, Damian Borth, Benjamin Elizalde, Luke R. Gottlieb, Carmen Carrano, Roger A. Pearce, Douglas Poland:
The Placing Task: A Large-Scale Geo-Estimation Challenge for Social-Media Videos and Images. GeoMM 2014: 27-31
- 2013
- [c4] Benjamin Elizalde, Gerald Friedland:
Lost in segmentation: Three approaches for speech/non-speech detection in consumer-produced videos. ICME 2013: 1-6
- [c3] Benjamin Elizalde, Mirco Ravanelli, Gerald Friedland:
Audio Concept Ranking for Video Event Detection on User-Generated Content. SLAM@INTERSPEECH 2013: 9-14
- [c2] Benjamin Elizalde, Howard Lei, Gerald Friedland:
An i-Vector Representation of Acoustic Environments for Audio-Based Video Event Detection on User Generated Content. ISM 2013: 114-117
- 2012
- [c1]Hui Cheng, Jingen Liu, Saad Ali, Omar Javed, Qian Yu, Amir Tamrakar, Ajay Divakaran, Harpreet S. Sawhney, R. Manmatha, James Allan, Alexander G. Hauptmann, Mubarak Shah, Subhabrata Bhattacharya, Afshin Dehghan, Gerald Friedland, Benjamin Elizalde, Trevor Darrell, Michael Witbrock, Jon Curtis:
SRI-Sarnoff AURORA System at TRECVID 2012 Multimedia Event Detection and Recounting. TRECVID 2012
last updated on 2024-11-11 21:25 CET by the dblp team
all metadata released as open data under CC0 1.0 license