Kalin Stefanov
2020 – today
- 2024
- [c22] Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, Tom Gedeon, Kalin Stefanov:
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset. ACM Multimedia 2024: 7414-7423
- [c21] Zhixi Cai, Abhinav Dhall, Shreya Ghosh, Munawar Hayat, Dimitrios Kollias, Kalin Stefanov, Usman Tariq:
1M-Deepfakes Detection Challenge. ACM Multimedia 2024: 11355-11359
- [i15] Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Abhinav Dhall, Kalin Stefanov:
HistoHDR-Net: Histogram Equalization for Single LDR to HDR Image Translation. CoRR abs/2402.06692 (2024)
- [i14] Mahsa Salehi, Kalin Stefanov, Ehsan Shareghi:
Human Brain Exhibits Distinct Patterns When Listening to Fake Versus Real Audio: Preliminary Evidence. CoRR abs/2402.14982 (2024)
- [i13] Hrishav Bakul Barua, Kalin Stefanov, KokSheik Wong, Abhinav Dhall, Ganesh Krishnasamy:
GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction. CoRR abs/2403.17837 (2024)
- [i12] Zhixi Cai, Abhinav Dhall, Shreya Ghosh, Munawar Hayat, Dimitrios Kollias, Kalin Stefanov, Usman Tariq:
1M-Deepfakes Detection Challenge. CoRR abs/2409.06991 (2024)
- [i11] Hrishav Bakul Barua, Kalin Stefanov, Lemuel Lai En Che, Abhinav Dhall, Koksheik Wong, Ganesh Krishnasamy:
A Cycle Ride to HDR: Semantics Aware Self-Supervised Framework for Unpaired LDR-to-HDR Image Translation. CoRR abs/2410.15068 (2024)
- 2023
- [j3] Zhixi Cai, Shreya Ghosh, Abhinav Dhall, Tom Gedeon, Kalin Stefanov, Munawar Hayat:
Glitch in the matrix: A large scale benchmark for content driven audio-visual forgery detection and localization. Comput. Vis. Image Underst. 236: 103818 (2023)
- [c20] Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Kalin Stefanov, Abhinav Dhall:
ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation. APSIPA ASC 2023: 806-812
- [c19] Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, Munawar Hayat:
MARLIN: Masked Autoencoder for facial video Representation LearnINg. CVPR 2023: 1493-1504
- [i10] Zhixi Cai, Shreya Ghosh, Abhinav Dhall, Tom Gedeon, Kalin Stefanov, Munawar Hayat:
"Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization. CoRR abs/2305.01979 (2023)
- [i9] Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction. CoRR abs/2307.06701 (2023)
- [i8] Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Kalin Stefanov, Abhinav Dhall:
ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation. CoRR abs/2309.03827 (2023)
- [i7] Zhixi Cai, Shreya Ghosh, Aman Pankaj Adatia, Munawar Hayat, Abhinav Dhall, Kalin Stefanov:
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset. CoRR abs/2311.15308 (2023)
- 2022
- [c18] Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation. BMVC 2022: 636
- [c17] Garima Sharma, Kalin Stefanov, Abhinav Dhall, Jianfei Cai:
Graph-based Group Modelling for Backchannel Detection. ACM Multimedia 2022: 7190-7194
- [i6] Zhixi Cai, Kalin Stefanov, Abhinav Dhall, Munawar Hayat:
Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization. CoRR abs/2204.06228 (2022)
- [i5] Kalin Stefanov, Bhawna Paliwal, Abhinav Dhall:
Visual Representations of Physiological Signals for Fake Video Detection. CoRR abs/2207.08380 (2022)
- [i4] Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi:
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation. CoRR abs/2208.04554 (2022)
- [i3] Zhixi Cai, Shreya Ghosh, Kalin Stefanov, Abhinav Dhall, Jianfei Cai, Hamid Rezatofighi, Reza Haffari, Munawar Hayat:
MARLIN: Masked Autoencoder for facial video Representation LearnINg. CoRR abs/2211.06627 (2022)
- 2021
- [c16] Christopher Birmingham, Maja J. Mataric, Kalin Stefanov:
Group-Level Focus of Visual Attention for Improved Active Speaker Detection. ICMI Companion 2021: 37-42
- [c15] Chris Birmingham, Kalin Stefanov, Maja J. Mataric:
Group-Level Focus of Visual Attention for Improved Next Speaker Prediction. ACM Multimedia 2021: 4838-4842
- 2020
- [j2] Kalin Stefanov, Jonas Beskow, Giampiero Salvi:
Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially Aware Language Acquisition. IEEE Trans. Cogn. Dev. Syst. 12(2): 250-259 (2020)
- [c14] Su Lei, Kalin Stefanov, Jonathan Gratch:
Emotion or expressivity? An automated analysis of nonverbal perception in a social dilemma. FG 2020: 544-551
- [c13] Leili Tavabi, Kalin Stefanov, Larry Zhang, Brian Borsari, Joshua D. Woolley, Stefan Scherer, Mohammad Soleymani:
Multimodal Automatic Coding of Client Behavior in Motivational Interviewing. ICMI 2020: 406-413
- [c12] Kalin Stefanov, Baiyu Huang, Zongjian Li, Mohammad Soleymani:
OpenSense: A Platform for Multimodal Data Acquisition and Behavior Perception. ICMI 2020: 660-664
- [c11] Kalin Stefanov, Mohammad Adiban, Giampiero Salvi:
Spatial Bias in Vision-Based Voice Activity Detection. ICPR 2020: 10433-10440
2010 – 2019
- 2019
- [j1] Kalin Stefanov, Giampiero Salvi, Dimosthenis Kontogiorgos, Hedvig Kjellström, Jonas Beskow:
Modeling of Human Visual Attention in Multiparty Open-World Dialogues. ACM Trans. Hum. Robot Interact. 8(2): 8:1-8:21 (2019)
- [c10] Kalin Stefanov, Mayumi Bono:
Towards Digitally-Mediated Sign Language Communication. HAI 2019: 286-288
- [c9] Mohammad Soleymani, Kalin Stefanov, Sin-Hwa Kang, Jan Ondras, Jonathan Gratch:
Multimodal Analysis and Estimation of Intimate Self-Disclosure. ICMI 2019: 59-68
- [c8] Leili Tavabi, Kalin Stefanov, Setareh Nasihati Gilani, David R. Traum, Mohammad Soleymani:
Multimodal Learning for Identifying Opportunities for Empathetic Responses. ICMI 2019: 95-104
- 2018
- [b1] Kalin Stefanov:
Recognition and Generation of Communicative Signals: Modeling of Hand Gestures, Speech Activity and Eye-Gaze in Human-Machine Interaction. Royal Institute of Technology, Stockholm, Sweden, 2018
- [i2] Kalin Stefanov:
Webcam-based Eye Gaze Tracking under Natural Head Movement. CoRR abs/1803.11088 (2018)
- 2017
- [i1] Kalin Stefanov, Jonas Beskow, Giampiero Salvi:
Self-Supervised Vision-Based Detection of the Active Speaker as a Prerequisite for Socially-Aware Language Acquisition. CoRR abs/1711.08992 (2017)
- 2016
- [c7] Kalin Stefanov, Akihiro Sugimoto, Jonas Beskow:
Look who's talking: visual identification of the active speaker in multi-party human-robot interaction. ASSP4MI@ICMI 2016: 22-27
- [c6] Kalin Stefanov, Jonas Beskow:
A Multi-party Multi-modal Dataset for Focus of Visual Attention in Human-human and Human-robot Interaction. LREC 2016
- 2015
- [c5] Mathieu Chollet, Kalin Stefanov, Helmut Prendinger, Stefan Scherer:
Public Speaking Training with a Multimodal Interactive Virtual Audience Framework. ICMI 2015: 367-368
- 2014
- [c4] Samer Al Moubayed, Jonas Beskow, Bajibabu Bollepalli, Joakim Gustafson, Ahmed Hussen Abdelaziz, Martin Johansson, Maria Koutsombogera, José David Águas Lopes, Jekaterina Novikova, Catharine Oertel, Gabriel Skantze, Kalin Stefanov, Gül Varol:
Human-robot collaborative tutoring using multiparty multimodal spoken dialogue. HRI 2014: 112-113
- [c3] Maria Koutsombogera, Samer Al Moubayed, Bajibabu Bollepalli, Ahmed Hussen Abdelaziz, Martin Johansson, José David Águas Lopes, Jekaterina Novikova, Catharine Oertel, Kalin Stefanov, Gül Varol:
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue. LREC 2014: 4196-4201
- 2013
- [c2] Samer Al Moubayed, Jonas Beskow, Bajibabu Bollepalli, Ahmed Hussen Abdelaziz, Martin Johansson, Maria Koutsombogera, José David Águas Lopes, Jekaterina Novikova, Catharine Oertel, Gabriel Skantze, Kalin Stefanov, Gül Varol:
Tutoring Robots - Multiparty Multimodal Social Dialogue with an Embodied Tutor. eNTERFACE 2013: 80-113
- 2012
- [c1] Samer Al Moubayed, Gabriel Skantze, Jonas Beskow, Kalin Stefanov, Joakim Gustafson:
Multimodal multiparty social interaction with the furhat head. ICMI 2012: 293-294
last updated on 2024-11-27 20:33 CET by the dblp team
all metadata released as open data under CC0 1.0 license