default search action
Paul Voigtlaender
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c21]Sabarinath Mahadevan, Idil Esen Zulfikar, Paul Voigtlaender, Bastian Leibe:
Point-VOS: Pointing Up Video Object Segmentation. CVPR 2024: 22217-22226 - [i23]Idil Esen Zulfikar, Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe:
Point-VOS: Pointing Up Video Object Segmentation. CoRR abs/2402.05917 (2024) - [i22]Divya Kothandaraman, Kihyuk Sohn, Ruben Villegas, Paul Voigtlaender, Dinesh Manocha, Mohammad Babaeizadeh:
Text Prompting for Multi-Concept Video Customization by Autoregressive Generation. CoRR abs/2405.13951 (2024) - [i21]Lucas Beyer, Andreas Steiner, André Susano Pinto, Alexander Kolesnikov, Xiao Wang, Daniel Salz, Maxim Neumann, Ibrahim Alabdulmohsin, Michael Tschannen, Emanuele Bugliarello, Thomas Unterthiner, Daniel Keysers, Skanda Koppula, Fangyu Liu, Adam Grycner, Alexey A. Gritsenko, Neil Houlsby, Manoj Kumar, Keran Rong, Julian Eisenschlos, Rishabh Kabra, Matthias Bauer, Matko Bosnjak, Xi Chen, Matthias Minderer, Paul Voigtlaender, Ioana Bica, Ivana Balazevic, Joan Puigcerver, Pinelopi Papalampidi, Olivier J. Hénaff, Xi Xiong, Radu Soricut, Jeremiah Harmsen, Xiaohua Zhai:
PaliGemma: A versatile 3B VLM for transfer. CoRR abs/2407.07726 (2024) - 2023
- [c20]Paul Voigtlaender, Soravit Changpinyo, Jordi Pont-Tuset, Radu Soricut, Vittorio Ferrari:
Connecting Vision and Language with Video Localized Narratives. CVPR 2023: 2461-2471 - [c19]Emanuele Bugliarello, H. Hernan Moraldo, Ruben Villegas, Mohammad Babaeizadeh, Mohammad Taghi Saffar, Han Zhang, Dumitru Erhan, Vittorio Ferrari, Pieter-Jan Kindermans, Paul Voigtlaender:
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization. NeurIPS 2023 - [c18]Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan:
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video. WACV 2023: 1674-1683 - [i20]Paul Voigtlaender, Soravit Changpinyo, Jordi Pont-Tuset, Radu Soricut, Vittorio Ferrari:
Connecting Vision and Language with Video Localized Narratives. CoRR abs/2302.11217 (2023) - [i19]Emanuele Bugliarello, Hernan Moraldo, Ruben Villegas, Mohammad Babaeizadeh, Mohammad Taghi Saffar, Han Zhang, Dumitru Erhan, Vittorio Ferrari, Pieter-Jan Kindermans, Paul Voigtlaender:
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization. CoRR abs/2308.11606 (2023) - [i18]Xi Chen, Xiao Wang, Lucas Beyer, Alexander Kolesnikov, Jialin Wu, Paul Voigtlaender, Basil Mustafa, Sebastian Goodman, Ibrahim Alabdulmohsin, Piotr Padlewski, Daniel Salz, Xi Xiong, Daniel Vlasic, Filip Pavetic, Keran Rong, Tianli Yu, Daniel Keysers, Xiaohua Zhai, Radu Soricut:
PaLI-3 Vision Language Models: Smaller, Faster, Stronger. CoRR abs/2310.09199 (2023) - 2022
- [i17]Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan:
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video. CoRR abs/2209.12118 (2022) - 2021
- [b1]Paul Voigtlaender:
Video object segmentation and tracking. RWTH Aachen University, Germany, 2021 - [c17]Matej Kristan, Jirí Matas, Ales Leonardis, Michael Felsberg, Roman P. Pflugfelder, Joni-Kristian Kämäräinen, Hyung Jin Chang, Martin Danelljan, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Jani Käpylä, Gustav Häger, Song Yan, Jinyu Yang, Zhongqun Zhang, Gustavo Fernández, Mohamed H. Abdelpakey, Goutam Bhat, Llukman Cerkezi, Hakan Cevikalp, Shengyong Chen, Xin Chen, Miao Cheng, Ziyi Cheng, Yu-Chen Chiu, Ozgun Cirakman, Yutao Cui, Kenan Dai, Mohana Murali Dasari, Qili Deng, Xingping Dong, Daniel K. Du, Matteo Dunnhofer, Zhen-Hua Feng, Zhiyong Feng, Zhihong Fu, Shiming Ge, Rama Krishna Gorthi, Yuzhang Gu, Bilge Günsel, Qing Guo, Filiz Gurkan, Wencheng Han, Yanyan Huang, Felix Järemo Lawin, Shang-Jhih Jhang, Rongrong Ji, Cheng Jiang, Yingjie Jiang, Felix Juefei-Xu, J. Yin, Xiao Ke, Fahad Shahbaz Khan, Byeong Hak Kim, Josef Kittler, Xiangyuan Lan, Jun Ha Lee, Bastian Leibe, Hui Li, Jianhua Li, Xianxian Li, Yuezhou Li, Bo Liu, Chang Liu, Jingen Liu, Li Liu, Qingjie Liu, Huchuan Lu, Wei Lu, Jonathon Luiten, Jie Ma, Ziang Ma, Niki Martinel, Christoph Mayer, Alireza Memarmoghadam, Christian Micheloni, Yuzhen Niu, Danda Pani Paudel, Houwen Peng, Shoumeng Qiu, Aravindh Rajiv, Muhammad Rana, Andreas Robinson, Hasan Saribas, Ling Shao, Mohamed S. Shehata, Furao Shen, Jianbing Shen, Kristian Simonato, Xiaoning Song, Zhangyong Tang, Radu Timofte, Philip H. S. Torr, Chi-Yi Tsai, Bedirhan Uzun, Luc Van Gool, Paul Voigtlaender, Dong Wang, Guangting Wang, Liangliang Wang, Lijun Wang, Limin Wang, Linyuan Wang, Yong Wang, Yunhong Wang, Chenyan Wu, Gangshan Wu, Xiaojun Wu, Fei Xie, Tianyang Xu, Xiang Xu, Wanli Xue, Bin Yan, Wankou Yang, Xiaoyun Yang, Yu Ye, Jun Yin, Chengwei Zhang, Chunhui Zhang, Haitao Zhang, Kaihua Zhang, Kangkai Zhang, Xiaohan Zhang, Xiaolin Zhang, Xinyu Zhang, Zhibin Zhang, Shao-Chuan Zhao, Ming Zhen, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu:
The Ninth Visual Object Tracking VOT2021 Challenge Results. ICCVW 2021: 2711-2738 - [c16]Mark Weber, Jun Xie, Maxwell D. Collins, Yukun Zhu, Paul Voigtlaender, Hartwig Adam, Bradley Green, Andreas Geiger, Bastian Leibe, Daniel Cremers, Aljosa Osep, Laura Leal-Taixé, Liang-Chieh Chen:
STEP: Segmenting and Tracking Every Pixel. NeurIPS Datasets and Benchmarks 2021 - [c15]Paul Voigtlaender, Lishu Luo, Chun Yuan, Yong Jiang, Bastian Leibe:
Reducing the Annotation Effort for Video Object Segmentation Datasets. WACV 2021: 3059-3068 - [i16]Mark Weber, Jun Xie, Maxwell D. Collins, Yukun Zhu, Paul Voigtlaender, Hartwig Adam, Bradley Green, Andreas Geiger, Bastian Leibe, Daniel Cremers, Aljosa Osep, Laura Leal-Taixé, Liang-Chieh Chen:
STEP: Segmenting and Tracking Every Pixel. CoRR abs/2102.11859 (2021) - 2020
- [c14]Paul Voigtlaender, Jonathon Luiten, Philip H. S. Torr, Bastian Leibe:
Siam R-CNN: Visual Tracking by Re-Detection. CVPR 2020: 6577-6587 - [c13]Aljosa Osep, Paul Voigtlaender, Mark Weber, Jonathon Luiten, Bastian Leibe:
4D Generic Video Object Proposals. ICRA 2020: 10031-10037 - [i15]Paul Voigtlaender, Lishu Luo, Chun Yuan, Yong Jiang, Bastian Leibe:
Reducing the Annotation Effort for Video Object Segmentation Datasets. CoRR abs/2011.01142 (2020)
2010 – 2019
- 2019
- [c12]Paul Voigtlaender, Michael Krause, Aljosa Osep, Jonathon Luiten, Berin Balachandar Gnana Sekar, Andreas Geiger, Bastian Leibe:
MOTS: Multi-Object Tracking and Segmentation. CVPR 2019: 7942-7951 - [c11]Paul Voigtlaender, Yuning Chai, Florian Schroff, Hartwig Adam, Bastian Leibe, Liang-Chieh Chen:
FEELVOS: Fast End-To-End Embedding Learning for Video Object Segmentation. CVPR 2019: 9481-9490 - [c10]Jonathon Luiten, Paul Voigtlaender, Bastian Leibe:
Exploring the Combination of PReMVOS, BoLTVOS and UnOVOST for the 2019 YouTube-VOS Challenge. ICCV Workshops 2019: 705-708 - [c9]Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe:
Large-Scale Object Mining for Object Discovery from Unlabeled Video. ICRA 2019: 5502-5508 - [i14]Aljosa Osep, Paul Voigtlaender, Mark Weber, Jonathon Luiten, Bastian Leibe:
4D Generic Video Object Proposals. CoRR abs/1901.09260 (2019) - [i13]Paul Voigtlaender, Michael Krause, Aljosa Osep, Jonathon Luiten, Berin Balachandar Gnana Sekar, Andreas Geiger, Bastian Leibe:
MOTS: Multi-Object Tracking and Segmentation. CoRR abs/1902.03604 (2019) - [i12]Paul Voigtlaender, Yuning Chai, Florian Schroff, Hartwig Adam, Bastian Leibe, Liang-Chieh Chen:
FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation. CoRR abs/1902.09513 (2019) - [i11]Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe:
Large-Scale Object Mining for Object Discovery from Unlabeled Video. CoRR abs/1903.00362 (2019) - [i10]Paul Voigtlaender, Jonathon Luiten, Bastian Leibe:
BoLTVOS: Box-Level Tracking for Video Object Segmentation. CoRR abs/1904.04552 (2019) - [i9]Paul Voigtlaender, Jonathon Luiten, Philip H. S. Torr, Bastian Leibe:
Siam R-CNN: Visual Tracking by Re-Detection. CoRR abs/1911.12836 (2019) - 2018
- [c8]Jonathon Luiten, Paul Voigtlaender, Bastian Leibe:
PReMVOS: Proposal-Generation, Refinement and Merging for Video Object Segmentation. ACCV (4) 2018: 565-580 - [c7]Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe:
Iteratively Trained Interactive Segmentation. BMVC 2018: 212 - [c6]Aljosa Osep, Wolfgang Mehner, Paul Voigtlaender, Bastian Leibe:
Track, Then Decide: Category-Agnostic Vision-Based Multi-Object Tracking. ICRA 2018: 1-8 - [i8]Sabarinath Mahadevan, Paul Voigtlaender, Bastian Leibe:
Iteratively Trained Interactive Segmentation. CoRR abs/1805.04398 (2018) - [i7]Jonathon Luiten, Paul Voigtlaender, Bastian Leibe:
PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation. CoRR abs/1807.09190 (2018) - [i6]Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe:
Towards Large-Scale Video Video Object Mining. CoRR abs/1809.07316 (2018) - 2017
- [c5]Paul Voigtlaender, Bastian Leibe:
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation. BMVC 2017 - [c4]Albert Zeyer, Patrick Doetsch, Paul Voigtlaender, Ralf Schlüter, Hermann Ney:
A comprehensive study of deep bidirectional LSTM RNNS for acoustic modeling in speech recognition. ICASSP 2017: 2462-2466 - [c3]Patrick Doetsch, Albert Zeyer, Paul Voigtlaender, Ilia Kulikov, Ralf Schlüter, Hermann Ney:
Returnn: The RWTH extensible training framework for universal recurrent neural networks. ICASSP 2017: 5345-5349 - [i5]Paul Voigtlaender, Bastian Leibe:
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation. CoRR abs/1706.09364 (2017) - [i4]Aljosa Osep, Wolfgang Mehner, Paul Voigtlaender, Bastian Leibe:
Track, then Decide: Category-Agnostic Vision-based Multi-Object Tracking. CoRR abs/1712.07920 (2017) - [i3]Aljosa Osep, Paul Voigtlaender, Jonathon Luiten, Stefan Breuers, Bastian Leibe:
Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video. CoRR abs/1712.08832 (2017) - 2016
- [c2]Paul Voigtlaender, Patrick Doetsch, Hermann Ney:
Handwriting Recognition with Large Multidimensional Long Short-Term Memory Recurrent Neural Networks. ICFHR 2016: 228-233 - [i2]Albert Zeyer, Patrick Doetsch, Paul Voigtlaender, Ralf Schlüter, Hermann Ney:
A Comprehensive Study of Deep Bidirectional LSTM RNNs for Acoustic Modeling in Speech Recognition. CoRR abs/1606.06871 (2016) - [i1]Patrick Doetsch, Albert Zeyer, Paul Voigtlaender, Ilya Kulikov, Ralf Schlüter, Hermann Ney:
RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks. CoRR abs/1608.00895 (2016) - 2015
- [c1]Paul Voigtlaender, Patrick Doetsch, Simon Wiesler, Ralf Schlüter, Hermann Ney:
Sequence-discriminative training of recurrent neural networks. ICASSP 2015: 2100-2104
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 19:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint