default search action
Dara Bahri
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c25]Behrooz Tahmasebi, Ashkan Soleymani, Dara Bahri, Stefanie Jegelka, Patrick Jaillet:
A Universal Class of Sharpness-Aware Minimization Algorithms. ICML 2024 - [i29]Behrooz Tahmasebi, Ashkan Soleymani, Dara Bahri, Stefanie Jegelka, Patrick Jaillet:
A Universal Class of Sharpness-Aware Minimization Algorithms. CoRR abs/2406.03682 (2024) - [i28]Dara Bahri, John Wieting, Dana Alon, Donald Metzler:
A Watermark for Black-Box Language Models. CoRR abs/2410.02099 (2024) - 2023
- [j2]Yi Tay, Mostafa Dehghani, Dara Bahri, Donald Metzler:
Efficient Transformers: A Survey. ACM Comput. Surv. 55(6): 109:1-109:28 (2023) - [c24]Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler:
UL2: Unifying Language Learning Paradigms. ICLR 2023 - [c23]Maksym Andriushchenko, Dara Bahri, Hossein Mobahi, Nicolas Flammarion:
Sharpness-Aware Minimization Leads to Low-Rank Features. NeurIPS 2023 - [c22]Dara Bahri, Che Zheng, Yi Tay, Donald Metzler, Andrew Tomkins:
Surprise: Result List Truncation via Extreme Value Theory. SIGIR 2023: 2404-2408 - [i27]Maksym Andriushchenko, Dara Bahri, Hossein Mobahi, Nicolas Flammarion:
Sharpness-Aware Minimization Leads to Low-Rank Features. CoRR abs/2305.16292 (2023) - 2022
- [c21]Kai Hui, Honglei Zhuang, Tao Chen, Zhen Qin, Jing Lu, Dara Bahri, Ji Ma, Jai Prakash Gupta, Cícero Nogueira dos Santos, Yi Tay, Donald Metzler:
ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference. ACL (Findings) 2022: 3747-3758 - [c20]Dara Bahri, Hossein Mobahi, Yi Tay:
Sharpness-Aware Minimization Improves Language Model Generalization. ACL (1) 2022: 7360-7371 - [c19]Vamsi Aribandi, Yi Tay, Tal Schuster, Jinfeng Rao, Huaixiu Steven Zheng, Sanket Vaibhav Mehta, Honglei Zhuang, Vinh Q. Tran, Dara Bahri, Jianmo Ni, Jai Prakash Gupta, Kai Hui, Sebastian Ruder, Donald Metzler:
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning. ICLR 2022 - [c18]Dara Bahri, Heinrich Jiang, Yi Tay, Donald Metzler:
Scarf: Self-Supervised Contrastive Learning using Random Feature Corruption. ICLR 2022 - [c17]Heinrich Jiang, Harikrishna Narasimhan, Dara Bahri, Andrew Cotter, Afshin Rostamizadeh:
Churn Reduction via Distillation. ICLR 2022 - [c16]Yi Tay, Vinh Q. Tran, Sebastian Ruder, Jai Prakash Gupta, Hyung Won Chung, Dara Bahri, Zhen Qin, Simon Baumgartner, Cong Yu, Donald Metzler:
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization. ICLR 2022 - [c15]Tal Schuster, Adam Fisch, Jai Gupta, Mostafa Dehghani, Dara Bahri, Vinh Tran, Yi Tay, Donald Metzler:
Confident Adaptive Language Modeling. NeurIPS 2022 - [c14]Yi Tay, Vinh Tran, Mostafa Dehghani, Jianmo Ni, Dara Bahri, Harsh Mehta, Zhen Qin, Kai Hui, Zhe Zhao, Jai Prakash Gupta, Tal Schuster, William W. Cohen, Donald Metzler:
Transformer Memory as a Differentiable Search Index. NeurIPS 2022 - [i26]Yi Tay, Vinh Q. Tran, Mostafa Dehghani, Jianmo Ni, Dara Bahri, Harsh Mehta, Zhen Qin, Kai Hui, Zhe Zhao, Jai Prakash Gupta, Tal Schuster, William W. Cohen, Donald Metzler:
Transformer Memory as a Differentiable Search Index. CoRR abs/2202.06991 (2022) - [i25]Kai Hui, Honglei Zhuang, Tao Chen, Zhen Qin, Jing Lu, Dara Bahri, Ji Ma, Jai Prakash Gupta, Cícero Nogueira dos Santos, Yi Tay, Don Metzler:
ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference. CoRR abs/2204.11458 (2022) - [i24]Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Neil Houlsby, Donald Metzler:
Unifying Language Learning Paradigms. CoRR abs/2205.05131 (2022) - [i23]Tal Schuster, Adam Fisch, Jai Prakash Gupta, Mostafa Dehghani, Dara Bahri, Vinh Q. Tran, Yi Tay, Donald Metzler:
Confident Adaptive Language Modeling. CoRR abs/2207.07061 (2022) - [i22]Dara Bahri, Heinrich Jiang, Tal Schuster, Afshin Rostamizadeh:
Is margin all you need? An extensive empirical study of active learning on tabular data. CoRR abs/2210.03822 (2022) - 2021
- [j1]Donald Metzler, Yi Tay, Dara Bahri, Marc Najork:
Rethinking search: making domain experts out of dilettantes. SIGIR Forum 55(1): 13:1-13:27 (2021) - [c13]Yi Tay, Mostafa Dehghani, Jai Prakash Gupta, Vamsi Aribandi, Dara Bahri, Zhen Qin, Donald Metzler:
Are Pretrained Convolutions Better than Pretrained Transformers? ACL/IJCNLP (1) 2021: 4349-4359 - [c12]Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler, Aaron C. Courville:
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling. ACL/IJCNLP (1) 2021: 7196-7209 - [c11]Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, Donald Metzler:
Long Range Arena : A Benchmark for Efficient Transformers. ICLR 2021 - [c10]Yi Tay, Zhe Zhao, Dara Bahri, Donald Metzler, Da-Cheng Juan:
HyperGrid Transformers: Towards A Single Model for Multiple Tasks. ICLR 2021 - [c9]Dara Bahri, Heinrich Jiang:
Locally Adaptive Label Smoothing Improves Predictive Churn. ICML 2021: 532-542 - [c8]Yi Tay, Dara Bahri, Donald Metzler, Da-Cheng Juan, Zhe Zhao, Che Zheng:
Synthesizer: Rethinking Self-Attention for Transformer Models. ICML 2021: 10183-10192 - [c7]Yi Tay, Mostafa Dehghani, Vamsi Aribandi, Jai Prakash Gupta, Philip Pham, Zhen Qin, Dara Bahri, Da-Cheng Juan, Donald Metzler:
OmniNet: Omnidirectional Representations from Transformers. ICML 2021: 10193-10202 - [c6]Dara Bahri, Yi Tay, Che Zheng, Cliff Brunk, Donald Metzler, Andrew Tomkins:
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study. WSDM 2021: 301-309 - [i21]Dara Bahri, Heinrich Jiang, Yi Tay, Donald Metzler:
Label Smoothed Embedding Hypothesis for Out-of-Distribution Detection. CoRR abs/2102.05131 (2021) - [i20]Dara Bahri, Heinrich Jiang:
Locally Adaptive Label Smoothing for Predictive Churn. CoRR abs/2102.05140 (2021) - [i19]Yi Tay, Mostafa Dehghani, Vamsi Aribandi, Jai Prakash Gupta, Philip Pham, Zhen Qin, Dara Bahri, Da-Cheng Juan, Donald Metzler:
OmniNet: Omnidirectional Representations from Transformers. CoRR abs/2103.01075 (2021) - [i18]Donald Metzler, Yi Tay, Dara Bahri, Marc Najork:
Rethinking Search: Making Experts out of Dilettantes. CoRR abs/2105.02274 (2021) - [i17]Yi Tay, Mostafa Dehghani, Jai Prakash Gupta, Dara Bahri, Vamsi Aribandi, Zhen Qin, Donald Metzler:
Are Pre-trained Convolutions Better than Pre-trained Transformers? CoRR abs/2105.03322 (2021) - [i16]Heinrich Jiang, Harikrishna Narasimhan, Dara Bahri, Andrew Cotter, Afshin Rostamizadeh:
Churn Reduction via Distillation. CoRR abs/2106.02654 (2021) - [i15]Yi Tay, Vinh Q. Tran, Sebastian Ruder, Jai Prakash Gupta, Hyung Won Chung, Dara Bahri, Zhen Qin, Simon Baumgartner, Cong Yu, Donald Metzler:
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization. CoRR abs/2106.12672 (2021) - [i14]Dara Bahri, Heinrich Jiang, Yi Tay, Donald Metzler:
SCARF: Self-Supervised Contrastive Learning using Random Feature Corruption. CoRR abs/2106.15147 (2021) - [i13]Dara Bahri, Hossein Mobahi, Yi Tay:
Sharpness-Aware Minimization Improves Language Model Generalization. CoRR abs/2110.08529 (2021) - [i12]Vamsi Aribandi, Yi Tay, Tal Schuster, Jinfeng Rao, Huaixiu Steven Zheng, Sanket Vaibhav Mehta, Honglei Zhuang, Vinh Q. Tran, Dara Bahri, Jianmo Ni, Jai Prakash Gupta, Kai Hui, Sebastian Ruder, Donald Metzler:
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning. CoRR abs/2111.10952 (2021) - 2020
- [c5]Yi Tay, Dara Bahri, Che Zheng, Clifford Brunk, Donald Metzler, Andrew Tomkins:
Reverse Engineering Configurations of Neural Text Generation Models. ACL 2020: 275-279 - [c4]Dara Bahri, Heinrich Jiang, Maya R. Gupta:
Deep k-NN for Noisy Labels. ICML 2020: 540-550 - [c3]Yi Tay, Dara Bahri, Liu Yang, Donald Metzler, Da-Cheng Juan:
Sparse Sinkhorn Attention. ICML 2020: 9438-9447 - [c2]Dara Bahri, Yi Tay, Che Zheng, Donald Metzler, Andrew Tomkins:
Choppy: Cut Transformer for Ranked List Truncation. SIGIR 2020: 1513-1516 - [i11]Yi Tay, Dara Bahri, Liu Yang, Donald Metzler, Da-Cheng Juan:
Sparse Sinkhorn Attention. CoRR abs/2002.11296 (2020) - [i10]Yi Tay, Dara Bahri, Che Zheng, Clifford Brunk, Donald Metzler, Andrew Tomkins:
Reverse Engineering Configurations of Neural Text Generation Models. CoRR abs/2004.06201 (2020) - [i9]Dara Bahri, Heinrich Jiang, Maya R. Gupta:
Deep k-NN for Noisy Labels. CoRR abs/2004.12289 (2020) - [i8]Dara Bahri, Yi Tay, Che Zheng, Donald Metzler, Andrew Tomkins:
Choppy: Cut Transformer For Ranked List Truncation. CoRR abs/2004.13012 (2020) - [i7]Yi Tay, Dara Bahri, Donald Metzler, Da-Cheng Juan, Zhe Zhao, Che Zheng:
Synthesizer: Rethinking Self-Attention in Transformer Models. CoRR abs/2005.00743 (2020) - [i6]Yi Tay, Zhe Zhao, Dara Bahri, Donald Metzler, Da-Cheng Juan:
HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections. CoRR abs/2007.05891 (2020) - [i5]Dara Bahri, Yi Tay, Che Zheng, Donald Metzler, Cliff Brunk, Andrew Tomkins:
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study. CoRR abs/2008.13533 (2020) - [i4]Yi Tay, Mostafa Dehghani, Dara Bahri, Donald Metzler:
Efficient Transformers: A Survey. CoRR abs/2009.06732 (2020) - [i3]Dara Bahri, Che Zheng, Yi Tay, Donald Metzler, Andrew Tomkins:
Surprise: Result List Truncation via Extreme Value Theory. CoRR abs/2010.09797 (2020) - [i2]Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, Donald Metzler:
Long Range Arena: A Benchmark for Efficient Transformers. CoRR abs/2011.04006 (2020) - [i1]Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler, Aaron C. Courville:
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling. CoRR abs/2012.00857 (2020)
2010 – 2019
- 2018
- [c1]Maya R. Gupta, Dara Bahri, Andrew Cotter, Kevin Robert Canini:
Diminishing Returns Shape Constraints for Interpretability and Regularization. NeurIPS 2018: 6835-6845
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-11 21:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint