default search action
Thomas Wolf 0008
Person information
- affiliation: Hugging Face, Brooklyn, NY, USA
Other persons with the same name
- Thomas Wolf — disambiguation page
- Thomas Wolf 0001 — Leibniz Institute for Natural Product Research and Infection Biology, Jena, Germany
- Thomas Wolf 0002 — University of Augsburg, FIM Research Center, Germany
- Thomas Wolf 0003 — University of Tübingen, Germany
- Thomas Wolf 0004 — University of Jena, Germany
- Thomas Wolf 0005 — Bayerische Staatsbibliothek, Munich, Germany
- Thomas Wolf 0006 — University of Munich, Germany
- Thomas Wolf 0007 — University of Karlsruhe, Germany
- Thomas Wolf 0009 — TU München, Institute of Automatic Control, Garching, Germany
- Thomas Wolf 0010 — Eidgenössische Technische Hochschule (ETH) Zürich, Switzerland
- Thomas Wolf 0011 — TU München, Institute for Human-Machine Communication, Garching, Germany
- Thomas Wolf 0012 — Brock University, Ontario, Canada
- Thomas Wolf 0013 — Swiss Federal Institute of Technology in Lausanne (EFPL), Switzerland
- Thomas Wolf 0014 — Karlsruher Institut für Technologie, Karlsruhe, Germany
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c17]Grégoire Mialon, Clémentine Fourrier, Thomas Wolf, Yann LeCun, Thomas Scialom:
GAIA: a benchmark for General AI Assistants. ICLR 2024 - [i23]Guilherme Penedo, Hynek Kydlícek, Loubna Ben Allal, Anton Lozhkov, Margaret Mitchell, Colin Raffel, Leandro von Werra, Thomas Wolf:
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale. CoRR abs/2406.17557 (2024) - 2023
- [j3]Alexandros Karargyris, Renato Umeton, Micah J. Sheller, Alejandro Aristizabal, Johnu George, Anna Wuest, Sarthak Pati, Hasan Kassem, Maximilian Zenk, Ujjwal Baid, Prakash Narayana Moorthy, Alexander Chowdhury, Junyi Guo, Sahil S. Nalawade, Jacob Rosenthal, David Kanter, Maria Xenochristou, Daniel J. Beutel, Verena Chung, Timothy Bergquist, James A. Eddy, Abubakar Abid, Lewis Tunstall, Omar Sanseviero, Dimitrios Dimitriadis, Yiming Qian, Xinxing Xu, Yong Liu, Rick Siow Mong Goh, Srini Bala, Victor Bittorf, Sreekar Reddy Puchala, Biagio Ricciuti, Soujanya Samineni, Eshna Sengupta, Akshay Chaudhari, Cody Coleman, Bala Desinghu, Gregory F. Diamos, Debo Dutta, Diane Feddema, Grigori Fursin, Xinyuan Huang, Satyananda Kashyap, Nicholas D. Lane, Indranil Mallick, Pietro Mascagni, Virendra Mehta, Cassiano Ferro Moraes, Vivek Natarajan, Nikola Nikolov, Nicolas Padoy, Gennady Pekhimenko, Vijay Janapa Reddi, G. Anthony Reina, Pablo Ribalta, Abhishek Singh, Jayaraman J. Thiagarajan, Jacob Albrecht, Thomas Wolf, Geralyn Miller, Huazhu Fu, Prashant Shah, Daguang Xu, Poonam Yadav, David Talby, Mark M. Awad, Jeremy P. Howard, Michael Rosenthal, Luigi Marchionni, Massimo Loda, Jason M. Johnson, Spyridon Bakas, Peter Mattson:
Federated benchmarking of medical artificial intelligence with MedPerf. Nat. Mac. Intell. 5(7): 799-810 (2023) - [j2]Denis Kocetkov, Raymond Li, Loubna Ben Allal, Jia Li, Chenghao Mou, Yacine Jernite, Margaret Mitchell, Carlos Muñoz Ferrandis, Sean Hughes, Thomas Wolf, Dzmitry Bahdanau, Leandro von Werra, Harm de Vries:
The Stack: 3 TB of permissively licensed source code. Trans. Mach. Learn. Res. 2023 (2023) - [j1]Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, Benjamin Lipkin, Muhtasham Oblokulov, Zhiruo Wang, Rudra Murthy V, Jason T. Stillerman, Siva Sankalp Patel, Dmitry Abulkhanov, Marco Zocca, Manan Dey, Zhihan Zhang, Nour Fahmy, Urvashi Bhattacharyya, Wenhao Yu, Swayam Singh, Sasha Luccioni, Paulo Villegas, Maxim Kunakov, Fedor Zhdanov, Manuel Romero, Tony Lee, Nadav Timor, Jennifer Ding, Claire Schlesinger, Hailey Schoelkopf, Jan Ebert, Tri Dao, Mayank Mishra, Alex Gu, Jennifer Robinson, Carolyn Jane Anderson, Brendan Dolan-Gavitt, Danish Contractor, Siva Reddy, Daniel Fried, Dzmitry Bahdanau, Yacine Jernite, Carlos Muñoz Ferrandis, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries:
StarCoder: may the source be with you! Trans. Mach. Learn. Res. 2023 (2023) - [c16]Risto Luukkonen, Ville Komulainen, Jouni Luoma, Anni Eskelinen, Jenna Kanerva, Hanna-Mari Kupari, Filip Ginter, Veronika Laippala, Niklas Muennighoff, Aleksandra Piktus, Thomas Wang, Nouamane Tazi, Teven Le Scao, Thomas Wolf, Osma Suominen, Samuli Sairanen, Mikko Merioksa, Jyrki Heinonen, Aija Vahtola, Samuel Antao, Sampo Pyysalo:
FinGPT: Large Generative Models for a Small Language. EMNLP 2023: 2710-2726 - [c15]Niklas Muennighoff, Alexander M. Rush, Boaz Barak, Teven Le Scao, Nouamane Tazi, Aleksandra Piktus, Sampo Pyysalo, Thomas Wolf, Colin A. Raffel:
Scaling Data-Constrained Language Models. NeurIPS 2023 - [i22]Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, Benjamin Lipkin, Muhtasham Oblokulov, Zhiruo Wang, Rudra Murthy V, Jason Stillerman, Siva Sankalp Patel, Dmitry Abulkhanov, Marco Zocca, Manan Dey, Zhihan Zhang, Nour Moustafa-Fahmy, Urvashi Bhattacharyya, Wenhao Yu, Swayam Singh, Sasha Luccioni, Paulo Villegas, Maxim Kunakov, Fedor Zhdanov, Manuel Romero, Tony Lee, Nadav Timor, Jennifer Ding, Claire Schlesinger, Hailey Schoelkopf, Jan Ebert, Tri Dao, Mayank Mishra, Alex Gu, Jennifer Robinson, Carolyn Jane Anderson, Brendan Dolan-Gavitt, Danish Contractor, Siva Reddy, Daniel Fried, Dzmitry Bahdanau, Yacine Jernite, Carlos Muñoz Ferrandis, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries:
StarCoder: may the source be with you! CoRR abs/2305.06161 (2023) - [i21]Niklas Muennighoff, Alexander M. Rush, Boaz Barak, Teven Le Scao, Aleksandra Piktus, Nouamane Tazi, Sampo Pyysalo, Thomas Wolf, Colin Raffel:
Scaling Data-Constrained Language Models. CoRR abs/2305.16264 (2023) - [i20]Lewis Tunstall, Edward Beeching, Nathan Lambert, Nazneen Rajani, Kashif Rasul, Younes Belkada, Shengyi Huang, Leandro von Werra, Clémentine Fourrier, Nathan Habib, Nathan Sarrazin, Omar Sanseviero, Alexander M. Rush, Thomas Wolf:
Zephyr: Direct Distillation of LM Alignment. CoRR abs/2310.16944 (2023) - [i19]Risto Luukkonen, Ville Komulainen, Jouni Luoma, Anni Eskelinen, Jenna Kanerva, Hanna-Mari Kupari, Filip Ginter, Veronika Laippala, Niklas Muennighoff, Aleksandra Piktus, Thomas Wang, Nouamane Tazi, Teven Le Scao, Thomas Wolf, Osma Suominen, Samuli Sairanen, Mikko Merioksa, Jyrki Heinonen, Aija Vahtola, Samuel Antao, Sampo Pyysalo:
FinGPT: Large Generative Models for a Small Language. CoRR abs/2311.05640 (2023) - [i18]Grégoire Mialon, Clémentine Fourrier, Craig Swift, Thomas Wolf, Yann LeCun, Thomas Scialom:
GAIA: a benchmark for General AI Assistants. CoRR abs/2311.12983 (2023) - 2022
- [c14]Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal V. Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Févry, Jason Alan Fries, Ryan Teehan, Teven Le Scao, Stella Biderman, Leo Gao, Thomas Wolf, Alexander M. Rush:
Multitask Prompted Training Enables Zero-Shot Task Generalization. ICLR 2022 - [i17]Alexander Borzunov, Max Ryabinin, Tim Dettmers, Quentin Lhoest, Lucile Saulnier, Michael Diskin, Yacine Jernite, Thomas Wolf:
Training Transformers Together. CoRR abs/2207.03481 (2022) - [i16]Leandro von Werra, Lewis Tunstall, Abhishek Thakur, Alexandra Sasha Luccioni, Tristan Thrush, Aleksandra Piktus, Felix Marty, Nazneen Rajani, Victor Mustar, Helen Ngo, Omar Sanseviero, Mario Sasko, Albert Villanova del Moral, Quentin Lhoest, Julien Chaumond, Margaret Mitchell, Alexander M. Rush, Thomas Wolf, Douwe Kiela:
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements. CoRR abs/2210.01970 (2022) - [i15]Denis Kocetkov, Raymond Li, Loubna Ben Allal, Jia Li, Chenghao Mou, Carlos Muñoz Ferrandis, Yacine Jernite, Margaret Mitchell, Sean Hughes, Thomas Wolf, Dzmitry Bahdanau, Leandro von Werra, Harm de Vries:
The Stack: 3 TB of permissively licensed source code. CoRR abs/2211.15533 (2022) - [i14]Christopher Akiki, Giada Pistilli, Margot Mieskes, Matthias Gallé, Thomas Wolf, Suzana Ilic, Yacine Jernite:
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model. CoRR abs/2212.04960 (2022) - 2021
- [c13]Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario Sasko, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Clément Delangue, Théo Matussière, Lysandre Debut, Stas Bekman, Pierric Cistac, Thibault Goehringer, Victor Mustar, François Lagunas, Alexander M. Rush, Thomas Wolf:
Datasets: A Community Library for Natural Language Processing. EMNLP (Demos) 2021: 175-184 - [c12]Victor Sanh, Thomas Wolf, Yonatan Belinkov, Alexander M. Rush:
Learning from others' mistakes: Avoiding dataset biases without modeling them. ICLR 2021 - [c11]Alexander Borzunov, Max Ryabinin, Tim Dettmers, Quentin Lhoest, Lucile Saulnier, Michael Diskin, Yacine Jernite, Thomas Wolf:
Training Transformers Together. NeurIPS (Competition and Demos) 2021: 335-342 - [c10]Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin, Lucile Saulnier, Quentin Lhoest, Anton Sinitsin, Dmitry Popov, Dmitry V. Pyrkin, Maxim Kashirin, Alexander Borzunov, Albert Villanova del Moral, Denis Mazur, Ilia Kobelev, Yacine Jernite, Thomas Wolf, Gennady Pekhimenko:
Distributed Deep Learning In Open Collaborations. NeurIPS 2021: 7879-7897 - [e2]Nafise Sadat Moosavi, Iryna Gurevych, Angela Fan, Thomas Wolf, Yufang Hou, Ana Marasovic, Sujith Ravi:
Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing, SustaiNLP@EMNLP 2021, Virtual, November 10, 2021. Association for Computational Linguistics 2021, ISBN 978-1-955917-01-8 [contents] - [i13]Michael Diskin, Alexey Bukhtiyarov, Max Ryabinin, Lucile Saulnier, Quentin Lhoest, Anton Sinitsin, Dmitry Popov, Dmitry V. Pyrkin, Maxim Kashirin, Alexander Borzunov, Albert Villanova del Moral, Denis Mazur, Ilia Kobelev, Yacine Jernite, Thomas Wolf, Gennady Pekhimenko:
Distributed Deep Learning in Open Collaborations. CoRR abs/2106.10207 (2021) - [i12]Hao Tan, Jie Lei, Thomas Wolf, Mohit Bansal:
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning. CoRR abs/2106.11250 (2021) - [i11]Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario Sasko, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Clement Delangue, Théo Matussière, Lysandre Debut, Stas Bekman, Pierric Cistac, Thibault Goehringer, Victor Mustar, François Lagunas, Alexander M. Rush, Thomas Wolf:
Datasets: A Community Library for Natural Language Processing. CoRR abs/2109.02846 (2021) - [i10]Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M. Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal V. Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Févry, Jason Alan Fries, Ryan Teehan, Stella Biderman, Leo Gao, Tali Bers, Thomas Wolf, Alexander M. Rush:
Multitask Prompted Training Enables Zero-Shot Task Generalization. CoRR abs/2110.08207 (2021) - 2020
- [c9]Yangfeng Ji, Antoine Bosselut, Thomas Wolf, Asli Celikyilmaz:
The Amazing World of Neural Language Generation. EMNLP (Tutorial Abstracts) 2020: 37-42 - [c8]Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, Rémi Louf, Morgan Funtowicz, Joe Davison, Sam Shleifer, Patrick von Platen, Clara Ma, Yacine Jernite, Julien Plu, Canwen Xu, Teven Le Scao, Sylvain Gugger, Mariama Drame, Quentin Lhoest, Alexander M. Rush:
Transformers: State-of-the-Art Natural Language Processing. EMNLP (Demos) 2020: 38-45 - [c7]Alex Wang, Thomas Wolf:
Overview of the SustaiNLP 2020 Shared Task. SustaiNLP@EMNLP 2020: 174-178 - [c6]Victor Sanh, Thomas Wolf, Alexander M. Rush:
Movement Pruning: Adaptive Sparsity by Fine-Tuning. NeurIPS 2020 - [e1]Nafise Sadat Moosavi, Angela Fan, Vered Shwartz, Goran Glavas, Shafiq R. Joty, Alex Wang, Thomas Wolf:
Proceedings of SustaiNLP: Workshop on Simple and Efficient Natural Language Processing, SustaiNLP@EMNLP 2020, Online, November 20, 2020. Association for Computational Linguistics 2020, ISBN 978-1-952148-77-4 [contents] - [i9]Shaojie Jiang, Thomas Wolf, Christof Monz, Maarten de Rijke:
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation. CoRR abs/2003.11963 (2020) - [i8]Victor Sanh, Thomas Wolf, Alexander M. Rush:
Movement Pruning: Adaptive Sparsity by Fine-Tuning. CoRR abs/2005.07683 (2020) - [i7]Victor Sanh, Thomas Wolf, Yonatan Belinkov, Alexander M. Rush:
Learning from others' mistakes: Avoiding dataset biases without modeling them. CoRR abs/2012.01300 (2020)
2010 – 2019
- 2019
- [c5]Victor Sanh, Thomas Wolf, Sebastian Ruder:
A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks. AAAI 2019: 6949-6956 - [c4]Sergey Golovanov, Rauf Kurbanov, Sergey I. Nikolenko, Kyryl Truskovskyi, Alexander Tselousov, Thomas Wolf:
Large-Scale Transfer Learning for Natural Language Generation. ACL (1) 2019: 6053-6058 - [c3]Sebastian Ruder, Matthew E. Peters, Swabha Swayamdipta, Thomas Wolf:
Transfer Learning in Natural Language Processing. NAACL-HLT (Tutorial Abstracts) 2019: 15-18 - [i6]Thomas Wolf, Victor Sanh, Julien Chaumond, Clement Delangue:
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents. CoRR abs/1901.08149 (2019) - [i5]Victor Sanh, Lysandre Debut, Julien Chaumond, Thomas Wolf:
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR abs/1910.01108 (2019) - [i4]Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, Rémi Louf, Morgan Funtowicz, Jamie Brew:
HuggingFace's Transformers: State-of-the-art Natural Language Processing. CoRR abs/1910.03771 (2019) - 2018
- [c2]Thomas Wolf, Julien Chaumond, Clement Delangue:
Continuous Learning in a Hierarchical Multiscale Neural Network. ACL (2) 2018: 1-7 - [c1]Thomas Wolf, Julien Chaumond, Clement Delangue:
Meta-Learning a Dynamical Language Model. ICLR (Workshop) 2018 - [i3]Thomas Wolf, Julien Chaumond, Clement Delangue:
Meta-Learning a Dynamical Language Model. CoRR abs/1803.10631 (2018) - [i2]Thomas Wolf, Julien Chaumond, Clement Delangue:
Continuous Learning in a Hierarchical Multiscale Neural Network. CoRR abs/1805.05758 (2018) - [i1]Victor Sanh, Thomas Wolf, Sebastian Ruder:
A Hierarchical Multi-task Approach for Learning Embeddings from Semantic Tasks. CoRR abs/1811.06031 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:17 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint