default search action
Joel Hestness
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c12]Vithursan Thangarasa, Mahmoud Salem, Shreyas Saxena, Chen-Yu Leong, Joel Hestness, Sean Lie:
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models. ACL (Findings) 2024: 214-230 - [i18]Vithursan Thangarasa, Mahmoud Salem, Shreyas Saxena, Kevin Leong, Joel Hestness, Sean Lie:
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models. CoRR abs/2403.00952 (2024) - [i17]Nolan Dey, Shane Bergsma, Joel Hestness:
Sparse maximal update parameterization: A holistic approach to sparse training dynamics. CoRR abs/2405.15743 (2024) - [i16]Gurpreet Gosal, Yishi Xu, Gokul Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming Chen, Biswajit Mishra, Natalia Vassilieva, Joel Hestness, Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Onkar Pandit, Satheesh Katipomu, Samta Kamboj, Samujjwal Ghosh, Rahul Pal, Parvez Mullah, Soundar Doraiswamy, Mohamed El Karim Chami, Preslav Nakov:
Bilingual Adaptation of Monolingual Foundation Models. CoRR abs/2407.12869 (2024) - [i15]Gavia Gray, Aman Tiwari, Shane Bergsma, Joel Hestness:
Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers. CoRR abs/2411.00999 (2024) - 2023
- [c11]Vitaliy Chiley, Vithursan Thangarasa, Abhay Gupta, Anshul Samar, Joel Hestness, Dennis DeCoste:
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network. MLSys 2023 - [i14]Nolan Dey, Gurpreet Gosal, Zhiming Chen, Hemant Khachane, William Marshall, Ribhu Pathria, Marvin Tom, Joel Hestness:
Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster. CoRR abs/2304.03208 (2023) - [i13]Zhiqiang Shen, Tianhua Tao, Liqun Ma, Willie Neiswanger, Zhengzhong Liu, Hongyi Wang, Bowen Tan, Joel Hestness, Natalia Vassilieva, Daria Soboleva, Eric P. Xing:
SlimPajama-DC: Understanding Data Combinations for LLM Training. CoRR abs/2309.10818 (2023) - [i12]Nolan Dey, Daria Soboleva, Faisal Al-Khateeb, Bowen Yang, Ribhu Pathria, Hemant Khachane, Shaheer Muhammad, Zhiming Chen, Robert Myers, Jacob Robert Steeves, Natalia Vassilieva, Marvin Tom, Joel Hestness:
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model. CoRR abs/2309.11568 (2023) - [i11]Faisal Al-Khateeb, Nolan Dey, Daria Soboleva, Joel Hestness:
Position Interpolation Improves ALiBi Extrapolation. CoRR abs/2310.13017 (2023) - 2022
- [i10]Yuanpeng Li, Joel Hestness, Mohamed Elhoseiny, Liang Zhao, Kenneth Church:
Efficiently Disentangle Causal Representations. CoRR abs/2201.01942 (2022) - [i9]Ehsan Valavi, Joel Hestness, Newsha Ardalani, Marco Iansiti:
Time and the Value of Data. CoRR abs/2203.09118 (2022) - [i8]Ehsan Valavi, Joel Hestness, Marco Iansiti, Newsha Ardalani, Feng Zhu, Karim R. Lakhani:
Time Dependency, Data Flow, and Competitive Advantage. CoRR abs/2203.09128 (2022) - [i7]Vitaliy Chiley, Vithursan Thangarasa, Abhay Gupta, Anshul Samar, Joel Hestness, Dennis DeCoste:
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network. CoRR abs/2206.14098 (2022) - 2021
- [c10]Atli Kosson, Vitaliy Chiley, Abhinav Venigalla, Joel Hestness, Urs Köster:
Pipelined Backpropagation at Scale: Training Large Models without Batches. MLSys 2021 - [i6]Mihir Pendse, Vithursan Thangarasa, Vitaliy Chiley, Ryan Holmdahl, Joel Hestness, Dennis DeCoste:
Memory Efficient 3D U-Net with Reversible Mobile Inverted Bottlenecks for Brain Tumor Segmentation. CoRR abs/2104.09648 (2021) - 2020
- [c9]Mihir Pendse, Vithursan Thangarasa, Vitaliy Chiley, Ryan Holmdahl, Joel Hestness, Dennis DeCoste:
Memory Efficient 3D U-Net with Reversible Mobile Inverted Bottlenecks for Brain Tumor Segmentation. BrainLes@MICCAI (2) 2020: 388-397 - [i5]Atli Kosson, Vitaliy Chiley, Abhinav Venigalla, Joel Hestness, Urs Köster:
Pipelined Backpropagation at Scale: Training Large Models without Batches. CoRR abs/2003.11666 (2020)
2010 – 2019
- 2019
- [j4]Kenneth Ward Church, Joel Hestness:
A survey of 25 years of evaluation. Nat. Lang. Eng. 25(6): 753-767 (2019) - [c8]Yuanpeng Li, Liang Zhao, Jianyu Wang, Joel Hestness:
Compositional Generalization for Primitive Substitutions. EMNLP/IJCNLP (1) 2019: 4292-4301 - [c7]Joel Hestness, Newsha Ardalani, Gregory F. Diamos:
Beyond human-level accuracy: computational challenges in deep learning. PPoPP 2019: 1-14 - [i4]Joel Hestness, Newsha Ardalani, Greg Diamos:
Beyond Human-Level Accuracy: Computational Challenges in Deep Learning. CoRR abs/1909.01736 (2019) - [i3]Yuanpeng Li, Liang Zhao, Jianyu Wang, Joel Hestness:
Compositional Generalization for Primitive Substitutions. CoRR abs/1910.02612 (2019) - 2017
- [c6]Sercan Ömer Arik, Markus Kliegl, Rewon Child, Joel Hestness, Andrew Gibiansky, Christopher Fougner, Ryan Prenger, Adam Coates:
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting. INTERSPEECH 2017: 1606-1610 - [i2]Sercan Ömer Arik, Markus Kliegl, Rewon Child, Joel Hestness, Andrew Gibiansky, Christopher Fougner, Ryan Prenger, Adam Coates:
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting. CoRR abs/1703.05390 (2017) - [i1]Joel Hestness, Sharan Narang, Newsha Ardalani, Gregory F. Diamos, Heewoo Jun, Hassan Kianinejad, Md. Mostofa Ali Patwary, Yang Yang, Yanqi Zhou:
Deep Learning Scaling is Predictable, Empirically. CoRR abs/1712.00409 (2017) - 2015
- [j3]Jason Power, Joel Hestness, Marc S. Orr, Mark D. Hill, David A. Wood:
gem5-gpu: A Heterogeneous CPU-GPU Simulator. IEEE Comput. Archit. Lett. 14(1): 34-36 (2015) - [c5]Joel Hestness, Stephen W. Keckler, David A. Wood:
GPU Computing Pipeline Inefficiencies and Optimization Opportunities in Heterogeneous CPU-GPU Processors. IISWC 2015: 87-97 - 2014
- [c4]Joel Hestness, Stephen W. Keckler, David A. Wood:
A comparative analysis of microarchitecture effects on CPU and GPU memory system behavior. IISWC 2014: 150-160 - 2012
- [j2]Boris Grot, Joel Hestness, Stephen W. Keckler, Onur Mutlu:
A QoS-Enabled On-Die Interconnect Fabric for Kilo-Node Chips. IEEE Micro 32(3): 17-25 (2012) - 2011
- [j1]Nathan L. Binkert, Bradford M. Beckmann, Gabriel Black, Steven K. Reinhardt, Ali G. Saidi, Arkaprava Basu, Joel Hestness, Derek Hower, Tushar Krishna, Somayeh Sardashti, Rathijit Sen, Korey Sewell, Muhammad Shoaib Bin Altaf, Nilay Vaish, Mark D. Hill, David A. Wood:
The gem5 simulator. SIGARCH Comput. Archit. News 39(2): 1-7 (2011) - [c3]Boris Grot, Joel Hestness, Stephen W. Keckler, Onur Mutlu:
Kilo-NOC: a heterogeneous network-on-chip architecture for scalability and service guarantees. ISCA 2011: 401-412 - 2010
- [c2]Joel Hestness, Boris Grot, Stephen W. Keckler:
Netrace: dependency-driven trace-based network-on-chip simulation. NoCArc@MICRO 2010: 31-36
2000 – 2009
- 2009
- [c1]Boris Grot, Joel Hestness, Stephen W. Keckler, Onur Mutlu:
Express Cube Topologies for on-Chip Interconnects. HPCA 2009: 163-174
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-12 20:59 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint