


William Merrill
2020 – today
- 2026
[i38]Selim Jerad, Anej Svete, Sophie Hao, Ryan Cotterell, William Merrill:
Context-Free Recognition with Transformers. CoRR abs/2601.01754 (2026)
- 2025
[c22]Michael Y. Hu, Jackson Petty, Chuan Shi, William Merrill, Tal Linzen:
Between Circuits and Chomsky: Pre-pretraining on Formal Languages Imparts Linguistic Biases. ACL (1) 2025: 9691-9709
[i37]Team OLMo, Pete Walsh, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Shane Arora, Akshita Bhagia, Yuling Gu, Shengyi Huang, Matt Jordan, Nathan Lambert, Dustin Schwenk, Oyvind Tafjord, Taira Anderson, David Atkinson, Faeze Brahman, Christopher Clark, Pradeep Dasigi, Nouha Dziri, Michal Guerquin, Hamish Ivison, Pang Wei Koh, Jiacheng Liu, Saumya Malik, William Merrill, Lester James V. Miranda, Jacob Morrison, Tyler Murray, Crystal Nam, Valentina Pyatkin, Aman Rangapur, Michael Schmitz, Sam Skjonsberg, David Wadden, Christopher Wilhelm, Michael Wilson, Luke Zettlemoyer, Ali Farhadi, Noah A. Smith, Hannaneh Hajishirzi:
2 OLMo 2 Furious. CoRR abs/2501.00656 (2025)
[i36]Michael Y. Hu, Jackson Petty, Chuan Shi, William Merrill, Tal Linzen:
Between Circuits and Chomsky: Pre-pretraining on Formal Languages Imparts Linguistic Biases. CoRR abs/2502.19249 (2025)
[i35]William Merrill, Ashish Sabharwal:
A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers. CoRR abs/2503.03961 (2025)
[i34]Bo Peng, Ruichong Zhang, Daniel Goldstein, Eric Alcaide, Xingjian Du, Haowen Hou, Jiaju Lin, Jiaxing Liu, Janna Lu, William Merrill, Guangyu Song, Kaifeng Tan, Saiteja Utpala, Nathan Wilce, Johan S. Wind, Tianyi Wu, Daniel Wuttke, Christian Zhou-Zheng:
RWKV-7 "Goose" with Expressive Dynamic State Evolution. CoRR abs/2503.14456 (2025)
[i33]William Merrill, Ashish Sabharwal:
Exact Expressive Power of Transformers with Padding. CoRR abs/2505.18948 (2025)
[i32]William Merrill, Shane Arora, Dirk Groeneveld, Hannaneh Hajishirzi:
Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training. CoRR abs/2505.23971 (2025)
[i31]Jackson Petty, Michael Y. Hu, Wentao Wang, Shauli Ravfogel, William Merrill, Tal Linzen:
RELIC: Evaluating Compositional Instruction Following via Language Recognition. CoRR abs/2506.05205 (2025)
[i30]Andy Yang, Christopher Watson, Anton Xue, Satwik Bhattamishra, Jose Llarena, William Merrill, Emile Dos Santos Ferreira, Anej Svete, David Chiang:
The Transformer Cookbook. CoRR abs/2510.00368 (2025)
[i29]Allyson Ettinger, Amanda Bertsch, Bailey Kuehl, David Graham, David Heineman, Dirk Groeneveld, Faeze Brahman, Finbarr Timbers, Hamish Ivison, Jacob Morrison, Jake Poznanski, Kyle Lo, Luca Soldaini, Matt Jordan, Mayee F. Chen, Michael Noukhovitch, Nathan Lambert, Pete Walsh, Pradeep Dasigi, Robert Berry, Saumya Malik, Saurabh Shah, Scott Geng, Shane Arora, Shashank Gupta, Taira Anderson, Teng Xiao, Tyler Murray, Tyler Romero, Victoria Graf, Akari Asai, Akshita Bhagia, Alexander Wettig, Alisa Liu, Aman Rangapur, Chloe Anastasiades, Costa Huang, Dustin Schwenk, Harsh Trivedi, Ian Magnusson, Jaron Lochner, Jiacheng Liu, Lester James V. Miranda, Maarten Sap, Malia Morgan, Michael Schmitz, Michal Guerquin, Michael Wilson, Regan Huff, Ronan Le Bras, Rui Xin, Rulin Shao, Sam Skjonsberg, Shannon Zejiang Shen, Shuyue Stella Li, Tucker Wilde, Valentina Pyatkin, William Merrill, Yapei Chang, Yuling Gu, Zhiyuan Zeng, Ashish Sabharwal, Luke Zettlemoyer, Pang Wei Koh, Ali Farhadi, Noah A. Smith, Hannaneh Hajishirzi:
Olmo 3. CoRR abs/2512.13961 (2025)
- 2024
[j8]Lena Strobl, William Merrill, Gail Weiss, David Chiang, Dana Angluin:
What Formal Languages Can Transformers Express? A Survey. Trans. Assoc. Comput. Linguistics 12: 543-561 (2024)
[c21]Alexandra Butoi, Robin Chan, Ryan Cotterell, William Merrill, Franz Nowak, Clemente Pasti, Lena Strobl, Anej Svete:
Computational Expressivity of Neural Language Models. ACL (5) 2024: 5
[c20]William Merrill, Zhaofeng Wu, Norihito Naka, Yoon Kim, Tal Linzen:
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment. ACL (Findings) 2024: 2752-2773
[c19]Dirk Groeneveld, Iz Beltagy, Evan Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. ACL (1) 2024: 15789-15809
[c18]William Merrill, Noah A. Smith, Yanai Elazar:
Evaluating n-Gram Novelty of Language Models Using Rusty-DAWG. EMNLP 2024: 14459-14473
[c17]William Merrill, Ashish Sabharwal:
The Expressive Power of Transformers with Chain of Thought. ICLR 2024
[c16]William Merrill, Jackson Petty, Ashish Sabharwal:
The Illusion of State in State-Space Models. ICML 2024: 35492-35506
[c15]Muru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith:
How Language Model Hallucinations Can Snowball. ICML 2024: 59670-59684
[i28]Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. CoRR abs/2402.00838 (2024)
[i27]William Merrill, Zhaofeng Wu, Norihito Naka, Yoon Kim, Tal Linzen:
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment. CoRR abs/2402.13956 (2024)
[i26]William Merrill, Jackson Petty, Ashish Sabharwal:
The Illusion of State in State-Space Models. CoRR abs/2404.08819 (2024)
[i25]Jacob Pfau, William Merrill, Samuel R. Bowman:
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models. CoRR abs/2404.15758 (2024)
[i24]William Merrill, Noah A. Smith, Yanai Elazar:
Evaluating n-Gram Novelty of Language Models Using Rusty-DAWG. CoRR abs/2406.13069 (2024)
- 2023
[j7]William Merrill, Ashish Sabharwal:
The Parallelism Tradeoff: Limitations of Log-Precision Transformers. Trans. Assoc. Comput. Linguistics 11: 531-545 (2023)
[j6]Zhaofeng Wu, William Merrill, Hao Peng, Iz Beltagy, Noah A. Smith:
Transparency Helps Reveal When Language Models Learn Meaning. Trans. Assoc. Comput. Linguistics 11: 617-634 (2023)
[c14]William Merrill:
Formal Languages and the NLP Black Box. DLT 2023: 1-8
[c13]William Merrill:
Formal languages and neural models for learning on sequences. ICGI 2023: 5
[c12]William Merrill, Ashish Sabharwal:
A Logic for Expressing Log-Precision Transformers. NeurIPS 2023
[i23]William Merrill, Nikolaos Tsilivis, Aman Shukla:
A Tale of Two Circuits: Grokking as Competition of Sparse and Dense Subnetworks. CoRR abs/2303.11873 (2023)
[i22]Muru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith:
How Language Model Hallucinations Can Snowball. CoRR abs/2305.13534 (2023)
[i21]William Merrill, Ashish Sabharwal:
The Expressive Power of Transformers with Chain of Thought. CoRR abs/2310.07923 (2023)
[i20]Lena Strobl, William Merrill, Gail Weiss, David Chiang, Dana Angluin:
Transformers as Recognizers of Formal Languages: A Survey on Expressivity. CoRR abs/2311.00208 (2023)
- 2022
[j5]William Merrill, Ashish Sabharwal, Noah A. Smith:
Saturated Transformers are Constant-Depth Threshold Circuits. Trans. Assoc. Comput. Linguistics 10: 843-856 (2022)
[c11]Sanjay Subramanian, William Merrill, Trevor Darrell, Matt Gardner, Sameer Singh, Anna Rohrbach:
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension. ACL (1) 2022: 5198-5215
[c10]William Merrill, Alex Warstadt, Tal Linzen:
Entailment Semantics Can Be Extracted from an Ideal Language Model. CoNLL 2022: 176-193
[i19]William Merrill, Nikolaos Tsilivis:
Extracting Finite Automata from RNNs Using State Merging. CoRR abs/2201.12451 (2022)
[i18]Sanjay Subramanian, William Merrill, Trevor Darrell, Matt Gardner, Sameer Singh, Anna Rohrbach:
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension. CoRR abs/2204.05991 (2022)
[i17]William Merrill, Ashish Sabharwal:
Log-Precision Transformers are Constant-Depth Uniform Threshold Circuits. CoRR abs/2207.00729 (2022)
[i16]William Merrill, Alex Warstadt, Tal Linzen:
Entailment Semantics Can Be Extracted from an Ideal Language Model. CoRR abs/2209.12407 (2022)
[i15]William Merrill, Ashish Sabharwal:
Transformers Implement First-Order Logic with Majority Quantifiers. CoRR abs/2210.02671 (2022)
[i14]Zhaofeng Wu, William Merrill, Hao Peng, Iz Beltagy, Noah A. Smith:
Transparency Helps Reveal When Language Models Learn Meaning. CoRR abs/2210.07468 (2022)
- 2021
[j4]William Merrill, Yoav Goldberg, Roy Schwartz, Noah A. Smith:
Provable Limitations of Acquiring Meaning from Ungrounded Form: What Will Future Language Models Understand? Trans. Assoc. Comput. Linguistics 9: 1047-1060 (2021)
[c9]William Merrill, Vivek Ramanujan, Yoav Goldberg, Roy Schwartz, Noah A. Smith:
Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent. EMNLP (1) 2021: 1766-1781
[c8]Matt Gardner, William Merrill, Jesse Dodge, Matthew E. Peters, Alexis Ross, Sameer Singh, Noah A. Smith:
Competency Problems: On Finding and Removing Artifacts in Language Data. EMNLP (1) 2021: 1801-1813
[i13]William Merrill:
Formal Language Theory Meets Modern NLP. CoRR abs/2102.10094 (2021)
[i12]Matt Gardner, William Merrill, Jesse Dodge, Matthew E. Peters, Alexis Ross, Sameer Singh, Noah A. Smith:
Competency Problems: On Finding and Removing Artifacts in Language Data. CoRR abs/2104.08646 (2021)
[i11]William Merrill, Yoav Goldberg, Roy Schwartz, Noah A. Smith:
Provable Limitations of Acquiring Meaning from Ungrounded Form: What will Future Language Models Understand? CoRR abs/2104.10809 (2021)
[i10]William Merrill, Yoav Goldberg, Roy Schwartz, Noah A. Smith:
On the Power of Saturated Transformers: A View from Circuit Complexity. CoRR abs/2106.16213 (2021)
- 2020
[c7]William Merrill, Gail Weiss, Yoav Goldberg, Roy Schwartz, Noah A. Smith, Eran Yahav:
A Formal Hierarchy of RNN Architectures. ACL 2020: 443-459
[i9]William Merrill:
On the Linguistic Capacity of Real-Time Counter Automata. CoRR abs/2004.06866 (2020)
[i8]William Merrill, Gail Weiss, Yoav Goldberg, Roy Schwartz, Noah A. Smith, Eran Yahav:
A Formal Hierarchy of RNN Architectures. CoRR abs/2004.08500 (2020)
[i7]Lucy Lu Wang, Kyle Lo, Yoganand Chandrasekhar, Russell Reas, Jiangjiang Yang, Darrin Eide, Kathryn Funk, Rodney Kinney, Ziyang Liu, William Merrill, Paul Mooney, Dewey A. Murdick, Devvret Rishi, Jerry Sheehan, Zhihong Shen, Brandon Stilson, Alex D. Wade, Kuansan Wang, Chris Wilhelm, Boya Xie, Douglas Raymond, Daniel S. Weld, Oren Etzioni, Sebastian Kohlmeier:
CORD-19: The Covid-19 Open Research Dataset. CoRR abs/2004.10706 (2020)
[i6]William Merrill, Vivek Ramanujan, Yoav Goldberg, Roy Schwartz, Noah A. Smith:
Parameter Norm Growth During Training of Transformers. CoRR abs/2010.09697 (2020)
2010 – 2019
- 2019
[c6]William Merrill, Gigi Felice Stark, Robert Frank:
Detecting Syntactic Change Using a Neural Part-of-Speech Tagger. LChange@ACL 2019: 167-174
[c5]William Merrill, Lenny Khazan, Noah Amsel, Yiding Hao, Simon Mendelsohn, Robert Frank:
Finding Hierarchical Structure in Neural Stacks Using Unsupervised Parsing. BlackboxNLP@ACL 2019: 224-232
[i5]William Merrill, Lenny Khazan, Noah Amsel, Yiding Hao, Simon Mendelsohn, Robert Frank:
Finding Syntactic Representations in Neural Stacks. CoRR abs/1906.01594 (2019)
[i4]William Merrill:
Sequential Neural Networks as Automata. CoRR abs/1906.01615 (2019)
[i3]William Merrill, Gigi Felice Stark, Robert Frank:
Detecting Syntactic Change Using a Neural Part-of-Speech Tagger. CoRR abs/1906.01661 (2019)
- 2018
[c4]Tiwalayo Eisape, William Merrill, Joshua K. Hartshorne, Sven Dietz:
Using Machine Learning to Understand Transfer from First Language to Second Language. CogSci 2018
[c3]Yiding Hao, William Merrill, Dana Angluin, Robert Frank, Noah Amsel, Andrew Benz, Simon Mendelsohn:
Context-Free Transductions with Neural Stacks. BlackboxNLP@EMNLP 2018: 306-315
[c2]Jungo Kasai, Robert Frank, Pauli Xu, William Merrill, Owen Rambow:
End-to-End Graph-Based TAG Parsing with Neural Networks. NAACL-HLT 2018: 1181-1194
[i2]Jungo Kasai, Robert Frank, Pauli Xu, William Merrill, Owen Rambow:
End-to-end Graph-based TAG Parsing with Neural Networks. CoRR abs/1804.06610 (2018)
[i1]Yiding Hao, William Merrill, Dana Angluin, Robert Frank, Noah Amsel, Andrew Benz, Simon Mendelsohn:
Context-Free Transductions with Neural Stacks. CoRR abs/1809.02836 (2018)
- 2010
[j3]William Merrill:
Where is the return on investment in wireless sensor networks? IEEE Wirel. Commun. 17(1): 4-6 (2010)
2000 – 2009
- 2004
[j2]William Merrill, Lewis Girod, Brian Schiffer, Dustin McIntire, Guillaume Rava, Katayoun Sohrabi, Fredric Newberg, Jeremy Elson, William J. Kaiser:
Dynamic Networking and Smart Sensing Enable Next-Generation Landmines. IEEE Pervasive Comput. 3(4): 84-90 (2004)
[j1]Katayoun Sohrabi, William Merrill, Jeremy Elson, Lewis Girod, Fredric Newberg, William J. Kaiser:
Methods for Scalable Self-Assembly of Ad Hoc Wireless Sensor Networks. IEEE Trans. Mob. Comput. 3(4): 317-331 (2004)
- 2001
[c1]William Merrill:
Preserving and Protecting the Freedom to Learn Online. WebNet 2001: 857
last updated on 2026-02-13 00:43 CET by the dblp team
all metadata released as open data under CC0 1.0 license