Skip to main content

Showing 1–9 of 9 results for author: Berwick, R

  1. arXiv:2403.00860  [pdf, other

    cs.LG cs.AI cs.NE

    Parallel Algorithms for Exact Enumeration of Deep Neural Network Activation Regions

    Authors: Sabrina Drammis, Bowen Zheng, Karthik Srinivasan, Robert C. Berwick, Nancy A. Lynch, Robert Ajemian

    Abstract: A feedforward neural network using rectified linear units constructs a mapping from inputs to outputs by partitioning its input space into a set of convex regions where points within a region share a single affine transformation. In order to understand how neural networks work, when and why they fail, and how they compare to biological intelligence, we need to understand the organization and forma… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  2. arXiv:2311.06189  [pdf, other

    cs.CL math.LO math.QA math.RA

    Syntax-semantics interface: an algebraic model

    Authors: Matilde Marcolli, Robert C. Berwick, Noam Chomsky

    Abstract: We extend our formulation of Merge and Minimalism in terms of Hopf algebras to an algebraic model of a syntactic-semantic interface. We show that methods adopted in the formulation of renormalization (extraction of meaningful physical values) in theoretical physics are relevant to describe the extraction of meaning from syntactic expressions. We show how this formulation relates to computational m… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: LaTeX, 75 pages, 19 figures

    MSC Class: 91F20; 16T05; 18C50

  3. arXiv:2306.10270  [pdf, other

    cs.CL math.QA math.RA

    Old and New Minimalism: a Hopf algebra comparison

    Authors: Matilde Marcolli, Robert C. Berwick, Noam Chomsky

    Abstract: In this paper we compare some old formulations of Minimalism, in particular Stabler's computational minimalism, and Chomsky's new formulation of Merge and Minimalism, from the point of view of their mathematical description in terms of Hopf algebras. We show that the newer formulation has a clear advantage purely in terms of the underlying mathematical structure. More precisely, in the case of Sta… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: 27 pages, LaTeX, 3 figures

    MSC Class: 68Q70; 16T05

  4. arXiv:2305.18278  [pdf, ps, other

    cs.CL math.QA math.RA

    Mathematical Structure of Syntactic Merge

    Authors: Matilde Marcolli, Noam Chomsky, Robert Berwick

    Abstract: The syntactic Merge operation of the Minimalist Program in linguistics can be described mathematically in terms of Hopf algebras, with a formalism similar to the one arising in the physics of renormalization. This mathematical formulation of Merge has good descriptive power, as phenomena empirically observed in linguistics can be justified from simple mathematical arguments. It also provides a pos… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    MSC Class: 68Q70; 16T05

  5. arXiv:1906.06349  [pdf, other

    cs.CL cs.LG

    On the Computational Power of RNNs

    Authors: Samuel A. Korsky, Robert C. Berwick

    Abstract: Recent neural network architectures such as the basic recurrent neural network (RNN) and Gated Recurrent Unit (GRU) have gained prominence as end-to-end learning architectures for natural language processing tasks. But what is the computational power of such systems? We prove that finite precision RNNs with one hidden layer and ReLU activation and finite precision GRUs are exactly as computational… ▽ More

    Submitted 18 June, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  6. arXiv:1811.02611  [pdf, ps, other

    cs.CL

    Evaluating the Ability of LSTMs to Learn Context-Free Grammars

    Authors: Luzi Sennhauser, Robert C. Berwick

    Abstract: While long short-term memory (LSTM) neural net architectures are designed to capture sequence information, human language is generally composed of hierarchical structures. This raises the question as to whether LSTMs can learn hierarchical structures. We explore this question with a well-formed bracket prediction task using two types of brackets modeled by an LSTM. Demonstrating that such a system… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Journal ref: Proceedings of the EMNLP Workshop BlackboxNLP (2018) 115-124

  7. arXiv:1803.09832  [pdf, other

    cs.CL

    Heat Kernel analysis of Syntactic Structures

    Authors: Andrew Ortegaray, Robert C. Berwick, Matilde Marcolli

    Abstract: We consider two different data sets of syntactic parameters and we discuss how to detect relations between parameters through a heat kernel method developed by Belkin-Niyogi, which produces low dimensional representations of the data, based on Laplace eigenfunctions, that preserve neighborhood information. We analyze the different connectivity and clustering structures that arise in the two datase… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

    Comments: 20 pages, LaTeX, png figures

  8. arXiv:1712.01719  [pdf, other

    cs.CL

    Phylogenetics of Indo-European Language families via an Algebro-Geometric Analysis of their Syntactic Structures

    Authors: Kevin Shu, Andrew Ortegaray, Robert Berwick, Matilde Marcolli

    Abstract: Using Phylogenetic Algebraic Geometry, we analyze computationally the phylogenetic tree of subfamilies of the Indo-European language family, using data of syntactic structures. The two main sources of syntactic data are the SSWL database and Longobardi's recent data of syntactic parameters. We compute phylogenetic invariants and likelihood functions for two sets of Germanic languages, a set of Rom… ▽ More

    Submitted 24 June, 2019; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: 57 pages, LaTeX; v2: some corrections and more details

    MSC Class: 91F20; 14M12; 92B10; 13P25

  9. arXiv:cmp-lg/9503012  [pdf, ps

    cs.CL q-bio

    A Note on Zipf's Law, Natural Languages, and Noncoding DNA regions

    Authors: Partha Niyogi, Robert C. Berwick

    Abstract: In Phys. Rev. Letters (73:2, 5 Dec. 94), Mantegna et al. conclude on the basis of Zipf rank frequency data that noncoding DNA sequence regions are more like natural languages than coding regions. We argue on the contrary that an empirical fit to Zipf's ``law'' cannot be used as a criterion for similarity to natural languages. Although DNA is a presumably an ``organized system of signs'' in Mande… ▽ More

    Submitted 9 March, 1995; originally announced March 1995.

    Comments: compressed uuencoded postscript file: 14 pages

    Report number: MIT CBCL Memo No. 118