-
Who's the GOAT? Sports Rankings and Data-Driven Random Walks on the Symmetric Group
Authors:
Gian-Gabriel P. Garcia,
J. Carlos Martínez Mori
Abstract:
Given a collection of historical sports rankings, can one tell which player is the greatest of all time (i.e., the GOAT)? In this work, we design a data-driven random walk on the symmetric group to obtain a stationary distribution over player rankings, spanning across different time periods in sports history. We combine this distribution with a notion of stochastic dominance to obtain a partial or…
▽ More
Given a collection of historical sports rankings, can one tell which player is the greatest of all time (i.e., the GOAT)? In this work, we design a data-driven random walk on the symmetric group to obtain a stationary distribution over player rankings, spanning across different time periods in sports history. We combine this distribution with a notion of stochastic dominance to obtain a partial order over the players. Compared to existing methods, our approach is distinct in that i) using historical rankings captures the evolution of value systems and facilitates player comparisons when head-to-head data is unavailable, and i) aggregating into a partial order formally comes to terms with the possibility that, while some player comparisons can be established conclusively, others are "too close to call." We implement our methods using publicly available data from the Association of Tennis Professionals (ATP) and the Women's Tennis Association (WTA). Our main findings indicate that Steffi Graf and Serena Williams are the ones that come ahead as the GOATs of the WTA. Likewise, the "Big Three," that is Novak Djokovic, Roger Federer, and Rafael Nadal, are the ones that come ahead as the GOATs of the ATP. As a secondary finding, we note major differences in terms of career and dominance longevity for top players across the associations. While initially motivated by this application in sports analytics, our methods can also be applied to other practical domains where deriving rankings from historical data can inform operational decisions, such as route planning logistics.
△ Less
Submitted 15 October, 2024; v1 submitted 18 September, 2024;
originally announced September 2024.
-
What is a Parking Function?
Authors:
J. Carlos Martínez Mori
Abstract:
In this expository article I describe classical results in the combinatorics of parking functions. Its English-Spanish translation is included. -- --
En este artículo de difusión matemática describo resultados clásicos en la combinatoria de funciones de parqueo. Su traducción español-inglés está incluida.
In this expository article I describe classical results in the combinatorics of parking functions. Its English-Spanish translation is included. -- --
En este artículo de difusión matemática describo resultados clásicos en la combinatoria de funciones de parqueo. Su traducción español-inglés está incluida.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
Boolean intervals in the weak Bruhat order of a finite Coxeter group
Authors:
Ben Adenbaum,
Jennifer Elder,
Pamela E. Harris,
J. Carlos Martínez Mori
Abstract:
Given a Coxeter group $W$ with Coxeter system $(W,S)$, where $S$ is finite. We provide a complete characterization of Boolean intervals in the weak order of $W$ uniformly for all Coxeter groups in terms of independent sets of the Coxeter graph. Moreover, we establish that the number of Boolean intervals of rank $k$ in the weak order of $W$ is ${i_k(Γ_W)\cdot|W|}\,/\,2^{k}$, where $Γ_W$ is the Coxe…
▽ More
Given a Coxeter group $W$ with Coxeter system $(W,S)$, where $S$ is finite. We provide a complete characterization of Boolean intervals in the weak order of $W$ uniformly for all Coxeter groups in terms of independent sets of the Coxeter graph. Moreover, we establish that the number of Boolean intervals of rank $k$ in the weak order of $W$ is ${i_k(Γ_W)\cdot|W|}\,/\,2^{k}$, where $Γ_W$ is the Coxeter graph of $W$ and $i_k(Γ_W)$ is the number of independent sets of size $k$ of $Γ_W$ when $W$ is finite. Specializing to $A_n$, we recover the characterizations and enumerations of Boolean intervals in the weak order of $A_n$ given in arXiv:2306.14734. We provide the analogous results for types $C_n$ and $D_n$, including the related generating functions and additional connections to well-known integer sequences.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Modeling Processes of Neighborhood Change
Authors:
J. Carlos Martínez Mori,
Zhanzhan Zhao
Abstract:
An urban planner might design the spatial layout of transportation amenities so as to improve accessibility for underserved communities -- a fairness objective. However, implementing such a design might trigger processes of neighborhood change that change who benefits from these amenities in the long term. If so, has the planner really achieved their fairness objective? Can algorithmic decision-ma…
▽ More
An urban planner might design the spatial layout of transportation amenities so as to improve accessibility for underserved communities -- a fairness objective. However, implementing such a design might trigger processes of neighborhood change that change who benefits from these amenities in the long term. If so, has the planner really achieved their fairness objective? Can algorithmic decision-making anticipate second order effects? In this paper, we take a step in this direction by formulating processes of neighborhood change as instances of no-regret dynamics; a collective learning process in which a set of strategic agents rapidly reach a state of approximate equilibrium. We mathematize concepts of neighborhood change to model the incentive structures impacting individual dwelling-site decision-making. Our model accounts for affordability, access to relevant transit amenities, community ties, and site upkeep. We showcase our model with computational experiments that provide semi-quantitative insights on the spatial economics of neighborhood change, particularly on the influence of residential zoning policy and the placement of transit amenities.
△ Less
Submitted 9 February, 2024; v1 submitted 6 January, 2024;
originally announced January 2024.
-
Interval and $\ell$-interval Rational Parking Functions
Authors:
Tomás Aguilar-Fraga,
Jennifer Elder,
Rebecca E. Garcia,
Kimberly P. Hadaway,
Pamela E. Harris,
Kimberly J. Harry,
Imhotep B. Hogan,
Jakeyl Johnson,
Jan Kretschmann,
Kobe Lawson-Chavanu,
J. Carlos Martínez Mori,
Casandra D. Monroe,
Daniel Quiñonez,
Dirk Tolson III,
Dwight Anderson Williams II
Abstract:
Interval parking functions are a generalization of parking functions in which cars have an interval preference for their parking. We generalize this definition to parking functions with $n$ cars and $m\geq n$ parking spots, which we call interval rational parking functions and provide a formula for their enumeration. By specifying an integer parameter $\ell\geq 0$, we then consider the subset of i…
▽ More
Interval parking functions are a generalization of parking functions in which cars have an interval preference for their parking. We generalize this definition to parking functions with $n$ cars and $m\geq n$ parking spots, which we call interval rational parking functions and provide a formula for their enumeration. By specifying an integer parameter $\ell\geq 0$, we then consider the subset of interval rational parking functions in which each car parks at most $\ell$ spots away from their initial preference. We call these $\ell$-interval rational parking functions and provide recursive formulas to enumerate this set for all positive integers $m\geq n$ and $\ell$. We also establish formulas for the number of nondecreasing $\ell$-interval rational parking functions via the outcome map on rational parking functions. We also consider the intersection between $\ell$-interval parking functions and Fubini rankings and show the enumeration of these sets is given by generalized Fibonacci numbers. We conclude by specializing $\ell=1$, and establish that the set of $1$-interval rational parking functions with $n$ cars and $m$ spots are in bijection with the set of barred preferential arrangements of $[n]$ with $m-n$ bars. This readily implies enumerative formulas. Further, in the case where $\ell=1$, we recover the results of Hadaway and Harris that unit interval parking functions are in bijection with the set of Fubini rankings, which are enumerated by the Fubini numbers.
△ Less
Submitted 16 September, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
Cost-sharing in Parking Games
Authors:
Jennifer Elder,
Pamela E. Harris,
Jan Kretschmann,
J. Carlos Martínez Mori
Abstract:
In this paper, we study the total displacement statistic of parking functions from the perspective of cooperative game theory. We introduce parking games, which are coalitional cost-sharing games in characteristic function form derived from the total displacement statistic. We show that parking games are supermodular cost-sharing games, indicating that cooperation is difficult (i.e., their core is…
▽ More
In this paper, we study the total displacement statistic of parking functions from the perspective of cooperative game theory. We introduce parking games, which are coalitional cost-sharing games in characteristic function form derived from the total displacement statistic. We show that parking games are supermodular cost-sharing games, indicating that cooperation is difficult (i.e., their core is empty). Next, we study their Shapley value, which formalizes a notion of "fair" cost-sharing and amounts to charging each car for its expected marginal displacement under a random arrival order. Our main contribution is a polynomial-time algorithm to compute the Shapley value of parking games, in contrast with known hardness results on computing the Shapley value of arbitrary games. The algorithm leverages the permutation-invariance of total displacement, combinatorial enumeration, and dynamic programming. We conclude with open questions around an alternative solution concept for supermodular cost-sharing games and connections to other areas in combinatorics.
△ Less
Submitted 16 September, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Sparse Graphical Designs via Linear Programming
Authors:
Hessa Al-Thani,
Catherine Babecki,
J. Carlos Martínez Mori
Abstract:
Graphical designs are a framework for sampling and numerical integration of functions on graphs. In this note, we introduce a method to address the trade-off between graphical design sparsity and accuracy. We show how to obtain sparse graphical designs via linear programming and design objective functions that aim to maximize their accuracy. We showcase our approach using yellow taxicab data from…
▽ More
Graphical designs are a framework for sampling and numerical integration of functions on graphs. In this note, we introduce a method to address the trade-off between graphical design sparsity and accuracy. We show how to obtain sparse graphical designs via linear programming and design objective functions that aim to maximize their accuracy. We showcase our approach using yellow taxicab data from New York City.
△ Less
Submitted 15 September, 2023; v1 submitted 1 September, 2023;
originally announced September 2023.
-
Parking functions, Fubini rankings, and Boolean intervals in the weak order of $\mathfrak{S}_n$
Authors:
Jennifer Elder,
Pamela E. Harris,
Jan Kretschmann,
J. Carlos Martínez Mori
Abstract:
Let $\mathfrak{S}_n$ denote the symmetric group and let $W(\mathfrak{S}_n)$ denote the weak order of $\mathfrak{S}_n$. Through a surprising connection to a subset of parking functions, which we call unit Fubini rankings, we provide a complete characterization and enumeration for the total number of Boolean intervals in $W(\mathfrak{S}_n)$ and the total number of Boolean intervals of rank $k$ in…
▽ More
Let $\mathfrak{S}_n$ denote the symmetric group and let $W(\mathfrak{S}_n)$ denote the weak order of $\mathfrak{S}_n$. Through a surprising connection to a subset of parking functions, which we call unit Fubini rankings, we provide a complete characterization and enumeration for the total number of Boolean intervals in $W(\mathfrak{S}_n)$ and the total number of Boolean intervals of rank $k$ in $W(\mathfrak{S}_n)$. Furthermore, for any $π\in\mathfrak{S}_n$, we establish that the number of Boolean intervals in $W(\mathfrak{S}_n)$ with minimal element $π$ is a product of Fibonacci numbers. We conclude with some directions for further study.
△ Less
Submitted 27 May, 2024; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Lucky Cars and the Quicksort Algorithm
Authors:
Pamela E. Harris,
Jan Kretschmann,
J. Carlos Martínez Mori
Abstract:
Quicksort is a classical divide-and-conquer sorting algorithm. It is a comparison sort that makes an average of $2(n+1)H_n - 4n$ comparisons on an array of size $n$ ordered uniformly at random, where $H_n = \sum_{i=1}^n\frac{1}{i}$ is the $n$th harmonic number. Therefore, it makes $n!\left[2(n+1)H_n - 4n\right]$ comparisons to sort all possible orderings of the array. In this article, we prove tha…
▽ More
Quicksort is a classical divide-and-conquer sorting algorithm. It is a comparison sort that makes an average of $2(n+1)H_n - 4n$ comparisons on an array of size $n$ ordered uniformly at random, where $H_n = \sum_{i=1}^n\frac{1}{i}$ is the $n$th harmonic number. Therefore, it makes $n!\left[2(n+1)H_n - 4n\right]$ comparisons to sort all possible orderings of the array. In this article, we prove that this count also enumerates the parking preference lists of $n$ cars parking on a one-way street with $n$ parking spots resulting in exactly $n-1$ lucky cars (i.e., cars that park in their preferred spot). For $n\geq 2$, both counts satisfy the second order recurrence relation $ f_n=2nf_{n-1}-n(n-1)f_{n-2}+2(n-1)! $ with $f_0=f_1=0$.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
Designing Equitable Transit Networks
Authors:
Sophie Pavia,
J. Carlos Martinez Mori,
Aryaman Sharma,
Philip Pugliese,
Abhishek Dubey,
Samitha Samaranayake,
Ayan Mukhopadhyay
Abstract:
Public transit is an essential infrastructure enabling access to employment, healthcare, education, and recreational facilities. While accessibility to transit is important in general, some sections of the population depend critically on transit. However, existing public transit is often not designed equitably, and often, equity is only considered as an additional objective post hoc, which hampers…
▽ More
Public transit is an essential infrastructure enabling access to employment, healthcare, education, and recreational facilities. While accessibility to transit is important in general, some sections of the population depend critically on transit. However, existing public transit is often not designed equitably, and often, equity is only considered as an additional objective post hoc, which hampers systemic changes. We present a formulation for transit network design that considers different notions of equity and welfare explicitly. We study the interaction between network design and various concepts of equity and present trade-offs and results based on real-world data from a large metropolitan area in the United States of America.
△ Less
Submitted 7 August, 2023; v1 submitted 22 December, 2022;
originally announced December 2022.
-
Permutation Invariant Parking Assortments
Authors:
Douglas M. Chen,
Pamela E. Harris,
J. Carlos Martínez Mori,
Eric J. Pabón-Cancel,
Gabriel Sargent
Abstract:
We introduce parking assortments, a generalization of parking functions with cars of assorted lengths. In this setting, there are $n\in\mathbb{N}$ cars of lengths $\mathbf{y}=(y_1,y_2,\ldots,y_n)\in\mathbb{N}^n$ entering a one-way street with $m=\sum_{i=1}^ny_i$ parking spots. The cars have parking preferences $\mathbf{x}=(x_1,x_2,\ldots,x_n)\in[m]^n$, where $[m]:=\{1,2,\ldots,m\}$, and enter the…
▽ More
We introduce parking assortments, a generalization of parking functions with cars of assorted lengths. In this setting, there are $n\in\mathbb{N}$ cars of lengths $\mathbf{y}=(y_1,y_2,\ldots,y_n)\in\mathbb{N}^n$ entering a one-way street with $m=\sum_{i=1}^ny_i$ parking spots. The cars have parking preferences $\mathbf{x}=(x_1,x_2,\ldots,x_n)\in[m]^n$, where $[m]:=\{1,2,\ldots,m\}$, and enter the street in order. Each car $i \in [n]$, with length $y_i$ and preference $x_i$, follows a natural extension of the classical parking rule: it begins looking for parking at its preferred spot $x_i$ and parks in the first $y_i$ contiguously available spots thereafter, if there are any. If all cars are able to park under the preference list $\mathbf{x}$, we say $\mathbf{x}$ is a parking assortment for $\mathbf{y}$. Parking assortments also generalize parking sequences, introduced by Ehrenborg and Happ, since each car seeks for the first contiguously available spots it fits in past its preference. Given a parking assortment $\mathbf{x}$ for $\mathbf{y}$, we say it is permutation invariant if all rearrangements of $\mathbf{x}$ are also parking assortments for $\mathbf{y}$. While all parking functions are permutation invariant, this is not the case for parking assortments in general, motivating the need for characterization of this property. Although obtaining a full characterization for arbitrary $n\in\mathbb{N}$ and $\mathbf{y}\in\mathbb{N}^n$ remains elusive, we do so for $n=2,3$. Given the technicality of these results, we introduce the notion of minimally invariant car lengths, for which the only invariant parking assortment is the all ones preference list. We provide a concise, oracle-based characterization of minimally invariant car lengths for any $n\in\mathbb{N}$. Our results around minimally invariant car lengths also hold for parking sequences.
△ Less
Submitted 4 August, 2023; v1 submitted 2 November, 2022;
originally announced November 2022.
-
On the Outcome Map of MVP Parking Functions: Permutations Avoiding 321 and 3412, and Motzkin Paths
Authors:
Pamela E. Harris,
Brian M. Kamau,
J. Carlos Martínez Mori,
Roger Tian
Abstract:
We introduce a new parking procedure called MVP parking in which $n$ cars sequentially enter a one-way street with a preferred parking spot from the $n$ parking spots on the street. If their preferred spot is empty, they park there. Otherwise, they park there and the car parked in that spot is bumped to the next unoccupied spot on the street. If all cars can park under this parking procedure, we s…
▽ More
We introduce a new parking procedure called MVP parking in which $n$ cars sequentially enter a one-way street with a preferred parking spot from the $n$ parking spots on the street. If their preferred spot is empty, they park there. Otherwise, they park there and the car parked in that spot is bumped to the next unoccupied spot on the street. If all cars can park under this parking procedure, we say the list of preferences of the $n$ cars is an MVP parking function of length $n$. We show that the set of (classical) parking functions is exactly the set of MVP parking functions although the parking outcome (order in which the cars park) is different under each parking process. Motivating the question: Given a permutation describing the outcome of the MPV parking process, what is the number of MVP parking functions resulting in that given outcome? Our main result establishes a bound for this count which is tight precisely when the permutation describing the parking outcome avoids the patterns 321 and 3412. We then consider special cases of permutations and give closed formulas for the number of MVP parking functions with those outcomes. In particular, we show that the number of MVP parking functions which park in reverse order (that is the permutation describing the outcome is the longest word in $\mathfrak{S}_n$, which does not avoid the pattern 321) is given by the $n$th Motzkin number. We also give families of permutations describing the parking outcome for which the cardinality of the set of cars parking in that order is exponential and others in which it is linear.
△ Less
Submitted 20 February, 2023; v1 submitted 26 July, 2022;
originally announced July 2022.
-
On Parking Functions and The Tower of Hanoi
Authors:
Yasmin Aguillon,
Dylan Alvarenga,
Pamela E. Harris,
Surya Kotapati,
J. Carlos Martínez Mori,
Casandra D. Monroe,
Zia Saylor,
Camelle Tieu,
Dwight Anderson Williams II
Abstract:
The displacement of a parking function measures the total difference between where cars want to park and where they ultimately park. In this article, we prove that the set of parking functions of length $n$ with displacement one is in bijection with the set of ideal states in the famous Tower of Hanoi game with $n+1$ disks and $n+1$ pegs, both sets being enumerated by the Lah numbers.
The displacement of a parking function measures the total difference between where cars want to park and where they ultimately park. In this article, we prove that the set of parking functions of length $n$ with displacement one is in bijection with the set of ideal states in the famous Tower of Hanoi game with $n+1$ disks and $n+1$ pegs, both sets being enumerated by the Lah numbers.
△ Less
Submitted 1 June, 2022;
originally announced June 2022.
-
Permutatorial Optimization via the Permutahedron
Authors:
J. Carlos Martinez Mori,
Samitha Samaranayake
Abstract:
A water company decides to expand its network with a set of water lines, but it cannot build them all at once. However, it starts reaping benefits from a partial expansion. In what order should the company build the lines? We formalize a class of permutatorial problems with combinatorial/continuous subproblems capturing applications of incremental deployment. We show that, for additive/linear obje…
▽ More
A water company decides to expand its network with a set of water lines, but it cannot build them all at once. However, it starts reaping benefits from a partial expansion. In what order should the company build the lines? We formalize a class of permutatorial problems with combinatorial/continuous subproblems capturing applications of incremental deployment. We show that, for additive/linear objective functions, efficient polyhedral methods for the subproblems extend to the permutatorial problem. Our main technical ingredient is the permutahedron.
△ Less
Submitted 25 July, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
On the Request-Trip-Vehicle Assignment Problem
Authors:
J. Carlos Martínez Mori,
Samitha Samaranayake
Abstract:
The request-trip-vehicle assignment problem is at the heart of a popular decomposition strategy for online vehicle routing. In this framework, assignments are done in batches in order to exploit any shareability among vehicles and incoming travel requests. We study a natural ILP formulation and its LP relaxation. Our main result is an LP-based randomized rounding algorithm that, whenever the insta…
▽ More
The request-trip-vehicle assignment problem is at the heart of a popular decomposition strategy for online vehicle routing. In this framework, assignments are done in batches in order to exploit any shareability among vehicles and incoming travel requests. We study a natural ILP formulation and its LP relaxation. Our main result is an LP-based randomized rounding algorithm that, whenever the instance is feasible, leverages mild assumptions to return an assignment whose: i) expected cost is at most that of an optimal solution, and ii) expected fraction of unassigned requests is at most $1/e$. If trip-vehicle assignment costs are $α$-approximate, we pay an additional factor of $α$ in the expected cost. We can relax the feasibility requirement by considering the penalty version of the problem, in which a penalty is paid for each unassigned request. We find that, whenever a request is repeatedly unassigned after a number of rounds, with high probability it is so in accordance with the sequence of LP solutions and not because of a rounding error. We additionally introduce a deterministic rounding heuristic inspired by our randomized technique. Our computational experiments show that our rounding algorithms achieve a performance similar to that of the ILP at a reduced computation time, far improving on our theoretical guarantee. The reason for this is that, although the assignment problem is hard in theory, the natural LP relaxation tends to be very tight in practice.
△ Less
Submitted 31 July, 2021; v1 submitted 19 November, 2020;
originally announced November 2020.
-
The Batched Set Cover Problem
Authors:
Juan C. Martínez Mori,
Samitha Samaranayake
Abstract:
We introduce the batched set cover problem, which is a generalization of the online set cover problem. In this problem, the elements of the ground set that need to be covered arrive in batches. Our main technical contribution is a tight $Ω(H_{m - 2^z + 1})$ lower bound on the competitive ratio of any fractional batched algorithm given an adversary that is required to produce batches of VC-dimensio…
▽ More
We introduce the batched set cover problem, which is a generalization of the online set cover problem. In this problem, the elements of the ground set that need to be covered arrive in batches. Our main technical contribution is a tight $Ω(H_{m - 2^z + 1})$ lower bound on the competitive ratio of any fractional batched algorithm given an adversary that is required to produce batches of VC-dimension at least $z$, for some $z \in \mathbb{N}^0$. This restriction on the adversary is motivated by the fact that, in some real world applications, decisions are made after collecting batches of data of non-trivial VC-dimension. In particular, ridesharing systems rely on the batch assignment of trip requests to vehicles, and some related problems such as that of optimal congregation points for passenger pickups and dropoffs can be modeled as a batched set cover problem with VC-dimension greater than or equal to two. Furthermore, we note that while any online algorithm may be used to solve the batched set cover problem by artificially sequencing the elements in a batch, this procedure may neglect the rich information encoded in the complex interactions between the elements of a batch and the sets that contain them. Therefore, we propose a minor modification to an online algorithm found in [8] to obtain an algorithm that attempts to exploit such information. Unfortunately, we are unable to improve its analysis in a way that reflects this intuition. However, we present computational experiments that provide empirical evidence of a constant factor improvement in the competitive ratio. To the best of our knowledge, we are the first to use the VC-dimension in the context of online (batched) covering problems.
△ Less
Submitted 26 November, 2018;
originally announced November 2018.