Balancing Gaussian vectors in high dimension

Abstract: Motivated by problems in controlled experiments, we study the discrepancy of random matrices with continuous entries where the number of columns $n$ is much larger than the number of rows $m$. Our first result shows that if $\omega(1) = m = o(n)$, a matrix with i.i.d. standard Gaussian entries has discrepancy $\Theta(\sqrt{n} \, 2^{-n/m})$ with high probability. This provides sharp guarantees for Gaussian discrepancy in a regime that had not been considered before in the existing literature. Our results also apply to a more general family of random matrices with continuous i.i.d. entries, assuming that $m = O(n/\log{n})$. The proof is non-constructive and is an application of the second moment method. Our second result is algorithmic and applies to random matrices whose entries are i.i.d. and have a Lipschitz density. We present a randomized polynomial-time algorithm that achieves discrepancy $e^{-\Omega(\log^2(n)/m)}$ with high probability, provided that $m = O(\sqrt{\log{n}})$. In the one-dimensional case, this matches the best known algorithmic guarantees due to Karmarkar--Karp. For higher dimensions $2 \leq m = O(\sqrt{\log{n}})$, this establishes the first efficient algorithm achieving discrepancy smaller than $O( \sqrt{m} )$.

06/12/2021

Data, Challenges, Implementations, and Software -> Benchmarks; Deep Learning -> Adversarial Networks, Deep Learning -> Generative Models

3:21

26/08/2020

Balancing Gaussian vectors in high dimension

Paxton M Turner, Raghu Meka, Philippe Rigollet

Comments

Similar Papers

Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize

Alain Durmus, Eric Moulines, Alexey Naumov and Sergey Samsonov, Kevin Scaman, Hoi-To Wai

Keywords Abstract Paper

Algorithms for heavy-tailed statistics: Regression, covariance estimation, and beyond

Yeshwanth Cherapanamjeri, Samuel B. Hopkins, Tarun Kathuria and Prasad Raghavendra, Nilesh Tripuraneni

Keywords Abstract Paper

Sum-of-squares, Algorithms, Heavy-Tailed Estimation

Deterministic Sparse Fourier Transform with an 𝓁_{∞} Guarantee

Yi Li, Vasileios Nakos

Keywords Abstract Paper

Fourier sparse recovery, derandomization, incoherent matrices

Approximation algorithms for orthogonal non-negative matrix factorization

Moses Charikar, Lunjia Hu

Keywords Abstract Paper

Near-Optimal Entrywise Sampling of Numerically Sparse Matrices

Vladimir Braverman, Robert Krauthgamer, Aditya R Krishnan, Shay Sapir

Keywords Abstract Paper

Better Algorithms for Estimating Non-Parametric Models in Crowd-Sourcing and Rank Aggregation

Allen X Liu, Ankur Moitra

Keywords Abstract Paper

Matrix/tensor estimation, Learning with algebraic or combinatorial structure, Ranking and preference learning

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser and Gleb Novikov, David Steurer, Stefan Tiegel

Keywords Abstract Paper

Robust Gaussian Covariance Estimation in Nearly-Matrix Multiplication Time

Jerry Li, Guanghao Ye

Keywords Abstract Paper

Consistent regression when oblivious outliers overwhelm

Tommaso d'Orsi, Gleb Novikov, David Steurer

Keywords Abstract Paper

Theory, Game Theory and Computational Economics, Theory, Theory, Computational Complexity

Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization

Tian Ye, Simon Du

Keywords Abstract Paper

theory, optimization

Extractors for adversarial sources via extremal hypergraphs

Eshan Chattopadhyay, Jesse Goodman, Vipul Goyal, Xin Li

Keywords Abstract Paper

randomness extractors, non-malleable extractors, extremal hypergraphs, explicit constructions, cap sets, Ramsey graphs

A Fast Spectral Algorithm for Mean Estimation with Sub-Gaussian Rates

Zhixian Lei, Kyle Luh, Prayaag Venkat, Fred Zhang

Keywords Abstract Paper

High-dimensional statistics, Adversarial learning and robustness

Optimal Estimator for Unlabeled Linear Regression

Hang Zhang, Ping Li

Keywords Abstract Paper

Faster Binary Embeddings for Preserving Euclidean Distances

Jinjie Zhang, Rayan Saab

Keywords Abstract Paper

Binary Embeddings, Sigma Delta Quantization, Johnson-Lindenstrauss Transforms

Fast Matrix Square Roots with Applications to Gaussian Processes and Bayesian Optimization

Geoff Pleiss, Martin Jankowiak, David Eriksson and Anil Damle, Jacob Gardner

Keywords Abstract Paper

Data, Challenges, Implementations, and Software -> Benchmarks; Deep Learning -> Adversarial Networks, Deep Learning -> Generative Models

Minimax Rank-$1$ Matrix Factorization

Venkatesh Saligrama, Alexander Olshevsky, Julien Hendrickx

Keywords Abstract Paper

Optimal Sketching for Trace Estimation

Shuli Jiang, Hai Pham, David Woodruff, Richard Zhang

Keywords Abstract Paper

Asymptotic Errors for High-Dimensional Convex Penalized Linear Regression beyond Gaussian Matrices

Alia Abbara, Florent Krzakala, Cedric Gerbelot

Keywords Abstract Paper

Statistical physics, Convex optimization, High-dimensional statistics, Regression, Supervised learning

Analysis of one-hidden-layer neural networks via the resolvent method

Vanessa Piccolo, Dominik Schröder

Keywords Abstract Paper

theory, deep learning

The Bethe and Sinkhorn Permanents of Low Rank Matrices and Implications for Profile Maximum Likelihood

Nima Anari, Moses Charikar, Kirankumar Shiragur, Aaron Sidford

Keywords Abstract Paper

Streaming and Distributed Algorithms for Robust Column Subset Selection

Shuli Jiang, Dongyu Li, Irene Mengze Li and Arvind Mahankali, David Woodruff

Keywords Abstract Paper

Algorithms, Deep Learning, Generative Models, Deep Learning, Predictive Models; Deep Learning, Recurrent Networks

On the hardness of massively parallel computation

Alain Durmus, Eric Moulines, Alexey Naumov and
Sergey Samsonov, Kevin Scaman, Hoi-To Wai

Keywords Paper

Yeshwanth Cherapanamjeri, Samuel B. Hopkins, Tarun Kathuria and
Prasad Raghavendra, Nilesh Tripuraneni

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser and
Gleb Novikov, David Steurer, Stefan Tiegel

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Geoff Pleiss, Martin Jankowiak, David Eriksson and
Anil Damle, Jacob Gardner

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Shuli Jiang, Dongyu Li, Irene Mengze Li and
Arvind Mahankali, David Woodruff

Keywords Paper

Keywords Paper

Keywords Paper

Marine Le Morvan, Julie Josse, Thomas Moreau and
Erwan Scornet, Gael Varoquaux

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Ilias Diakonikolas, Daniel Kane, Daniel Kongsgaard and
Jerry Li, Kevin Tian

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper