Beating the Union Bound by Geometric Techniques


Transcript: Beating the Union Bound by Geometric Techniques

Beating the Union Bound by Geometric Techniques
Raghu Meka (IAS & DIMACS)

Union Bound
Pr[E_1 ∪ E_2 ∪ ⋯ ∪ E_n] ≤ Pr[E_1] + ⋯ + Pr[E_n]

Popularized by Erdős

“When you have eliminated the impossible, whatever remains, however improbable, must be the truth.”

Probabilistic Method 101
• Ramsey graphs – Erdős
• Coding theory – Shannon
• Metric embeddings – Johnson-Lindenstrauss
• …

Beating the Union Bound
• Not always enough
Lovász Local Lemma: events E_1, E_2, …, E_n, each dependent on at most d others; if Pr[E_i] < 1/(4d) for all i, then Pr[∪_i E_i] < 1.

• Constructive: Beck’91, …, Moser’09, …
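
As a reminder of what “constructive” means in this line of work, here is a minimal Python sketch of Moser-style resampling for k-SAT. This is my illustration only, not anything from the talk; the clause encoding and the round budget are assumptions.

```python
import random

def moser_tardos_sat(clauses, n_vars, max_rounds=10**6):
    """Sketch of Moser-Tardos resampling for k-SAT.

    clauses: list of clauses, each a list of signed literals, e.g. [1, -3, 4]
             meaning (x1 or not x3 or x4). Under LLL-type conditions (each
             variable appears in few clauses), this terminates quickly in
             expectation.
    """
    # Start from a uniformly random assignment (index 0 unused).
    assignment = [random.choice([False, True]) for _ in range(n_vars + 1)]

    def satisfied(clause):
        return any(assignment[abs(lit)] == (lit > 0) for lit in clause)

    for _ in range(max_rounds):
        violated = [c for c in clauses if not satisfied(c)]
        if not violated:
            return assignment          # all "bad events" avoided
        # Pick some violated clause and resample its variables uniformly.
        clause = random.choice(violated)
        for lit in clause:
            assignment[abs(lit)] = random.choice([False, True])
    return None                         # did not converge within the budget
```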

Beating the Union Bound
I. Optimal, explicit ε-nets for Gaussians
• Kanter’s lemma, convex geometry
II. Constructive Discrepancy Minimization
• Edge-Walk: a new LP rounding method

Geometric techniques “Truly” constructive

Outline
I. Optimal, explicit ε-nets for Gaussians
• Kanter’s lemma, convex geometry
II. Constructive Discrepancy Minimization
• Edge-Walk: a new LP rounding method

Epsilon Nets
• Discrete approximations
• Applications: integration, computational geometry, …

Epsilon Nets for Gaussians
Explicit discrete approximations of the Gaussian distribution. Even existence is not clear!

Nets in Gaussian Space
Thm: Explicit ε-net of size (1/ε)^{O(k)}.

• Optimal: matching lower bound
• Union bound: (k/ε)^k
• Dadush-Vempala’12: ((log k)/ε)^k

First: Application to Gaussian Processes and Cover Times

Gaussian Processes (GPs) Multivariate Gaussian Distribution

Supremum of Gaussian Processes (GPs)
Given (X_i), want to study E[sup_i X_i].
• The supremum is natural: e.g., balls and bins

Supremum of Gaussian Processes (GPs)
Given v_1, …, v_n ∈ ℝ^d and g ~ N(0, I_d), want to study E[sup_i ⟨v_i, g⟩]; e.g., for orthonormal vectors this is Θ(√(log n)).
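
To make the quantity concrete, here is a naive Monte Carlo estimator of E[sup_i ⟨v_i, g⟩]. This is my illustration only and not the algorithm of the talk; it is exponentially weaker than the PTAS discussed next.

```python
import numpy as np

def mc_sup_gaussian_process(V, n_samples=10000, seed=0):
    """Monte Carlo estimate of E[ sup_i <v_i, g> ] for rows v_i of V, g ~ N(0, I_d).

    V: (n, d) array of vectors defining the Gaussian process X_i = <v_i, g>.
    A naive estimator, only meant to make the object concrete.
    """
    rng = np.random.default_rng(seed)
    n, d = V.shape
    g = rng.standard_normal((n_samples, d))   # samples of g
    sups = (g @ V.T).max(axis=1)              # sup_i <v_i, g> for each sample
    return sups.mean()

# Example: orthonormal vectors give the max of n iid N(0,1) variables,
# which grows like sqrt(2*log(n)) to leading order.
if __name__ == "__main__":
    print(mc_sup_gaussian_process(np.eye(1000)))
```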

Why Gaussian Processes?

Stochastic processes, functional analysis, convex geometry, machine learning, many more!

Cover Times of Graphs
Aldous-Fill 94: Compute the cover time. E.g.: cover(K_n) = Θ(n log n)
• KKLV00: O((log log n)²) approximation
• Feige-Zeitouni’09: FPTAS for trees

Cover Times and GPs
Thm (Ding, Lee, Peres 10): deterministic poly-time O(1)-approximation for the cover time.

Transfer to GPs: reduces to computing the supremum of a GP.
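
For context, the Ding-Lee-Peres connection, as I recall it (the precise normalization is not on these slides), relates the cover time of G = (V, E) to the expected supremum of the Gaussian free field η on G:

```latex
\[
  \mathrm{cov}(G) \;\asymp\; |E| \,\Bigl(\mathbb{E}\,\sup_{v \in V} \eta_v\Bigr)^{2},
  \qquad
  \mathbb{E}\bigl[(\eta_u - \eta_v)^2\bigr] = R_{\mathrm{eff}}(u,v).
\]
```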

Computing the Supremum Question (Lee10, Ding11): PTAS for computing the supremum of GPs?

• Ding, Lee, Peres 10: O(1) approximation
• Can’t beat O(1): Talagrand’s majorizing measures

Main Result Thm: PTAS for computing the supremum of Gaussian processes.

Thm: PTAS for computing cover time of bounded degree graphs.

Heart of PTAS: epsilon net (dimension reduction à la JL, then use an exponential-size net)

Construction of Net

Construction of ε-net
Simplest possible: lift a univariate net to the multivariate case (k-fold product).
1. How fine a net?

Construction of ε-net
Simplest possible: univariate to multivariate (k-fold product).
Lem: Granularity δ ~ f(ε) is enough.

Key point that beats union bound

Construction of ε-net
This talk: analyze a ‘step-wise’ approximator γ_ℓ.
Even out the mass in each interval, e.g. [−δ, δ].

[Figure: the step-wise density γ_ℓ on the grid −4δ, …, 4δ]

Construction of ε-net
Take the univariate net and lift it to the multivariate case (k-fold product).
[Figure: γ and its step-wise approximation γ_ℓ on the grid −4δ, …, 4δ]
Lem: Granularity δ ~ f(ε) is enough.
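
To make the lifting concrete, here is a toy sketch (my own illustration, not the talk’s exact construction): build a univariate grid of granularity δ, take its k-fold product, and cut points outside a ball. The values of δ and the truncation radius are placeholders.

```python
import itertools
import numpy as np

def gaussian_net(k, delta, radius):
    """Toy epsilon-net for the k-dimensional standard Gaussian.

    Builds a univariate grid {-radius, ..., -delta, 0, delta, ..., radius}
    and lifts it to k dimensions by taking the k-fold product, keeping only
    points inside the ball of the given radius.  The size is
    (O(radius/delta))^k, so this is only usable for small k; the point of
    the talk is that delta ~ poly(eps) independent of k suffices.
    """
    grid = np.arange(-radius, radius + delta / 2, delta)
    net = []
    for point in itertools.product(grid, repeat=k):
        p = np.array(point)
        if np.linalg.norm(p) <= radius:     # "cut points outside the ball"
            net.append(p)
    return np.array(net)

# Example: a 2-dimensional net with granularity 0.5 inside radius 3.
if __name__ == "__main__":
    print(gaussian_net(2, 0.5, 3.0).shape)
```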

Dimension Free Error Bounds
Thm: For δ ~ ε^{1.5} and φ a norm, the step-wise approximation incurs error at most ε, independent of the dimension k.
• Proof by “sandwiching”
• Exploit convexity critically
[Figure: γ and γ_ℓ on the grid −4δ, …, 4δ]

Analysis of Error
Def: For symmetric densities p, q: p ≼ q (p is less peaked than q) if p(K) ≤ q(K) for all symmetric convex sets K.

• Why interesting? If p ≼ q, then for any norm φ, E_p[φ] ≥ E_q[φ].
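
To spell out the step this bullet relies on (a standard layer-cake computation; K_t is my notation):

```latex
% For a density p and norm \varphi, with K_t = \{x : \varphi(x) \le t\} symmetric and convex,
\[
  \mathbb{E}_{x \sim p}[\varphi(x)]
  \;=\; \int_0^{\infty} \Pr_{x \sim p}\bigl[\varphi(x) > t\bigr]\,dt
  \;=\; \int_0^{\infty} \bigl(1 - p(K_t)\bigr)\,dt,
\]
% so p \preceq q (i.e. p(K) \le q(K) for all symmetric convex K) gives
% \mathbb{E}_p[\varphi] \ge \mathbb{E}_q[\varphi].
```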

Analysis for Univariate Case
Fact: γ_ℓ ≼ γ.

Proof: Spreading away from origin!

[Figure: the mass in each cell of the grid −4δ, …, 4δ is spread away from the origin]

Analysis for Univariate Case
Fact: γ ≼ γ_u.
Proof: For δ ≪ ε, γ_u = pdf of Y; mirror of the earlier spreading argument: push mass towards the origin.
[Figure: the upper approximator γ_u]

Analysis for Univariate Case
Combining upper and lower: γ_ℓ ≼ γ ≼ γ_u.

Lifting to Multivariate Case
Kanter’s Lemma (77): if p ≼ q and both are unimodal, then p^k ≼ q^k.
Dimension free!

Lifting to Multivariate Case
γ_ℓ^k ≼ γ^k ≼ γ_u^k
Dimension free: the key point that beats the union bound!
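
In symbols, my paraphrase of the sandwiching step:

```latex
\[
  \gamma_\ell \preceq \gamma \preceq \gamma_u
  \;\;\Longrightarrow\;\;
  \gamma_\ell^{\otimes k} \preceq \gamma^{\otimes k} \preceq \gamma_u^{\otimes k}
  \;\;\Longrightarrow\;\;
  \mathbb{E}_{\gamma_\ell^{\otimes k}}[\varphi]
  \;\ge\; \mathbb{E}_{\gamma^{\otimes k}}[\varphi]
  \;\ge\; \mathbb{E}_{\gamma_u^{\otimes k}}[\varphi]
  \quad \text{for every norm } \varphi,
\]
% the first implication by Kanter's lemma (using unimodality), the second by the
% layer-cake identity above; neither step loses a dimension-dependent factor.
```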

Summary of Net Construction
1. Granularity δ ~ ε^{1.5} is enough
2. Cut points outside the √k-ball
⇒ Optimal ε-net

Outline
I. Optimal, explicit ε-nets for Gaussians
• Kanter’s lemma, convex geometry
II. Constructive Discrepancy Minimization
• Edge-Walk: a new LP rounding method

Discrepancy
• Subsets S_1, S_2, …, S_m ⊆ [n]
• Color elements with −1 or 1 to minimize the maximum imbalance over the sets
[Figure: example set system with the imbalance of each set]

Discrepancy Examples
• Fundamental combinatorial concept
Arithmetic progressions: Roth 64: Ω(n^{1/4}); Matoušek-Spencer 96: Θ(n^{1/4})

Discrepancy Examples
• Fundamental combinatorial concept
Halfspaces: Alexander 90; Matoušek 95

Why Discrepancy?

Complexity theory, communication complexity, computational geometry, pseudorandomness, many more!

Spencer’s Six Sigma Theorem
“Six standard deviations suffice”
Spencer 85: A system with n sets has discrepancy at most 6√n.
• Central result in discrepancy theory.

• Tight: Hadamard matrices
• Beats the union bound: a random coloring only gives O(√(n log n))

A Conjecture and a Disproof
Spencer 85: A system with n sets has discrepancy at most 6√n.
• Non-constructive pigeonhole proof

Six Sigma Theorem
New elementary geometric proof of Spencer’s result.
Main: Can efficiently find a coloring with discrepancy O(√n).

• Truly constructive
• Algorithmic partial coloring lemma
• Extends to other settings

Outline of Algorithm 1. Partial coloring method 2. EDGE-WALK: geometric picture

Partial Coloring Method
• Beck 80: find a partial assignment in {−1, 0, 1}^n with fewer than n/2 zeros

Partial Coloring Method
Input: vectors v_1, …, v_m and discrepancy thresholds.
Output: a partial coloring meeting the thresholds, with fewer than n/2 zeros.
Lemma: Can do this in randomized polynomial time.

Outline of Algorithm 1. Partial coloring Method 2. EDGE-WALK: Geometric picture

Discrepancy: Geometric View
• Subsets S_1, S_2, …, S_m ⊆ [n]
• Color with −1 or 1 to minimize imbalance
[Figure: example incidence matrix over elements 1–5 with a ±1 coloring and the resulting imbalances]

Discrepancy: Geometric View
• Vectors v_1, v_2, …, v_m ∈ {0,1}^n
• Want x ∈ {−1,1}^n with all |⟨v_j, x⟩| small
[Figure: the same example viewed as 0/1 incidence vectors]

Discrepancy: Geometric View
• Vectors v_1, v_2, …, v_m ∈ {0,1}^n
• Want x ∈ {−1,1}^n with all |⟨v_j, x⟩| small
Gluskin 88: polytopes, Kanter’s lemma, …!

Goal: Find a non-zero lattice point inside the polytope of low-discrepancy colorings.
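
One way to write this body explicitly (a sketch; Δ is a stand-in for the per-set thresholds, which this slide does not spell out):

```latex
\[
  K \;=\; \bigl\{\, x \in [-1,1]^n \;:\; |\langle v_j, x\rangle| \le \Delta \ \text{for all } j \in [m] \,\bigr\},
  \qquad \text{goal: a non-zero point of } K \cap \{-1,0,1\}^n .
\]
```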

Edge-Walk • Start at origin • Brownian motion till you hit a face • Brownian motion within the face

Edge-Walk: Algorithm
Gaussian random walk in subspaces:
• Subspace V, rate γ
• Gaussian walk in V
• Standard normal in V: via an orthonormal basis change

Edge-Walk Algorithm Discretization issues: hitting faces • Might not hit face • Slack: face hit if close to it.

Edge-Walk: Algorithm
• Input: vectors v_1, v_2, …, v_m
• Parameters: δ, Δ, γ ≪ δ, T = 1/γ²
1. X_0 = 0. For t = 1, …, T:
2. Var_t = cube faces nearly hit by X_t
   Disc_t = discrepancy faces nearly hit by X_t
   V_t = subspace orthogonal to Var_t, Disc_t
3. X_t = X_{t−1} + γ·G_t, where G_t is a standard Gaussian in V_t
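
Below is a compact Python rendering of the walk just described, as I read the slide; the default parameter values, the clipping to the cube, and the fixed-slack “nearly hit” test are simplifications of mine, not the talk’s exact choices.

```python
import numpy as np

def edge_walk(V, lam, gamma=0.01, delta=0.05, seed=0):
    """Sketch of the Edge-Walk partial-coloring walk.

    V:   (m, n) matrix whose rows are the vectors v_1, ..., v_m.
    lam: length-m array of discrepancy thresholds (faces |<v_j, x>| = lam_j).
    Returns a point x in [-1,1]^n; with suitable parameters many coordinates
    end up within delta of +1/-1.
    """
    rng = np.random.default_rng(seed)
    m, n = V.shape
    x = np.zeros(n)
    T = int(1 / gamma**2)
    for _ in range(T):
        # Faces "nearly hit": variable (cube) faces and discrepancy faces.
        var_hit = np.abs(x) >= 1 - delta              # Var_t
        disc_hit = np.abs(V @ x) >= lam - delta       # Disc_t
        # V_t: subspace orthogonal to e_i (i in Var_t) and v_j (j in Disc_t).
        constraints = np.vstack([np.eye(n)[var_hit], V[disc_hit]])
        if constraints.size:
            # Orthonormal basis of the orthogonal complement via SVD.
            _, s, vt = np.linalg.svd(constraints)
            rank = int((s > 1e-10).sum())
            basis = vt[rank:]                         # rows span V_t
        else:
            basis = np.eye(n)
        if basis.shape[0] == 0:
            break                                     # no room left to move
        # Standard Gaussian step inside V_t, scaled by gamma.
        g = basis.T @ rng.standard_normal(basis.shape[0])
        x = np.clip(x + gamma * g, -1.0, 1.0)         # clip: simplification
    return x
```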

Edge-Walk: Partial Coloring
Lem: For suitable δ, γ, with probability at least 0.1 the walk ends with Ω(n) coordinates within δ of ±1 and every discrepancy constraint satisfied.

Edge-Walk: Analysis
Discrepancy faces are much farther away than the cube’s faces, so the walk hits many more variable (cube) faces than discrepancy faces.

Six Suffice
1. Edge-Walk: algorithmic partial coloring
2. Recurse on the unfixed variables
⇒ Spencer’s Theorem
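
For completeness, the standard accounting behind step 2 (my sketch; n_i denotes the number of variables still unfixed after i rounds, m = n sets, and the constants are indicative only):

```latex
% Each round fixes at least half of the remaining variables, so n_i <= n/2^i,
% and the i-th round can use thresholds O(sqrt(n_i log(2n/n_i))) = O(sqrt((i+1) n / 2^i)):
\[
  \mathrm{disc}
  \;\le\; \sum_{i \ge 0} O\!\Bigl(\sqrt{(i+1)\,\tfrac{n}{2^{i}}}\Bigr)
  \;=\; O(\sqrt{n}) \sum_{i \ge 0} \sqrt{\tfrac{i+1}{2^{i}}}
  \;=\; O(\sqrt{n}).
\]
```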

Summary
I. Optimal, explicit ε-nets for Gaussians
• Kanter’s lemma, convex geometry
II. Constructive Discrepancy Minimization
• Edge-Walk: a new LP rounding method

Geometric techniques

Others: Invariance principle for polytopes (Harsha, Klivans, M.’10), …

Open Problems • FPTAS for computing supremum?

• Beck-Fiala conjecture 81?
– Discrepancy O(√t) for degree-t set systems.

• Applications of Edgewalk rounding?

Rothvoss’13: Improvements for bin-packing!

Thank you

Edge-Walk Rounding
Th: Given v_1, …, v_m and thresholds λ_1, λ_2, …, λ_m, can find X ∈ [−1,1]^n with
1. |⟨v_j, X⟩| ≤ λ_j for each j
2. Ω(n) coordinates of X equal to ±1