Vempala - IAS Video Lectures
Cool with a Gaussian:
An O*(n^3) volume algorithm
Ben Cousins and Santosh Vempala
The Volume Problem
Given a measurable, compact set K in n-dimensional space, find a number A such that:
(1 − ε) vol(K) ≤ A ≤ (1 + ε) vol(K)
K is given by:
- a point x_0 ∈ K, s.t. x_0 + B_n ⊆ K
- a membership oracle: answers YES/NO to "x ∈ K?"
Volume: first attempt
- Divide and conquer.
- Difficulty: the number of parts grows exponentially in n.
More generally: Integration
Input: integrable function f: R^n → R_+ specified by an oracle, point x, error parameter ε.
Output: number A such that:
(1 − ε) ∫f ≤ A ≤ (1 + ε) ∫f
Volume is the special case when f is a 0-1 function.
High-dimensional problems
- Integration (volume)
- Optimization
- Learning
- Rounding
- Sampling
All hopelessly intractable in general, even to approximate.
High-dimensional problems
Input:
- A set of points S in n-dimensional space R^n, or a distribution on R^n
- A function f that maps points to real values (could be the indicator of a set)
What is the complexity of computational problems as the dimension grows?
- Dimension = number of variables
- Typically, the size of the input is a function of the dimension.
Structure
Q. What structure makes high-dimensional problems computationally tractable? (i.e., solvable with polynomial complexity)
Convexity and its extensions appear to be the frontier of polynomial-time solvability.
Volume: second attempt: Sandwiching
Thm (John). Any convex body K has an ellipsoid E s.t. E ⊆ K ⊆ nE.
E = the maximum-volume ellipsoid contained in K.
Thm (KLS95). For a convex body K in isotropic position,
√((n+1)/n) · B_n ⊆ K ⊆ √(n(n+1)) · B_n.
Also a factor-n sandwiching, but with a different ellipsoid.
Isotropic position and sandwiching
- For any convex body K (in fact, any set/distribution with bounded second moments), we can apply an affine transformation so that for a random point x from K:
E(x) = 0,  E(x xᵀ) = I_n.
- Thus K "looks like a ball" up to second moments.
- How close is it really to a ball?
K lies between two balls with radii within a factor of n.
Volume via Sandwiching
- The John ellipsoid can be approximated using the Ellipsoid algorithm, s.t.
E ⊆ K ⊆ n^1.5 E
- The inertial ellipsoid can be approximated to within any constant factor (we'll see how)
- Using either one,
E ⊆ K ⊆ n^O(1) E, so vol(E) ≤ vol(K) ≤ n^O(n) vol(E).
- Polytime algorithm, n^O(n) approximation
- Can we do better?
Complexity of Volume Estimation
Thm [E86, BF87]. For any deterministic algorithm that uses at most n^a membership calls to the oracle for a convex body K and computes two numbers A and B such that A ≤ vol(K) ≤ B, there is some convex body for which the ratio B/A is at least
(c n / (a log n))^(n/2),
where c is an absolute constant.
Thm [DF88]. Computing the volume of an explicit polytope Ax ≤ b is #P-hard, even for a totally unimodular matrix A and rational b.
Complexity of Volume Estimation
Thm [BF]. For deterministic algorithms: a trade-off between the number of oracle calls and the achievable approximation factor.
Thm [Dadush-V.13]. Matching upper bound of (1 + ε)^n in time (1/ε)^O(n) poly(n).
Randomized Volume/Integration
[DFK89]. Polynomial-time randomized algorithm that estimates the volume to within relative error (1 + ε) with probability at least 1 − δ in time poly(n, 1/ε, log(1/δ)).
[Applegate-K91]. Polytime randomized algorithm to estimate the integral of any (Lipschitz) logconcave function.
Volume Computation: an ongoing adventure
Algorithm             | Power | New aspects
Dyer-Frieze-Kannan 89 | 23    | everything
Lovász-Simonovits 90  | 16    | localization
Applegate-K 90        | 10    | logconcave integration
L 90                  | 10    | ball walk
DF 91                 | 8     | error analysis
LS 93                 | 7     | multiple improvements
KLS 97                | 5     | speedy walk, isotropy
LV 03,04              | 4     | annealing, wt. isoper.
LV 06                 | 4     | integration, local analysis
Cousins-V. 13, 14     | 3     | Gaussian cooling
(Power = exponent of n in the running time, O*(n^Power).)
Does it work?
- [Lovász-Deák 2012] implemented the [LV] O*(n^4) algorithm
  - worked for cubes up to dimension 9
  - but too slow after that.
- [CV13] Matlab implementation of a new algorithm
(plot: running times on rotated cubes)
Volume: third attempt: Sampling
- Pick random samples from a ball/cube containing K.
- Compute the fraction c of the sample in K.
- Output c · vol(outer ball).
- Need too many samples!
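To see concretely why the naive estimator breaks down, here is a small sketch (not from the talk; the choice of K, sample count, and bounding cube are illustrative) that estimates the volume of the unit ball by counting hits from uniform samples in the cube [-1,1]^n:

```python
import math
import random

def naive_volume(n, samples=100_000, seed=0):
    """Estimate vol(unit ball in R^n) as (fraction of cube samples in K) * vol(cube)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(samples):
        x = [rng.uniform(-1.0, 1.0) for _ in range(n)]
        if sum(c * c for c in x) <= 1.0:  # membership oracle for K = unit ball
            hits += 1
    return (hits / samples) * 2.0 ** n    # vol([-1,1]^n) = 2^n

# n = 2: fine (true value is pi).  n = 20: the hit probability is ~2.5e-8,
# so the estimate is either 0 or, if a stray sample hits, off by a factor > 400.
```

In low dimension the fraction is a constant and the estimate converges; as n grows, the in-K fraction (and hence the useful sample count) collapses exponentially.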
Volume via Sampling [DFK89]
B ⊆ K ⊆ R·B. Let K_i = K ∩ 2^(i/n) B, i = 0, 1, …, m = n log_2 R.
vol(K) = vol(B) · (vol(K_1)/vol(K_0)) · (vol(K_2)/vol(K_1)) ⋯ (vol(K_m)/vol(K_{m−1})).
Estimate each ratio with random samples.
Volume via Sampling
K_i = K ∩ 2^(i/n) B,  i = 0, 1, …, m = n log_2 R.
vol(K) = vol(B) · (vol(K_1)/vol(K_0)) · (vol(K_2)/vol(K_1)) ⋯ (vol(K_m)/vol(K_{m−1})).
Claim. vol(K_{i+1}) ≤ 2 · vol(K_i).
Total #samples = m · O*(m/ε^2) = O*(n^2).
But, how to sample?
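The telescoping product can be exercised end-to-end in low dimension. The sketch below is illustrative, not the talk's algorithm: K is a cube, and rejection sampling from the enclosing ball stands in for the random-walk sampler that the rest of the talk develops.

```python
import math
import random

rng = random.Random(1)

def in_cube(x):  # membership oracle for K = [-1,1]^n
    return all(abs(c) <= 1.0 for c in x)

def sample_Ki(n, r):
    """Uniform sample from K ∩ (r·B) by rejection from the ball r·B.
    (Stand-in for a random-walk sampler; only feasible in low dimension.)"""
    while True:
        d = [rng.gauss(0.0, 1.0) for _ in range(n)]
        norm = math.sqrt(sum(c * c for c in d))
        u = rng.random() ** (1.0 / n)            # uniform radius in the ball
        x = [c * r * u / norm for c in d]
        if in_cube(x):
            return x

def dfk_volume(n, samples=20_000):
    R = math.sqrt(n)                             # B ⊆ K ⊆ R·B for the cube
    m = math.ceil(n * math.log2(R))
    est = math.pi ** (n / 2) / math.gamma(n / 2 + 1)   # vol(K_0) = vol(B)
    for i in range(m):
        r_prev = min(2.0 ** (i / n), R)
        r_next = min(2.0 ** ((i + 1) / n), R)
        # Fraction of K_{i+1}-samples that land in K_i estimates vol(K_i)/vol(K_{i+1}).
        inside = sum(
            1 for _ in range(samples)
            if math.sqrt(sum(c * c for c in sample_Ki(n, r_next))) <= r_prev
        )
        est /= inside / samples
    return est

# For n = 3 the true volume of [-1,1]^3 is 8.
```

Each ratio is at least 1/2 (the Claim above), so a modest number of samples per phase suffices; the whole difficulty, addressed next, is replacing the rejection sampler.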
Sampling
Generate:
- a uniform random point from a compact set S,
- or a point with density proportional to a function f.
Numerous applications in diverse areas: statistics, networking, biology, computer vision, privacy, operations research, etc.
Sampling
Input: function f: R^n → R_+ specified by an oracle, point x, error parameter ε.
Output: A point y from a distribution within distance ε of the distribution with density proportional to f.
Logconcave functions
- f: R^n → R_+ is logconcave if for any x, y ∈ R^n and λ ∈ [0, 1],
f(λx + (1 − λ)y) ≥ f(x)^λ f(y)^(1−λ)
- Examples:
  - indicator functions of convex sets
  - Gaussian density function
  - exponential function
- Level sets of f, L_t = {x : f(x) ≥ t}, are convex.
- Products, minima, and convolutions preserve logconcavity.
Algorithmic Applications
Given a blackbox for sampling logconcave densities, we get efficient algorithms for:
- Rounding
- Convex Optimization
- Volume Computation/Integration
- some Learning problems
Rounding via Sampling
1. Sample m random points from K;
2. Compute the sample mean z = E(x) and sample covariance matrix A = E((x − z)(x − z)ᵀ);
3. Output B = A^(−1/2).
B(K − z) is nearly isotropic.
Thm. C(ε)·n random points suffice to get E(‖A − I‖_2) ≤ ε.
[Adamczak et al.; improving on Bourgain, Rudelson]
I.e., for any unit vector v,
1 − ε ≤ E((vᵀx)^2) ≤ 1 + ε.
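Steps 1-3 are easy to sketch with numpy; the skewed test body, point count, and affine map below are illustrative assumptions standing in for samples from K:

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for samples from a convex body K: a uniform cube, skewed by an affine map.
T = np.array([[3.0, 1.0], [0.0, 0.5]])
points = rng.uniform(-1.0, 1.0, size=(50_000, 2)) @ T.T + np.array([5.0, -2.0])

# 1-2. Sample mean and sample covariance matrix.
z = points.mean(axis=0)
A = np.cov(points, rowvar=False)

# 3. B = A^(-1/2); then B(K - z) is nearly isotropic.
evals, evecs = np.linalg.eigh(A)
B = evecs @ np.diag(evals ** -0.5) @ evecs.T
rounded = (points - z) @ B.T

# The rounded sample has mean ~0 and covariance ~I.
```

Whitening with the empirical covariance makes the rounded sample's mean and covariance exactly 0 and I by construction; the theorem above says O(n) fresh samples already make the *body* nearly isotropic.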
How to Sample?
Ball walk: At x,
- pick a random y from x + δB_n
- if y is in K, go to y (else stay at x)
Hit-and-Run: At x,
- pick a random chord L through x
- go to a random point y on L
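Both walks can be sketched in a few lines for K = [-1,1]^n given only a membership oracle; the step size δ, dimension, and iteration counts are illustrative, and for hit-and-run the chord of a cube is computed coordinate-wise:

```python
import math
import random

rng = random.Random(2)
n = 5

def in_K(x):  # membership oracle: K = [-1,1]^n
    return all(abs(c) <= 1.0 for c in x)

def ball_walk_step(x, delta=0.3):
    """Propose y uniform in x + delta*B_n; move only if y stays in K."""
    d = [rng.gauss(0.0, 1.0) for _ in range(n)]
    norm = math.sqrt(sum(c * c for c in d))
    r = delta * rng.random() ** (1.0 / n)
    y = [xi + r * di / norm for xi, di in zip(x, d)]
    return y if in_K(y) else x

def hit_and_run_step(x):
    """Pick a random direction; move to a uniform point on the chord through x."""
    d = [rng.gauss(0.0, 1.0) for _ in range(n)]
    norm = math.sqrt(sum(c * c for c in d))
    d = [c / norm for c in d]
    # For the cube, the chord {x + t*d} ∩ K is an interval [lo, hi], per coordinate.
    lo, hi = -math.inf, math.inf
    for xi, di in zip(x, d):
        if abs(di) > 1e-12:
            t1, t2 = (-1.0 - xi) / di, (1.0 - xi) / di
            lo, hi = max(lo, min(t1, t2)), min(hi, max(t1, t2))
    t = rng.uniform(lo, hi)
    return [xi + t * di for xi, di in zip(x, d)]

x = [0.9] * n                 # a (bad) start near a corner
for _ in range(5_000):
    x = hit_and_run_step(x)
```

Note that hit-and-run always moves, even from a corner, while the ball walk can reject almost every proposal there; this is exactly the local-conductance issue discussed below.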
Complexity of Sampling
Thm. [KLS97] For a convex body, the ball walk with an M-warm start reaches an (independent) nearly random point in poly(n, R, M) steps.
M = sup_S Q_0(S)/Q(S)  (or M_2(Q) = E_{Q_0}(Q_0(x)/Q(x))),
where Q_0 is the starting distribution and Q the stationary one.
Thm. [LV03]. The same holds for arbitrary logconcave density functions. From a warm start, the complexity is O*(n^2 R^2).
- The isotropic transformation makes R = O(√n).
- KLS volume algorithm: n × n × n^3 = n^5 (phases × samples per phase × steps per sample).
Markov chains
- State space K
- A set of measurable subsets that forms a σ-algebra, i.e., closed under complements and countable unions and intersections
- A next-step distribution P_u associated with each point u in the state space
- A starting point
- The walk w_0, w_1, …, w_i, … satisfies
P(w_i ∈ A | w_0, w_1, …, w_{i−1}) = P(w_i ∈ A | w_{i−1})
Convergence
Stationary distribution Q; the ergodic "flow" of a subset A is:
Φ(A) = ∫_A P_u(K∖A) dQ(u)
For any subset A, we have Φ(A) = Φ(K∖A).
Conductance:
φ(A) = ∫_A P_u(K∖A) dQ(u) / min(Q(A), Q(K∖A))
φ = inf_A φ(A)
The rate of convergence is bounded by 1/φ^2 [LS93, JS86].
Conductance
Arbitrary measurable subset S: how large is the conditional escape probability from S?
Local conductance can be arbitrarily small for the ball walk:
ℓ(x) = vol((x + δB_n) ∩ K) / vol(δB_n)
Conductance
Need:
- Nearby points have overlapping one-step distributions
- Large subsets have large boundaries [isoperimetry]: for any partition K = S_1 ∪ S_2 ∪ S_3,
π(S_3) ≥ (c/R) · d(S_1, S_2) · min(π(S_1), π(S_2)),
where R^2 = E_K(|x|^2).
Isoperimetry and the KLS conjecture
π(S_3) ≥ (c/R) · d(S_1, S_2) · min(π(S_1), π(S_2))
A = E((x − x̄)(x − x̄)ᵀ): covariance matrix of π
R^2 = E_π(‖x − x̄‖^2) = tr(A) = λ_1(A) + ⋯ + λ_n(A)
Thm. [KLS95].  π(S_3) ≥ (c/R) · d(S_1, S_2) · min(π(S_1), π(S_2))
Conj. [KLS95].  π(S_3) ≥ (c/√λ_1(A)) · d(S_1, S_2) · min(π(S_1), π(S_2))
KLS hyperplane conjecture
A = E(x xᵀ)
Conj. [KLS95].  π(S_3) ≥ (c/√λ_1(A)) · d(S_1, S_2) · min(π(S_1), π(S_2))
- Could improve sampling complexity by a factor of n
- Implies well-known conjectures in convex geometry: the slicing conjecture and the thin-shell conjecture
- But wide open!
KLS, Slicing, Thin-shell
Conjecture  | Current bound
KLS         | ~ n^(1/3) [Bobkov; Eldan-Klartag]
slicing     | n^(1/4) [Bourgain, Klartag]
thin shell  | n^(1/3) [Guedon-Milman]
All are conjectured to be O(1).
The conjectures are equivalent! [Ball, Eldan-Klartag]
Is rapid mixing possible?
- The ball walk can have bad starts, but hit-and-run escapes from corners
- Min-distance isoperimetry is too coarse
- Average-distance isoperimetry: how to average distance?
h(x) = min{ d(u, v) : u ∈ S_1, v ∈ S_2, x ∈ ℓ(u, v) }
Thm. [LV04]
π(S_3) ≥ E(h(x)) · π(S_1) · π(S_2)
Hit-and-run mixes rapidly
- Thm [LV04]. Hit-and-run mixes in polynomial time from any starting point inside a convex body.
- Conductance = Ω(1/(nD))
- Along with the isotropic transformation, this gives an O*(n^3) sampling algorithm.
Simulated Annealing [LV03, Kalai-V.04]
To estimate ∫f, consider a sequence f_0, f_1, f_2, …, f_m = f with ∫f_0 easy, e.g., a constant function over a ball.
Then,
∫f = ∫f_0 · (∫f_1/∫f_0) · (∫f_2/∫f_1) ⋯ (∫f_m/∫f_{m−1}).
Each ratio R_i = ∫f_{i+1}/∫f_i can be estimated by sampling:
1. Sample X with density proportional to f_i
2. Compute Y = f_{i+1}(X)/f_i(X).
Then,
E(Y) = ∫ (f_{i+1}(x)/f_i(x)) · (f_i(x)/∫f_i) dx = ∫f_{i+1}/∫f_i = R_i.
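The identity E(Y) = ∫f_{i+1}/∫f_i can be checked in one dimension, where the annealed densities are Gaussians we can sample exactly (the σ values below are illustrative):

```python
import math
import random

rng = random.Random(3)

def ratio_estimate(sigma_i, sigma_next, samples=200_000):
    """Monte Carlo estimate of E(Y), Y = f_{i+1}(X)/f_i(X),
    X ~ density proportional to f_i(x) = exp(-x^2 / (2 sigma_i^2))."""
    total = 0.0
    for _ in range(samples):
        x = rng.gauss(0.0, sigma_i)
        total += math.exp(-x * x / (2 * sigma_next ** 2) + x * x / (2 * sigma_i ** 2))
    return total / samples

# Exact value of the ratio: ∫f_{i+1} / ∫f_i = sigma_next / sigma_i
# (1-d Gaussian integrals are sigma * sqrt(2*pi)).
est = ratio_estimate(1.0, 1.2)
```

The whole art of annealing, as the next slides explain, is choosing the schedule so that each Y has small relative variance and few samples suffice per ratio.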
Annealing [LV06]
- Define: f_i(x) = e^(−a_i ‖x‖)
- a_0 = 2n,  a_{i+1} = a_i/(1 + 1/√n),  a_m = ε/(2R)
- m ~ √n log(2n/ε) phases
- ∫f = ∫f_0 · (∫f_1/∫f_0) · (∫f_2/∫f_1) ⋯ (∫f_m/∫f_{m−1})
- The final estimate could be n^Ω(n), so individual ratios could be very large. How can we estimate them with a few samples?!
Annealing [LV03, 06]
- f_i(x) = e^(−a_i ‖x‖),  a_0 = 2n,  a_{i+1} = a_i/(1 + 1/√n),  a_m = ε/(2R)
- Lemma. For Y = f_{i+1}(X)/f_i(X) with X drawn with density proportional to f_i:
E(Y^2) < 4 E(Y)^2.
- Although the expectation of Y can be large (exponential, even), we need only a few samples to estimate it!
- LoVe algorithm: √n × √n × n^3 = n^4
Variance of ratio estimator
Let Z(a, x) = e^(−a‖x‖) for x ∈ K, and let Y = Z(a_{i+1}, X)/Z(a_i, X), where X is drawn with density proportional to Z(a_i, ·). With F(a) = ∫_K e^(−a‖x‖) dx:
E(Y) = ∫ (Z(a_{i+1}, x)/Z(a_i, x)) · (Z(a_i, x)/F(a_i)) dx = F(a_{i+1})/F(a_i).
E(Y^2)/E(Y)^2 = [∫ (Z(a_{i+1}, x)^2/Z(a_i, x)^2) · (Z(a_i, x)/F(a_i)) dx] / (F(a_{i+1})/F(a_i))^2
= F(2a_{i+1} − a_i) · F(a_i) / F(a_{i+1})^2
= F(a(1 − α)) F(a(1 + α)) / F(a)^2,  writing a = a_{i+1} and a_i = a(1 + α).
(This would be at most 1 if F were logconcave…)
Variance of ratio estimator
Lemma. For any logconcave f and a > 0, the function
Z(a) = a^n ∫ f(x)^a dx
is also logconcave.
So a^n F(a) is logconcave, and
(a^n F(a))^2 ≥ (a(1 − α))^n F(a(1 − α)) · (a(1 + α))^n F(a(1 + α)).
Therefore:
F(a(1 − α)) F(a(1 + α)) / F(a)^2 ≤ 1/((1 − α)(1 + α))^n = 1/(1 − α^2)^n ≤ 4
for α ≤ 1/√n.
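A quick numeric sanity check of both steps, for the 1-d choice f = indicator of [0,1] (so F(a) = (1 − e^(−a))/a and Z(a) = a·F(a) = 1 − e^(−a)); the grid and α below are illustrative:

```python
import math

# Z(a) = a^n * F(a) with n = 1, f = indicator of [0,1]:
# F(a) = ∫_0^1 e^(-a t) dt = (1 - e^(-a))/a, so Z(a) = 1 - e^(-a).
def Z(a):
    return 1.0 - math.exp(-a)

# Discrete logconcavity: Z(a)^2 >= Z(a(1-alpha)) * Z(a(1+alpha)).
alpha = 0.3
for a in [0.1 * k for k in range(1, 200)]:
    assert Z(a) ** 2 >= Z(a * (1 - alpha)) * Z(a * (1 + alpha)) - 1e-12

# The resulting bound: F(a(1-alpha))F(a(1+alpha))/F(a)^2 <= (1 - alpha^2)^(-n) <= 4
# for alpha = 1/sqrt(n); note equality at n = 2, and the bound decreases toward e.
for n in [2, 3, 10, 100, 10_000]:
    assert (1.0 - 1.0 / n) ** (-n) <= 4.0 + 1e-9
```

The constant 4 is thus tight only at n = 2; for large n the variance bound approaches e.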
Volume Computation: an ongoing adventure
Algorithm             | Power | New aspects
Dyer-Frieze-Kannan 89 | 23    | everything
Lovász-Simonovits 90  | 16    | localization
Applegate-K 90        | 10    | logconcave integration
L 90                  | 10    | ball walk
DF 91                 | 8     | error analysis
LS 93                 | 7     | multiple improvements
KLS 97                | 5     | speedy walk, isotropy
LV 03,04              | 4     | annealing, wt. isoper.
LV 06                 | 4     | integration, local analysis
Cousins-V. 13, 14     | 3     | Gaussian cooling
Gaussian sampling/volume
- Sample from a Gaussian restricted to K
- Compute the Gaussian measure of K
- Anneal with a Gaussian:
- Define
f_i(x) = e^(−‖x‖^2/(2σ_i^2))
- Start with σ_0 small (~ 1/√n), increase σ in phases.
- Compute the ratios of integrals of consecutive phases:
∫f_{i+1} / ∫f_i
Gaussian sampling
- The KLS conjecture holds for the Gaussian restricted to any convex body (via the Brascamp-Lieb inequality):
Thm.  π(S_3) ≥ (c/σ) · d(S_1, S_2) · min(π(S_1), π(S_2))
- Not enough on its own, but can be used to show:
Thm. [Cousins-V. 13]. For σ^2 = O(1), the ball walk applied to the Gaussian N(0, σ^2 I_n) restricted to any convex body containing the unit ball mixes in O*(n^2) time from a warm start.
Speedy walk: a thought experiment
- Take the sequence of points visited by the ball walk:
w_0, w_1, w_2, w_3, …, w_i, w_{i+1}, w_{i+2}, …
- Consider the subsequence of "proper" attempts that stay inside K
- This subsequence is a Markov chain and is rapidly mixing from any point
- For a warm start, the total number of steps is only a constant factor higher
Gaussian volume
- Theorem [Cousins-V.13]. The Gaussian volume of a convex body K containing the unit ball can be estimated in time O*(n^3).
- No need to adjust for isotropy!
- Each step samples a 1-d Gaussian restricted to an interval
- Can we use this to compute the volume?
Gaussian Cooling
- f_i(x) = e^(−‖x‖^2/(2σ_i^2))
- σ_0^2 = 1/n;  σ_m = O(D).
- Estimate ∫f_{i+1}/∫f_i using samples drawn according to f_i.
- For σ_i^2 ≤ 1, set σ_i^2 = σ_{i−1}^2 (1 + 1/√n)
- For σ_i^2 > 1, set σ_i^2 = σ_{i−1}^2 (1 + σ_{i−1}/√n)
Gaussian Cooling
- f_i(x) = e^(−‖x‖^2/(2σ_i^2))
- For σ_i^2 ≤ 1, we set σ_i^2 = σ_{i−1}^2 (1 + 1/√n)
- Sampling time: n^2; #phases and #samples per phase: √n each
- So, total time = n^2 × √n × √n = n^3
Gaussian Cooling
- f_i(x) = e^(−‖x‖^2/(2σ_i^2))
- For σ_i^2 > 1, we set σ_i^2 = σ_{i−1}^2 (1 + σ_{i−1}/√n)
- Sampling time: σ^2 n^2 (too much??)
- #phases to double σ is √n/σ
- #samples per phase is also √n/σ
- So, the total time to double σ is (√n/σ) × (√n/σ) × σ^2 n^2 = n^3 !!!
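The two-regime schedule above can be sketched directly; the dimension and target below are illustrative assumptions (n = 100, flattening until σ^2 reaches D^2 = 100):

```python
import math

def cooling_schedule(n, target_sigma2):
    """Phase schedule of Gaussian cooling: sigma^2 grows by a factor (1 + 1/sqrt(n))
    while sigma^2 <= 1, and by (1 + sigma/sqrt(n)) afterwards."""
    sigma2 = 1.0 / n                  # start with a concentrated Gaussian
    schedule = [sigma2]
    while sigma2 < target_sigma2:
        if sigma2 <= 1.0:
            sigma2 *= 1.0 + 1.0 / math.sqrt(n)
        else:
            sigma2 *= 1.0 + math.sqrt(sigma2) / math.sqrt(n)
        schedule.append(sigma2)
    return schedule

phases = cooling_schedule(100, 100.0)
```

In the second regime the multiplier grows with σ, so doubling σ takes ever fewer phases; this is the acceleration that keeps the total work at n^3 despite the σ^2 n^2 sampling cost.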
Variance of ratio estimator
- Why can we set σ_i^2 as high as σ_{i−1}^2 (1 + σ_{i−1}/√n)?
f(σ^2, x) = e^(−‖x‖^2/(2σ^2)) for x ∈ K
F(σ^2) = ∫_K f(σ^2, x) dx
Lemma. Let Y = f(σ^2, X)/f(σ^2/(1+α), X), where X is drawn with density proportional to f(σ^2/(1+α), ·), and α = σ/√n. Then,
E(Y) = F(σ^2)/F(σ^2/(1+α)),
E(Y^2)/E(Y)^2 = F(σ^2/(1+α)) F(σ^2/(1−α)) / F(σ^2)^2 = 1 + O(α^2 n/σ^2) = O(1).
Variance of ratio estimator
E(Y^2)/E(Y)^2 = F(σ^2/(1+α)) F(σ^2/(1−α)) / F(σ^2)^2 = 1 + O(α^2 n/σ^2)
First use localization to reduce to a 1-d inequality, for a restricted family of logconcave functions: for K ⊆ R·B_n and −R ≤ ℓ ≤ u ≤ R, let
G(σ^2) = ∫_ℓ^u (t + c)^(n−1) e^(−t^2/(2σ^2)) dt.
Then,
G(σ^2/(1+α)) G(σ^2/(1−α)) / G(σ^2)^2 = 1 + O(α^2 n/σ^2).
Variance of ratio estimator
G(σ^2/(1+α)) G(σ^2/(1−α)) / G(σ^2)^2 = 1 + O(α^2 n/σ^2),
where
E(t^2) = ∫_ℓ^u t^2 (t + c)^(n−1) e^(−t^2/(2σ^2)) dt / ∫_ℓ^u (t + c)^(n−1) e^(−t^2/(2σ^2)) dt,
and the bound follows by controlling how E(t^2) changes with σ.
Warm start
- With σ_i^2 = σ_{i−1}^2 (1 + σ_{i−1}/√n), a random point from one distribution gives a warm start for the next.
f(σ^2, x) = e^(−‖x‖^2/(2σ^2)) for x ∈ K,  F(σ^2) = ∫_K f(σ^2, x) dx
With π_i ∝ f_i = f(σ^2, ·) and π_{i+1} ∝ f_{i+1} = f(σ^2(1+α), ·), for α = σ/√n:
M = E_{π_i}(π_i(x)/π_{i+1}(x)) = (∫ f_i(x)^2/f_{i+1}(x) dx · ∫ f_{i+1}) / (∫ f_i)^2
= F(σ^2(1+α)/(1+2α)) · F(σ^2(1+α)) / F(σ^2)^2 = O(1). Same lemma!
Gaussian Cooling [CV14]
- Accelerated annealing: the rate 1 + σ/√n is best possible.
- Thm. The volume of any well-rounded convex body K can be estimated using O*(n^3) membership queries.
- CV algorithm: (√n/σ) × (√n/σ) × σ^2 n^2 × log n = O*(n^3)
Practical volume/integration
- Start with a concentrated Gaussian
- Run the algorithm till the Gaussian is nearly flat
- In each phase, flatten the Gaussian as much as possible while keeping the variance of the ratio of integrals bounded
- The variance can be estimated with a small constant number of samples
- If the covariance is skewed (as seen by an SVD of O(n) points), scale down the high-variance subspace
- "Adaptive" annealing (also used in [Stefankovic-Vigoda-V.] for discrete problems)
Open questions
- How true is the KLS conjecture?
Open questions
- When to stop a random walk? (how to decide if the current point is "random"?)
- How to get information before reaching stationarity?
- Faster isotropy/rounding?
To make isotropic: run for N steps; transform using the covariance; repeat.
Open questions
- How efficiently can we learn a polytope given only random points?
- With O(mn) points, one cannot "see" the structure, but there is enough information to estimate the polytope! Algorithms?
- For convex bodies:
  - [KOS][GR] need 2^Ω(√n) points
  - [Eldan] need 2^√n points even to estimate the volume!
Open questions
- Can we estimate the volume of an explicit polytope Ax ≤ b in deterministic polynomial time?