+ All Categories
Home > Documents > Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main...

Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main...

Date post: 18-Oct-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
64
The innite swapping limit for parallel tempering Paul Dupuis Division of Applied Mathematics Brown University co-PIs J.D. Doll, J. Gubernatis, H. Wang October, 2011 Paul Dupuis (Brown University) October, 2011
Transcript
Page 1: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The in�nite swapping limit for parallel tempering

Paul Dupuis

Division of Applied MathematicsBrown University

co-PIs J.D. Doll, J. Gubernatis, H. Wang

October, 2011

Paul Dupuis (Brown University) October, 2011

Page 2: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Overview

Main theme of the proposal:The use of large deviations ideas to design accelerated schemes for MonteCarlo and related topics (e.g., molecular dynamics)

Topic of today�s talk:An application of the Donsker-Varadhan theory to parallel tempering

Initial goal and �nal outcome:Intention was to use large deviation theory to choose parameters in paralleltempering, but ended up constructing a new scheme we call in�niteswapping

Paul Dupuis (Brown University) October, 2011

Page 3: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Overview

Main theme of the proposal:The use of large deviations ideas to design accelerated schemes for MonteCarlo and related topics (e.g., molecular dynamics)

Topic of today�s talk:An application of the Donsker-Varadhan theory to parallel tempering

Initial goal and �nal outcome:Intention was to use large deviation theory to choose parameters in paralleltempering, but ended up constructing a new scheme we call in�niteswapping

Paul Dupuis (Brown University) October, 2011

Page 4: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Overview

Main theme of the proposal:The use of large deviations ideas to design accelerated schemes for MonteCarlo and related topics (e.g., molecular dynamics)

Topic of today�s talk:An application of the Donsker-Varadhan theory to parallel tempering

Initial goal and �nal outcome:Intention was to use large deviation theory to choose parameters in paralleltempering, but ended up constructing a new scheme we call in�niteswapping

Paul Dupuis (Brown University) October, 2011

Page 5: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Outline

Problem formulation

Large deviations for the empirical measure

The idea of parallel tempering

Large deviation rate for parallel tempering

In�nite swapping limit

Implementation issues and partial in�nite swapping

Numerical examples

Remark on function minimization

Paul Dupuis (Brown University) October, 2011

Page 6: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

A representative example. Compute the average potential energy andother functionals with respect to a Gibbs measure of the form

�(dx) = e�V (x)=�dx.Z (�);

and V is the potential of a (relatively) complex physical system.

Acorresponding process model is

dX = �rV (X )dt +p2�dW ; X (0) = x0;

where � is a �xed temperature. Monte Carlo approximation isZRdf (x)�T (dx) =

1T

Z T

0f (X (t))dt:

When parts of Rd communicate poorly under dynamics of X , theapproximation can be extremely slow to converge (the rare event problem).Hence interest in accelerated Monte Carlo.

Simulations are done using a discrete time model.

Paul Dupuis (Brown University) October, 2011

Page 7: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

A representative example. Compute the average potential energy andother functionals with respect to a Gibbs measure of the form

�(dx) = e�V (x)=�dx.Z (�);

and V is the potential of a (relatively) complex physical system. Acorresponding process model is

dX = �rV (X )dt +p2�dW ; X (0) = x0;

where � is a �xed temperature.

Monte Carlo approximation isZRdf (x)�T (dx) =

1T

Z T

0f (X (t))dt:

When parts of Rd communicate poorly under dynamics of X , theapproximation can be extremely slow to converge (the rare event problem).Hence interest in accelerated Monte Carlo.

Simulations are done using a discrete time model.

Paul Dupuis (Brown University) October, 2011

Page 8: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

A representative example. Compute the average potential energy andother functionals with respect to a Gibbs measure of the form

�(dx) = e�V (x)=�dx.Z (�);

and V is the potential of a (relatively) complex physical system. Acorresponding process model is

dX = �rV (X )dt +p2�dW ; X (0) = x0;

where � is a �xed temperature. Monte Carlo approximation isZRdf (x)�T (dx) =

1T

Z T

0f (X (t))dt:

When parts of Rd communicate poorly under dynamics of X , theapproximation can be extremely slow to converge (the rare event problem).Hence interest in accelerated Monte Carlo.

Simulations are done using a discrete time model.

Paul Dupuis (Brown University) October, 2011

Page 9: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

A representative example. Compute the average potential energy andother functionals with respect to a Gibbs measure of the form

�(dx) = e�V (x)=�dx.Z (�);

and V is the potential of a (relatively) complex physical system. Acorresponding process model is

dX = �rV (X )dt +p2�dW ; X (0) = x0;

where � is a �xed temperature. Monte Carlo approximation isZRdf (x)�T (dx) =

1T

Z T

0f (X (t))dt:

When parts of Rd communicate poorly under dynamics of X , theapproximation can be extremely slow to converge (the rare event problem).Hence interest in accelerated Monte Carlo.

Simulations are done using a discrete time model.Paul Dupuis (Brown University) October, 2011

Page 10: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

At low temperatures the main contribution comes from the globalminimum and �important� local minima, which often do not communicatewell.

To extract useful information requires large scale computing, e.g., 1010

times steps for a more complex model than X itself.

Dimensions on the order of 10,000, but can be much larger.

Paul Dupuis (Brown University) October, 2011

Page 11: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

At low temperatures the main contribution comes from the globalminimum and �important� local minima, which often do not communicatewell.

To extract useful information requires large scale computing, e.g., 1010

times steps for a more complex model than X itself.

Dimensions on the order of 10,000, but can be much larger.

Paul Dupuis (Brown University) October, 2011

Page 12: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

At low temperatures the main contribution comes from the globalminimum and �important� local minima, which often do not communicatewell.

To extract useful information requires large scale computing, e.g., 1010

times steps for a more complex model than X itself.

Dimensions on the order of 10,000, but can be much larger.

Paul Dupuis (Brown University) October, 2011

Page 13: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

An example of such is the Lennard-Jones cluster of 38 atoms. Thispotential has � 1014 local minima.

The lowest 150 and their�connectivity�graph are as in the �gure (taken from Doyle, Miller &Wales, JCP, 1999).

Paul Dupuis (Brown University) October, 2011

Page 14: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

An example of such is the Lennard-Jones cluster of 38 atoms. Thispotential has � 1014 local minima. The lowest 150 and their�connectivity�graph are as in the �gure (taken from Doyle, Miller &Wales, JCP, 1999).

Paul Dupuis (Brown University) October, 2011

Page 15: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Problem formulation

Problems with the same �rare event� issue:

Bayesian statistics

Pattern theory (image analysis and related estimation)

Counting problems in computer science

Computational physics/chemistry (condensed matter, quantumsystems)

Computational biology (motif sampling)

...

Paul Dupuis (Brown University) October, 2011

Page 16: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviations for the empirical measure

ConsiderdX = b(X )dt +

p2�dW ; X (0) = x0

and for large T the empirical or occupation measure

�T (dx) =1T

Z T

0�X (t)(dx)dt:

Then considered as taking values in P(Rd ),

Pn�T � �

o� e�TI (�):

Here I (�) > 0 if �(dx) 6= �(x)dx , and it takes a fairly explicit form. Wewill use the LD rate I , where a larger rate implies faster convergence.

Paul Dupuis (Brown University) October, 2011

Page 17: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviations for the empirical measure

ConsiderdX = b(X )dt +

p2�dW ; X (0) = x0

and for large T the empirical or occupation measure

�T (dx) =1T

Z T

0�X (t)(dx)dt:

Then considered as taking values in P(Rd ),

Pn�T � �

o� e�TI (�):

Here I (�) > 0 if �(dx) 6= �(x)dx , and it takes a fairly explicit form. Wewill use the LD rate I , where a larger rate implies faster convergence.

Paul Dupuis (Brown University) October, 2011

Page 18: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The idea of parallel tempering

Setting of two temperatures.

Besides � 1 = � , introduce highertemperature � 2 > � 1. Thus

dX1 = �rV (X1)dt +p2� 1dW1

dX2 = �rV (X2)dt +p2� 2dW2;

with W1 and W2 independent. Then one obtains a Monte Carloapproximation to

�(x1; x2) = e�V (x1)�1 e�

V (x2)�2

�Z (� 1; � 2):

Paul Dupuis (Brown University) October, 2011

Page 19: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The idea of parallel tempering

Setting of two temperatures. Besides � 1 = � , introduce highertemperature � 2 > � 1.

Thus

dX1 = �rV (X1)dt +p2� 1dW1

dX2 = �rV (X2)dt +p2� 2dW2;

with W1 and W2 independent. Then one obtains a Monte Carloapproximation to

�(x1; x2) = e�V (x1)�1 e�

V (x2)�2

�Z (� 1; � 2):

Paul Dupuis (Brown University) October, 2011

Page 20: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The idea of parallel tempering

Setting of two temperatures. Besides � 1 = � , introduce highertemperature � 2 > � 1. Thus

dX1 = �rV (X1)dt +p2� 1dW1

dX2 = �rV (X2)dt +p2� 2dW2;

with W1 and W2 independent. Then one obtains a Monte Carloapproximation to

�(x1; x2) = e�V (x1)�1 e�

V (x2)�2

�Z (� 1; � 2):

Paul Dupuis (Brown University) October, 2011

Page 21: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The idea of parallel tempering

Now introduce swaps (Swendsen, Geyer), i.e., X1 and X2 exchangelocations with state dependent intensity

ag(x1; x2) = a�1 ^ �(x2; x1)

�(x1; x2)

�= a

�1 ^ e�

hV (x1)�1

+V (x2)�2

i+hV (x2)�1

+V (x1)�2

i�;

with a > 0 the �swap rate.�

Paul Dupuis (Brown University) October, 2011

Page 22: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The idea of parallel tempering

Now have a Markov jump-di¤usion. Easy to check: owing to detailedbalance still have

�(x1; x2) = e�V (x1)�1 e�

V (x2)�2

�Z (� 1; � 2):

Increased temperature � higher di¤usivity of X a2� easier communication for X a2� passed to X a1 via swaps

Natural question: how does convergence depend on a and � 2? We focuson a (may consider � 2 in future).Implementation uses discrete time version, and conventional wisdom isthat swap rate should be chosen so that

one swap attempt ! six steps discrete dynamics

Paul Dupuis (Brown University) October, 2011

Page 23: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The idea of parallel tempering

Now have a Markov jump-di¤usion. Easy to check: owing to detailedbalance still have

�(x1; x2) = e�V (x1)�1 e�

V (x2)�2

�Z (� 1; � 2):

Increased temperature � higher di¤usivity of X a2� easier communication for X a2� passed to X a1 via swaps

Natural question: how does convergence depend on a and � 2? We focuson a (may consider � 2 in future).Implementation uses discrete time version, and conventional wisdom isthat swap rate should be chosen so that

one swap attempt ! six steps discrete dynamics

Paul Dupuis (Brown University) October, 2011

Page 24: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The idea of parallel tempering

Now have a Markov jump-di¤usion. Easy to check: owing to detailedbalance still have

�(x1; x2) = e�V (x1)�1 e�

V (x2)�2

�Z (� 1; � 2):

Increased temperature � higher di¤usivity of X a2� easier communication for X a2� passed to X a1 via swaps

Natural question: how does convergence depend on a and � 2? We focuson a (may consider � 2 in future).

Implementation uses discrete time version, and conventional wisdom isthat swap rate should be chosen so that

one swap attempt ! six steps discrete dynamics

Paul Dupuis (Brown University) October, 2011

Page 25: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

The idea of parallel tempering

Now have a Markov jump-di¤usion. Easy to check: owing to detailedbalance still have

�(x1; x2) = e�V (x1)�1 e�

V (x2)�2

�Z (� 1; � 2):

Increased temperature � higher di¤usivity of X a2� easier communication for X a2� passed to X a1 via swaps

Natural question: how does convergence depend on a and � 2? We focuson a (may consider � 2 in future).Implementation uses discrete time version, and conventional wisdom isthat swap rate should be chosen so that

one swap attempt ! six steps discrete dynamics

Paul Dupuis (Brown University) October, 2011

Page 26: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation rate for parallel tempering

What does the Donsker-Varadhan theory say?

Suppose � given such that

�(x1; x2) =d�d�(x1; x2)

is smooth. Then we have monotonic form

I a(�) = J0(�) + aJ1(�)

where J0 is the rate for �no swap�dynamics and

J1(�) =ZRd�Rd

g(x1; x2)`

s�(x2; x1)�(x1; x2)

!�(dx1dx2)

with` (z) = z log z � z + 1 = 0 i¤ z = 1:

Paul Dupuis (Brown University) October, 2011

Page 27: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation rate for parallel tempering

What does the Donsker-Varadhan theory say? Suppose � given such that

�(x1; x2) =d�d�(x1; x2)

is smooth.

Then we have monotonic form

I a(�) = J0(�) + aJ1(�)

where J0 is the rate for �no swap�dynamics and

J1(�) =ZRd�Rd

g(x1; x2)`

s�(x2; x1)�(x1; x2)

!�(dx1dx2)

with` (z) = z log z � z + 1 = 0 i¤ z = 1:

Paul Dupuis (Brown University) October, 2011

Page 28: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation rate for parallel tempering

What does the Donsker-Varadhan theory say? Suppose � given such that

�(x1; x2) =d�d�(x1; x2)

is smooth. Then we have monotonic form

I a(�) = J0(�) + aJ1(�)

where J0 is the rate for �no swap�dynamics and

J1(�) =ZRd�Rd

g(x1; x2)`

s�(x2; x1)�(x1; x2)

!�(dx1dx2)

with` (z) = z log z � z + 1 = 0 i¤ z = 1:

Paul Dupuis (Brown University) October, 2011

Page 29: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation rate for parallel tempering

What does the Donsker-Varadhan theory say? Suppose � given such that

�(x1; x2) =d�d�(x1; x2)

is smooth. Then we have monotonic form

I a(�) = J0(�) + aJ1(�)

where J0 is the rate for �no swap�dynamics and

J1(�) =ZRd�Rd

g(x1; x2)`

s�(x2; x1)�(x1; x2)

!�(dx1dx2)

with` (z) = z log z � z + 1 = 0 i¤ z = 1:

Paul Dupuis (Brown University) October, 2011

Page 30: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

Rate for low temperature marginal. By contraction principle, forprobability measure

I a1 ( ) = inf fI a(�) : �rst marginal of � is g :

If (dx1) 6= �1(dx1) = e�V (x1)�1 dx1

�Z (� 1), then for a 2 (0;1)

I a1 ( ) > I01 ( )

andI a1 ( ) " some �nite limit.

Paul Dupuis (Brown University) October, 2011

Page 31: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

Rate for low temperature marginal. By contraction principle, forprobability measure

I a1 ( ) = inf fI a(�) : �rst marginal of � is g :

If (dx1) 6= �1(dx1) = e�V (x1)�1 dx1

�Z (� 1), then for a 2 (0;1)

I a1 ( ) > I01 ( )

andI a1 ( ) " some �nite limit.

Paul Dupuis (Brown University) October, 2011

Page 32: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

This suggests one consider the in�nite swapping limit a!1, except

if a is large but �nite almost all computational e¤ort is directed atswap attempts, rather than di¤usion dynamics,

if a!1 then limit process not well de�ned (no tightness).

An alternative perspective: rather than swap particles, swaptemperatures, and use �weighted�empirical measure.

Particle swapping. Process:

(X a1 ;Xa2 ) ;

Approximation to �(dx):

1T

Z T

0�(X a1 ;X a2 )

(dx)dt

Paul Dupuis (Brown University) October, 2011

Page 33: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

This suggests one consider the in�nite swapping limit a!1, except

if a is large but �nite almost all computational e¤ort is directed atswap attempts, rather than di¤usion dynamics,

if a!1 then limit process not well de�ned (no tightness).

An alternative perspective: rather than swap particles, swaptemperatures, and use �weighted�empirical measure.

Particle swapping. Process:

(X a1 ;Xa2 ) ;

Approximation to �(dx):

1T

Z T

0�(X a1 ;X a2 )

(dx)dt

Paul Dupuis (Brown University) October, 2011

Page 34: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

This suggests one consider the in�nite swapping limit a!1, except

if a is large but �nite almost all computational e¤ort is directed atswap attempts, rather than di¤usion dynamics,

if a!1 then limit process not well de�ned (no tightness).

An alternative perspective: rather than swap particles, swaptemperatures, and use �weighted�empirical measure.

Particle swapping. Process:

(X a1 ;Xa2 ) ;

Approximation to �(dx):

1T

Z T

0�(X a1 ;X a2 )

(dx)dt

Paul Dupuis (Brown University) October, 2011

Page 35: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

This suggests one consider the in�nite swapping limit a!1, except

if a is large but �nite almost all computational e¤ort is directed atswap attempts, rather than di¤usion dynamics,

if a!1 then limit process not well de�ned (no tightness).

An alternative perspective: rather than swap particles, swaptemperatures, and use �weighted�empirical measure.

Particle swapping. Process:

(X a1 ;Xa2 ) ;

Approximation to �(dx):

1T

Z T

0�(X a1 ;X a2 )

(dx)dt

Paul Dupuis (Brown University) October, 2011

Page 36: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

This suggests one consider the in�nite swapping limit a!1, except

if a is large but �nite almost all computational e¤ort is directed atswap attempts, rather than di¤usion dynamics,

if a!1 then limit process not well de�ned (no tightness).

An alternative perspective: rather than swap particles, swaptemperatures, and use �weighted�empirical measure.

Particle swapping. Process:

(X a1 ;Xa2 ) ;

Approximation to �(dx):

1T

Z T

0�(X a1 ;X a2 )

(dx)dt

Paul Dupuis (Brown University) October, 2011

Page 37: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

This suggests one consider the in�nite swapping limit a!1, except

if a is large but �nite almost all computational e¤ort is directed atswap attempts, rather than di¤usion dynamics,

if a!1 then limit process not well de�ned (no tightness).

An alternative perspective: rather than swap particles, swaptemperatures, and use �weighted�empirical measure.

Particle swapping. Process:

(X a1 ;Xa2 ) ;

Approximation to �(dx):

1T

Z T

0�(X a1 ;X a2 )

(dx)dt

Paul Dupuis (Brown University) October, 2011

Page 38: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

Temperature swapping.

Process:

dY a1 = �rV (Y a1 )dt +p2r1(Z a)dW1

dY a2 = �rV (Y a2 )dt +p2r2(Z a)dW2;

where r(Z a(t)) jumps between � 1 and � 2 with intensity ag(Y a1 (t);Ya2 (t)).

Approximation to �(dx):

1T

Z T

0

h1f0g(Z

a)�(Y a1 ;Y a2 )(dx) + 1f1g(Za)�(Y a2 ;Y a1 )(dx)

idt:

Paul Dupuis (Brown University) October, 2011

Page 39: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

Temperature swapping. Process:

dY a1 = �rV (Y a1 )dt +p2r1(Z a)dW1

dY a2 = �rV (Y a2 )dt +p2r2(Z a)dW2;

where r(Z a(t)) jumps between � 1 and � 2 with intensity ag(Y a1 (t);Ya2 (t)).

Approximation to �(dx):

1T

Z T

0

h1f0g(Z

a)�(Y a1 ;Y a2 )(dx) + 1f1g(Za)�(Y a2 ;Y a1 )(dx)

idt:

Paul Dupuis (Brown University) October, 2011

Page 40: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

Temperature swapping. Process:

dY a1 = �rV (Y a1 )dt +p2r1(Z a)dW1

dY a2 = �rV (Y a2 )dt +p2r2(Z a)dW2;

where r(Z a(t)) jumps between � 1 and � 2 with intensity ag(Y a1 (t);Ya2 (t)).

Approximation to �(dx):

1T

Z T

0

h1f0g(Z

a)�(Y a1 ;Y a2 )(dx) + 1f1g(Za)�(Y a2 ;Y a1 )(dx)

idt:

Paul Dupuis (Brown University) October, 2011

Page 41: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

The advantage is a weak limit well de�ned as a!1:

dY1 = �rV (Y1)dt +p2� 1�1(Y1;Y2) + 2� 2�2(Y1;Y2)dW1

dY2 = �rV (Y2)dt +p2� 2�1(Y1;Y2) + 2� 1�2(Y1;Y2)dW2;

�T (dx) =Z T

0

��1(Y1;Y2)�(Y1;Y2) + �2(Y1;Y2)�(Y2;Y1)

�ds;

and

�1(x1; x2) =e�hV (x1)�1

+V (x2)�2

iZ�(x1; x2)

; �2(x1; x2) =e�hV (x2)�1

+V (x1)�2

iZ�(x1; x2)

:

Theorem:��Tsatis�es the large deviation principle with rate I1.

Paul Dupuis (Brown University) October, 2011

Page 42: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

The advantage is a weak limit well de�ned as a!1:

dY1 = �rV (Y1)dt +p2� 1�1(Y1;Y2) + 2� 2�2(Y1;Y2)dW1

dY2 = �rV (Y2)dt +p2� 2�1(Y1;Y2) + 2� 1�2(Y1;Y2)dW2;

�T (dx) =Z T

0

��1(Y1;Y2)�(Y1;Y2) + �2(Y1;Y2)�(Y2;Y1)

�ds;

and

�1(x1; x2) =e�hV (x1)�1

+V (x2)�2

iZ�(x1; x2)

; �2(x1; x2) =e�hV (x2)�1

+V (x1)�2

iZ�(x1; x2)

:

Theorem:��Tsatis�es the large deviation principle with rate I1.

Paul Dupuis (Brown University) October, 2011

Page 43: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Large deviation properties and the in�nite swapping limit

The advantage is a weak limit well de�ned as a!1:

dY1 = �rV (Y1)dt +p2� 1�1(Y1;Y2) + 2� 2�2(Y1;Y2)dW1

dY2 = �rV (Y2)dt +p2� 2�1(Y1;Y2) + 2� 1�2(Y1;Y2)dW2;

�T (dx) =Z T

0

��1(Y1;Y2)�(Y1;Y2) + �2(Y1;Y2)�(Y2;Y1)

�ds;

and

�1(x1; x2) =e�hV (x1)�1

+V (x2)�2

iZ�(x1; x2)

; �2(x1; x2) =e�hV (x2)�1

+V (x1)�2

iZ�(x1; x2)

:

Theorem:��Tsatis�es the large deviation principle with rate I1.

Paul Dupuis (Brown University) October, 2011

Page 44: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Applications of parallel tempering use many temperatures (e.g.,K = 30 to 50) when V is complicated to overcome barriers of alldi¤erent heights.

Swaps typically between near-neighbor only, with randomized ordeterministic schedule of which pair to try (discrete time).

Straightforward extension of in�nite swapping to K temperatures� 1 < � 2 < � � � < �K .But, coe¢ cients become complex, e.g., K ! weights, and each involvesmany calculations. Not practical if K � 7.Need for computational feasibility leads to partial in�nite swapping.

Paul Dupuis (Brown University) October, 2011

Page 45: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Applications of parallel tempering use many temperatures (e.g.,K = 30 to 50) when V is complicated to overcome barriers of alldi¤erent heights.

Swaps typically between near-neighbor only, with randomized ordeterministic schedule of which pair to try (discrete time).

Straightforward extension of in�nite swapping to K temperatures� 1 < � 2 < � � � < �K .But, coe¢ cients become complex, e.g., K ! weights, and each involvesmany calculations. Not practical if K � 7.Need for computational feasibility leads to partial in�nite swapping.

Paul Dupuis (Brown University) October, 2011

Page 46: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Applications of parallel tempering use many temperatures (e.g.,K = 30 to 50) when V is complicated to overcome barriers of alldi¤erent heights.

Swaps typically between near-neighbor only, with randomized ordeterministic schedule of which pair to try (discrete time).

Straightforward extension of in�nite swapping to K temperatures� 1 < � 2 < � � � < �K .

But, coe¢ cients become complex, e.g., K ! weights, and each involvesmany calculations. Not practical if K � 7.Need for computational feasibility leads to partial in�nite swapping.

Paul Dupuis (Brown University) October, 2011

Page 47: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Applications of parallel tempering use many temperatures (e.g.,K = 30 to 50) when V is complicated to overcome barriers of alldi¤erent heights.

Swaps typically between near-neighbor only, with randomized ordeterministic schedule of which pair to try (discrete time).

Straightforward extension of in�nite swapping to K temperatures� 1 < � 2 < � � � < �K .But, coe¢ cients become complex, e.g., K ! weights, and each involvesmany calculations. Not practical if K � 7.

Need for computational feasibility leads to partial in�nite swapping.

Paul Dupuis (Brown University) October, 2011

Page 48: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Applications of parallel tempering use many temperatures (e.g.,K = 30 to 50) when V is complicated to overcome barriers of alldi¤erent heights.

Swaps typically between near-neighbor only, with randomized ordeterministic schedule of which pair to try (discrete time).

Straightforward extension of in�nite swapping to K temperatures� 1 < � 2 < � � � < �K .But, coe¢ cients become complex, e.g., K ! weights, and each involvesmany calculations. Not practical if K � 7.Need for computational feasibility leads to partial in�nite swapping.

Paul Dupuis (Brown University) October, 2011

Page 49: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Partial in�nite swapping. Given any subgroup of set of permutations onecan construct a corresponding partial in�nite swapping dynamic.

Subgroup property means the corresponding dynamic can be realized asweak limit of a parallel tempering algorithm.

Examples are Dynamics A and B in �gure:

Paul Dupuis (Brown University) October, 2011

Page 50: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Partial in�nite swapping. Given any subgroup of set of permutations onecan construct a corresponding partial in�nite swapping dynamic.

Subgroup property means the corresponding dynamic can be realized asweak limit of a parallel tempering algorithm.

Examples are Dynamics A and B in �gure:

Paul Dupuis (Brown University) October, 2011

Page 51: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Partial in�nite swapping. Given any subgroup of set of permutations onecan construct a corresponding partial in�nite swapping dynamic.

Subgroup property means the corresponding dynamic can be realized asweak limit of a parallel tempering algorithm.

Examples are Dynamics A and B in �gure:

Paul Dupuis (Brown University) October, 2011

Page 52: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Using partial in�nite swapping one can control the complexity of thecoe¢ cients and algorithm.

If one alternates between subgroups that generate full group ofpermutations, one approximates full in�nite swapping (convergencetheorem in continuous time).

However, particles lose their physical identity in in�nite swapping limit(partial or otherwise). Cannot simply alternate�need a proper�hando¤� rule.

Can identify the �distributionally correct�hando¤ rule, using thatpartial swappings are limits of �physically meaningful�processes.

Paul Dupuis (Brown University) October, 2011

Page 53: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Using partial in�nite swapping one can control the complexity of thecoe¢ cients and algorithm.

If one alternates between subgroups that generate full group ofpermutations, one approximates full in�nite swapping (convergencetheorem in continuous time).

However, particles lose their physical identity in in�nite swapping limit(partial or otherwise). Cannot simply alternate�need a proper�hando¤� rule.

Can identify the �distributionally correct�hando¤ rule, using thatpartial swappings are limits of �physically meaningful�processes.

Paul Dupuis (Brown University) October, 2011

Page 54: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Using partial in�nite swapping one can control the complexity of thecoe¢ cients and algorithm.

If one alternates between subgroups that generate full group ofpermutations, one approximates full in�nite swapping (convergencetheorem in continuous time).

However, particles lose their physical identity in in�nite swapping limit(partial or otherwise). Cannot simply alternate�need a proper�hando¤� rule.

Can identify the �distributionally correct�hando¤ rule, using thatpartial swappings are limits of �physically meaningful�processes.

Paul Dupuis (Brown University) October, 2011

Page 55: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Implementation issues and partial in�nite swapping

Using partial in�nite swapping one can control the complexity of thecoe¢ cients and algorithm.

If one alternates between subgroups that generate full group ofpermutations, one approximates full in�nite swapping (convergencetheorem in continuous time).

However, particles lose their physical identity in in�nite swapping limit(partial or otherwise). Cannot simply alternate�need a proper�hando¤� rule.

Can identify the �distributionally correct�hando¤ rule, using thatpartial swappings are limits of �physically meaningful�processes.

Paul Dupuis (Brown University) October, 2011

Page 56: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Numerical example

Relaxation study of convergence to equilibrium for LJ-38.

quantity of interest: average potential energy at various temperatures

used 45 temperatures, alternate 3�6�6�� � ��6 and 6�6�� � ��6�3 typedynamics for partial in�nite swapping

lowest 1/3 of temperatures raised to push process away fromequilibrium (low temperature components pushed away from deepminima)

then reduced to correct temperatures for 600 discrete time steps tostudy return to equilibria

repeated 2000 times, we plot averages for lowest (and hardest)temperature

Paul Dupuis (Brown University) October, 2011

Page 57: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Numerical example

Convergence to equilibrium for LJ-38: parallel tempering versus partialin�nite swapping, only lowest temperature illustrated.

For this system, reduction relative to best parallel tempering: 1010 reducedto 106 steps with additional overhead of approximately 10%.Paul Dupuis (Brown University) October, 2011

Page 58: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

References

Mathematical paper:

�On the in�nite swapping limit for parallel tempering�, D, Liu,Plattner and Doll, preprint, to be submitted to SIAM J. on MMS

Applications paper (lots of numerical data):

�An in�nite swapping approach to the rare-event sampling problem�,Plattner, Doll, D, Wang, Liu and Gubernatis, to appear in J. ofChem. Phy.

Paul Dupuis (Brown University) October, 2011

Page 59: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Concluding remarks

Function minimization attributes:

The in�nite swapping process (Y1;Y2) has symmetric dynamics,interesting qualitative properties w.r.t. function minimization.

Paul Dupuis (Brown University) October, 2011

Page 60: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Concluding remarks

Convergence to equilibium, single sample, 12 lowest temperatures:

Paul Dupuis (Brown University) October, 2011

Page 61: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Concluding remarks

Future work.

Application to problems where parallel tempering fails

Selection of �best�partial in�nite swapping approximations.

Use of other parameters besides temperature.

Better quantitative understanding of rate of marginals such as I11 ( )

Paul Dupuis (Brown University) October, 2011

Page 62: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Concluding remarks

Future work.

Application to problems where parallel tempering fails

Selection of �best�partial in�nite swapping approximations.

Use of other parameters besides temperature.

Better quantitative understanding of rate of marginals such as I11 ( )

Paul Dupuis (Brown University) October, 2011

Page 63: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Concluding remarks

Future work.

Application to problems where parallel tempering fails

Selection of �best�partial in�nite swapping approximations.

Use of other parameters besides temperature.

Better quantitative understanding of rate of marginals such as I11 ( )

Paul Dupuis (Brown University) October, 2011

Page 64: Paul Dupuis - csm.ornl.gov€¦ · Paul Dupuis (Brown University) October, 2011. Overview Main theme of the proposal: The use of large deviations ideas to design accelerated schemes

Concluding remarks

Future work.

Application to problems where parallel tempering fails

Selection of �best�partial in�nite swapping approximations.

Use of other parameters besides temperature.

Better quantitative understanding of rate of marginals such as I11 ( )

Paul Dupuis (Brown University) October, 2011


Recommended